mnn-llm doc
Model Download
| Model | ModelScope | Hugging Face |
| --- | --- | --- |
| Qwen-VL-Chat | Q4_1 | Q4_1 |
| Baichuan2-7B-Chat | Q4_1 | Q4_1 |
| bge-large-zh | Q4_1 | Q4_1 |
| chatglm-6b | Q4_1 | Q4_1 |
| chatglm2-6b | Q4_1 | Q4_1 |
| chatglm3-6b | Q4_1 | Q4_1 |
| codegeex2-6b | Q4_1 | Q4_1 |
| deepseek-llm-7b-chat | Q4_1 | Q4_1 |
| gemma-2-2b-it | Q4_1 | Q4_1 |
| glm-4-9b-chat | Q4_1 | Q4_1 |
| gte_sentence-embedding_multilingual-base | Q4_1 | Q4_1 |
| internlm-chat-7b | Q4_1 | Q4_1 |
| Llama-2-7b-chat | Q4_1 | Q4_1 |
| Llama-3-8B-Instruct | Q4_1 | Q4_1 |
| Llama-3.2-1B-Instruct | Q4_1 | Q4_1 |
| Llama-3.2-3B-Instruct | Q4_1 | Q4_1 |
| OpenELM-1_1B-Instruct | Q4_1 | Q4_1 |
| OpenELM-270M-Instruct | Q4_1 | Q4_1 |
| OpenELM-3B-Instruct | Q8_1 | Q8_1 |
| OpenELM-450M-Instruct | Q4_1 | Q4_1 |
| phi-2 | Q4_1 | Q4_1 |
| qwen/Qwen-1_8B-Chat | Q4_1 | Q4_1 |
| Qwen-7B-Chat | Q4_1 | Q4_1 |
| Qwen1.5-0.5B-Chat | Q4_1 | Q4_1 |
| Qwen1.5-1.8B-Chat | Q4_1 | Q4_1 |
| Qwen1.5-4B-Chat | Q4_1 | Q4_1 |
| Qwen1.5-7B-Chat | Q4_1 | Q4_1 |
| Qwen2-0.5B-Instruct | Q4_1 | Q4_1 |
| Qwen2-1.5B-Instruct | Q4_1 | Q4_1 |
| Qwen2-7B-Instruct | Q4_1 | Q4_1 |
| Qwen2-VL-2B-Instruct | Q4_1 | Q4_1 |
| Qwen2-VL-7B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-0.5B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-1.5B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-3B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-7B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-Coder-1.5B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-Coder-7B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-Math-1.5B-Instruct | Q4_1 | Q4_1 |
| Qwen2.5-Math-7B-Instruct | Q4_1 | Q4_1 |
| reader-lm-0.5b | Q4_1 | Q4_1 |
| reader-lm-1.5b | Q4_1 | Q4_1 |
| TinyLlama-1.1B-Chat-v1.0 | Q4_1 | Q4_1 |
| Yi-6B-Chat | Q4_1 | Q4_1 |
| MobileLLM-125M | Q4_1 | Q4_1 |
| MobileLLM-350M | Q4_1 | Q4_1 |
| MobileLLM-600M | Q4_1 | Q4_1 |
| MobileLLM-1B | Q4_1 | Q4_1 |
| SmolLM2-135M-Instruct | Q4_1 | Q4_1 |
| SmolLM2-360M-Instruct | Q4_1 | Q4_1 |
| SmolLM2-1.7B-Instruct | Q4_1 | Q4_1 |