mnn-llm

mnn-llm

mnn-llm是基于MNN开发的LLM推理引擎,支持目前主流的开源LLM模型。

模型导出与下载

llm模型导出onnxmnn模型请使用llm-export

modelscope模型下载:

qwen
glm
llama
phi
embedding

构建

CI构建状态:

Build Status Build Status Build Status Build Status Build Status Build Status

4. 执行

# linux/macos
./cli_demo ./Qwen2-1.5B-Instruct-MNN/config.json # cli demo
./web_demo ./Qwen2-1.5B-Instruct-MNN/config.json ../web # web ui demo

# windows
.\Debug\cli_demo.exe ./Qwen2-1.5B-Instruct-MNN/config.json
.\Debug\web_demo.exe ./Qwen2-1.5B-Instruct-MNN/config.json ../web

# android
adb push libs/*.so build/libllm.so build/cli_demo /data/local/tmp
adb push model_dir /data/local/tmp
adb shell "cd /data/local/tmp && export LD_LIBRARY_PATH=. && ./cli_demo ./Qwen2-1.5B-Instruct-MNN/config.json"