An inference runtime with support for multiple backends.
- Easy integration into different platforms via Flutter or native C++, including mobile devices.
- Supports inference on different hardware, such as the Qualcomm Hexagon NPU or general-purpose CPUs/GPUs.
- Provides an easy-to-use C API (see the sketch after this list).
- Provides an API server compatible with AI00_server (OpenAI-style API).
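A minimal sketch of what embedding the runtime through the C API could look like. Everything project-specific here is assumed: the header name `rwkv_mobile.h` and every `rwkvmobile_*` function are hypothetical placeholders, not the actual API of this repository; check the headers shipped in the repository for the real names and signatures.

```cpp
// Illustrative only: the header name and all rwkvmobile_* symbols below are
// hypothetical placeholders, not the actual API of this repository.
#include <cstdio>
#include <cstdlib>

#include "rwkv_mobile.h"  // hypothetical C API header

int main() {
    // Select a backend by name ("web-rwkv", "ncnn", "llama.cpp", "qnn", ...);
    // this naming scheme is assumed for the sketch.
    void *runtime = rwkvmobile_runtime_init_with_name("web-rwkv");
    if (!runtime) {
        std::fprintf(stderr, "failed to initialize runtime\n");
        return EXIT_FAILURE;
    }

    if (rwkvmobile_runtime_load_model(runtime, "path/to/rwkv-model") != 0) {
        std::fprintf(stderr, "failed to load model\n");
        return EXIT_FAILURE;
    }

    // Generate a short completion into a caller-provided buffer and print it.
    char output[4096] = {0};
    rwkvmobile_runtime_gen_completion(runtime, "Hello", /*max_tokens=*/64,
                                      output, sizeof(output));
    std::printf("%s\n", output);

    rwkvmobile_runtime_release(runtime);
    return EXIT_SUCCESS;
}
```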
- WebRWKV (WebGPU): Compatible with most PC graphics cards as well as macOS Metal; however, it does not work with Qualcomm's proprietary Adreno GPU driver.
- llama.cpp: Runs on Android devices with CPU inference.
- ncnn: Initial support for unquantized RWKV v6/v7 models (suitable for running tiny models everywhere).
- Qualcomm Hexagon NPU: Based on Qualcomm's QNN SDK.
- CoreML (WIP): Runs RWKV on the Apple Neural Engine, based on Apple's CoreML framework.
- To be continued...
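As a rough illustration of how the backends above map onto target platforms, here is a sketch of picking a backend name at runtime. The backend identifier strings and the selection logic are assumptions made for this example, not taken from this repository.

```cpp
// Illustrative platform-to-backend mapping; the identifier strings below
// ("qnn", "web-rwkv", "llama.cpp") are assumed for the sketch.
#include <string>

std::string pick_backend(bool has_hexagon_npu, bool has_webgpu_capable_gpu) {
    if (has_hexagon_npu)
        return "qnn";       // Qualcomm Hexagon NPU via the QNN SDK
    if (has_webgpu_capable_gpu)
        return "web-rwkv";  // WebGPU on PC GPUs, or Metal on macOS
    return "llama.cpp";     // CPU fallback, e.g. on Android devices
}
```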
- Install Rust and Cargo (required for building the web-rwkv backend)
```sh
git clone --recursive https://github.com/MollySophia/rwkv-mobile
cd rwkv-mobile && mkdir build && cd build
cmake ..
cmake --build . -j $(nproc)
```
- Better tensor abstraction for different backends
- Batch inference for all backends