Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK. Python, C# and JS(WASM) bindings available.
Stars
2.1k
Forks
91
Watchers
2.1k
Open Issues
67
Overall repository health assessment
No package.json found
This might not be a Node.js project
87
commits
20
commits
5
commits
1
commits
1
commits
build: Update XNNPACK version to commit 5671db0 (#133)
181bb8bView on GitHubbindings: python: Add alternative methods that don't require numpy
2c0d301View on GitHubbuild: Download and build XNNPACK automatically and update XNNPACK version
d5e31afView on GitHubdockerfile: Fix 'python3 not found' during XNNPACK build (#127)
cfcfa88View on GitHub