Back to search
OpenAI-compatible API server for running Qwen3.5-397B-A17B locally using AirLLM.
Stars
1
Forks
0
Watchers
1
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
5
commits
feat: v3.0.0 - dynamic VRAM scheduling, remove old wheel, add 3.0.0 wheel
5e974ceView on GitHubRevert "feat: auto VRAM detection for multi-layer GPU loading (v2.13.0)"
e618a32View on GitHubfeat: auto VRAM detection for multi-layer GPU loading (v2.13.0)
bfeca43View on GitHub