Back to search
Running large language models on a single GPU for throughput-oriented scenarios.
Stars
9.4k
Forks
594
Watchers
9.4k
Open Issues
58
Overall repository health assessment
No package.json found
This might not be a Node.js project
56
commits
16
commits
9
commits
5
commits
4
commits
3
commits
2
commits
1
commits
1
commits
1
commits