Deploy llama2 serving on multiple GPUs via flask
Stars
2
Forks
0
Watchers
Open Issues
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
11
commits
enrich doc
7eb3ef0
fix bug and add explanation in gunicorn config
5590eb3
support gunicorn web serve
a98800d
modify readme
6b80206
update readme
cacc020
add cpu offload
f1f1976
modify client for streaming
9c5f95a
add streaming
edf36c0
add client and distributed support
e825b21
first commit with basic server
cdec02d
Initial commit
5e97c62