LLaMA inference for TencentPretrain
Stars: 99
Forks: 11
59 commits
75bc023 add gradio server.
d1bcd4e edit default top_k, top_p and temperature.
0ff6e4e add multiple chunk model loader.
7cd30d8 Merge pull request #9 from fengyh3/dev
d14c682 fix server first run.
03cbf5e fix bug of multi-gpu int8 inference
3e39cc9 update
b5b6c75
408c6d5
7c56bc9
d2c7b80 Merge pull request #8 from fengyh3/dev
c0cc36a update multi-gpu inference for server.
5b00c62
a1c42f6
d4bdd6b