Python bindings for llama.cpp
Stars: 10.1k
Forks: 1.4k
Watchers: 10.1k
Open Issues: 706
Commit counts (labels not preserved in the source): 1.6k, 71, 18, 14, 13, 12, 9, 8, 7, 6
100b275 feat: Update llama.cpp to ggerganov/llama.cpp@f49e9178767d557a522618b16ce8694f9ddac628 (#2169)
fcd932a fix(ci): publish distinct manylinux and musllinux cpu wheels (#2165)
c670222 feat: Update llama.cpp to ggerganov/llama.cpp@c0159f9c1f874da15e94f371d136f5920b4b5335 (#2161)
5f9c231 fix(ci): reduce CUDA binary wheel size only including cubins for current arches and one PTX target for forward compatibility (#2158)
7b38c31 feat: expose attention_type parameter in Llama.__init__ (#2143)