GPU-accelerated Llama3.java inference in pure Java using TornadoVM.
Stars
247
Forks
33
Watchers
247
Open Issues
25
Overall repository health assessment
No package.json found
This might not be a Node.js project
657
commits
200
commits
24
commits
14
commits
5
commits
4
commits
4
commits
3
commits
3
commits
Merge pull request #101 from beehive-lab/refactor/simplify-layerplanner
8a00dedView on GitHubMerge pull request #89 from beehive-lab/ci/quarkus-langchain4j-IT
e12a4c1View on GitHubIntroduce DeepSeekR1Qwen model and integrate with Qwen2ModelLoader
a3f1450View on GitHub[refactor] Simplify and unify Activation task graph setup logic
3aa399bView on GitHub[refactor] Unify task graph setup for Logits layers and centralize shared logic into AbstractLogitsLayer
4be811aView on GitHub[refactor] Move FP16LayerPlanner and Q8_0LayerPlanner to quantized-specific subpackages
a26f2a9View on GitHub[refactor] Move QuantizedLayerPlanner to layerplanner package root-level
170db11View on GitHub[refactor] Simplify and unify layer planners by centralizing inference plan creation logic in layerplanner package
080fea4View on GitHub[refactor] Move QuantizationPlannerFactory to layerplanner package root level
dc76fdeView on GitHub[refactor] Introduce AbstractLogitsLayer to centralize shared logic for logits layers
3c09bcaView on GitHub[refactor] Move GenericLayerPlanner to layerplanner package
bf6823dView on GitHub