A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
Stars
330
Forks
63
Watchers
330
Open Issues
49
Overall repository health assessment
No package.json found
This might not be a Node.js project
50
commits
38
commits
27
commits
19
commits
13
commits
8
commits
7
commits
6
commits
4
commits
3
commits
Add e2e smoke tests for the new datagen system (#378)
f84ff30View on GitHubrefresh cmds for compatibility with latest guidellm (#382)
e2abcebView on GitHubfix(e2e): pass --legacy-data for .pt-format dataset in finetuning sanity test (#373)
88837f2View on GitHubUpdate data generation to use vLLM extract_hidden_states (#353)
2a1443cView on GitHub"Save best" and other improvements to model checkpointing, Issue #307 (#334)
b32ae8cView on GitHub