Found 19 repositories (showing 19)
xlite-dev
📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
chenhongyu2048
Summary of some awesome work for optimizing LLM inference
sihyeong
No description available
dongxiangjue
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large Language Model Inference-Time Self-Improvement.
zenrran4nlp
No description available
withhaotian
A curated, up-to-date paper list of awesome research on efficient LLM inference.
CaiJichang212
No description available
Curated list of LLM training and inference frameworks, tools, and resources. Covers data processing, distributed training, quantization, deployment, and monitoring.
P-r-e-m-i-u-m
enjoy
tanquangduong
Best practices, awesome libraries / frameworks for LLM inference
adityajadhav2000
No description available
zhouzx17
No description available
Joao1PNM
Explore frameworks, tools, and resources for efficient large language model training and inference, covering data processing to deployment.
YannnnnnY
A curated list for Efficient Inference of Diffusion-based Large Language Models
No description available
✨✨ Latest Advances on Efficient LLM Inference.
plll4zzx
No description available
ZJLi2013
A list of papers related to AI trends in kernels, framework design, and AIGC applications
Towards Highly Efficient Inference of Multimodal Large Language Models: A Survey