Found 17 repositories(showing 17)
NVIDIA
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
No description available
AlongWY
wheels for TransformerEngine
Eric-is-good
TransformerEngineINT8 是一个为旧世代显卡(A系列和30系及以前)打造的高性能INT8量化加速训练框架。
ksivaman
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
chirag-7
Mirror of NVIDIA/TransformerEngine
shizhengLi
TransformerEngine技术深度解析系列
zliu69
No description available
wuyufffan
TransformerEngine (TE) Development Toolkit for AMD ROCm/HIP
tangao11
main
leonardozcm
Summary Transformer Engine Update Weekly powered by AI
No description available
No description available
No description available
Accelerate HuggingFace LLaMA models with NVIDIA Transformer Engine FP8 training
No description available
Accelerate HuggingFace Gemma models with NVIDIA Transformer Engine for FP8 training and inference with KV cache support
All 17 repositories loaded