Back to search
52 Layers 4B(0.6B Active) MoE | Nemotron-3 Style + Teon Optimizer + Mamba-2 SSM + FP8 Training on H100
Stars
7
Forks
1
Watchers
7
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
3
commits