Build a tiny LLM from scratch.
Stars
18
Forks
0
Watchers
18
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project
5
commits
Add mechanisms to train.py so it can skip the pre-defined paramsif the pretrained model is provided (pretrain.py, train.py)
f37df3fView on GitHubMerge pull request #9 from Mikyx-1/vietlh/change_split_mechanism_in_pretraining
86e0aefView on GitHubChange the train/val split mechanism in pretrain.py (#8)
f74f873View on GitHubMake the pretrain.py minimal and move the tokenizer training to pretokenize.py
d2b7196View on GitHubAdd pretokenize.py to build .bin files for faster loading in pretraining stage
d69afd1View on GitHub