Building gpt-2 model from scratch
Stars
0
Forks
Watchers
Open Issues
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
23
commits
Update README.md
b96e9b8
9798aec
1dfc5f4
eeabe4b
add validation
d346652
switch fineweb dataset
be1b7e9
add distribute data parallel
4a79f9d
gradient accumulation
10b995d
weight decay
736f98a
cosime learning rate
ea25473
gradient clip
1c7e631
AdamW params
04f8a69
clean notes
d27e76a
clean print
feaa36b
nice numbers
792e79c