Back to search
Refer to Youtube AndrejKarpathy's channel. Try to build a GPT from scratch.
Stars
1
Forks
0
Watchers
1
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
9
commits
finished merged matrix implementation version of multi-head attention. TODO: kv cache.
9fb8e54View on GitHubFinish dev.ipynb. Problem occurred when testing because of the wrong shape of tril. Guess it comes from torch.register_buffer.
f93e490View on GitHubFinish attention block with residual connection and batchNorm.
fd99a90View on GitHub