Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
Stars
246
Forks
25
Watchers
Open Issues
3
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
26
commits
2
Delete .DS_Store
9a89696
Update README.md
f663e88
Fix typo in README
0ff60da
d34b574
Add AFTLocal; Fix README with TODOs
e83b0fb
Merge pull request #4 from Datta0/main
907775e
Fix AFTLocal
a0ebad0
Add AFTLocal
2185298
Add Xavier Initialisation to learned position bias
f806620
Fix AFTFull implementation
785edc2
Update ArXiv reference in README
f2e82e8
Add AFTSimple; Remove timestep-wise summation in AFTFull
f3c95c9
Remove sequence length from API
9b07004
Convert from AFT-Simple to AFT-Full
58af6f1
4418645