The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.
Stars
234
Forks
32
Watchers
234
Open Issues
0
Overall repository health assessment
No language data available
No package.json found
This might not be a Node.js project
Update and rename Self-supervised_learning_in_vision.md to Self-supervised_learning.md
b1bc986View on GitHub