Search Results

Found 8,730 repositories (showing 30)

transformers

huggingface

💚100

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

158.8k

32.7k

Apache-2.0

Python

Updated 25 minutes ago

audio, deep-learning, deepseek (+16)

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

36.6k

5.1k

Apache-2.0

Python

Updated 5 hours ago

augmix, convnext, distributed-training (+17)

vit-pytorch

lucidrains

💚100

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

25.0k

3.5k

MIT

Python

Updated 33 minutes ago

artificial-intelligence, attention-mechanism, computer-vision (+2)

Swin-Transformer

microsoft

💚100

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

15.8k

2.2k

MIT

Python

Updated 5 hours ago

ade20k, image-classification, imagenet (+5)

pytorch-grad-cam

jacobgil

💚98

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

12.7k

1.7k

MIT

Python

Updated 1 hour ago

class-activation-maps, computer-vision, deep-learning (+14)

vision_transformer

google-research

💛87

No description available

12.4k

1.5k

Apache-2.0

Jupyter Notebook

Updated 5 hours ago

dino

facebookresearch

💛83

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

7.5k

1.0k

Apache-2.0

Python

Updated 10 hours ago

Awesome-Transformer-Attention

cmhungsteve

💛75

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

5.0k

498

Updated 2 days ago

attention-mechanism, attention-mechanisms, awesome-list (+15)

Awesome-Visual-Transformer

dk-liang

💛73

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

3.6k

403

Updated 5 hours ago

detr, transformer, transformer-awesome (+3)

big_vision

google-research

💛71

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

3.4k

218

Apache-2.0

Jupyter Notebook

Updated 12 hours ago

ViT-pytorch

jeonsworld

💛76

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

2.1k

397

MIT

Jupyter Notebook

Updated 3 days ago

MambaVision

NVlabs

💛73

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

2.1k

133

NOASSERTION

Python

Updated 1 day ago

deep-learning, foundation-models, huggingface-transformers (+10)

ml-fastvit

apple

🧡68

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

2.0k

122

NOASSERTION

Python

Updated 14 hours ago

ViTPose

ViTAE-Transformer

🧡69

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

2.0k

247

Apache-2.0

Python

Updated 2 days ago

deep-learning, distillation, mae (+4)

Cream

microsoft

💛74

This is a collection of our NAS and Vision Transformer work.

1.8k

241

MIT

Python

Updated 23 hours ago

automl, efficiency, knowledge-distillation (+4)

ViLT

dandelin

💛74

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

1.5k

231

Apache-2.0

Python

Updated 2 days ago

vision-and-language

ViT-Adapter

czczup

🧡58

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

1.5k

153

NOASSERTION

Python

Updated 1 week ago

adapter, object-detection, semantic-segmentation (+1)

Transformer-in-Computer-Vision

Yangzhangcst

🧡68

A paper list of some recent Transformer-based CV works.

1.4k

151

Updated 2 hours ago

awesome, computer-vision, deep-learning (+6)

awesome-detection-transformer

IDEA-Research

🧡68

Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)

1.4k

117

Updated 2 days ago

Transformer-in-Vision

DirtyHarryLYL

🧡68

Recent Transformer-based CV and related works.

1.3k

142

Updated 4 days ago

computer-vision, deep-learning, multi-modal (+5)

T2T-ViT

yitu-opensource

🧡63

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

1.2k

176

NOASSERTION

Jupyter Notebook

Updated 1 week ago

t2t-transformer, vision-transformer, vit

ToMe

facebookresearch

🧡67

A method to increase the speed and lower the memory footprint of existing vision transformers.

1.2k

NOASSERTION

Python

Updated 3 hours ago

vit-explain

jacobgil

💛72

Explainability for Vision Transformers

1.1k

108

MIT

Python

Updated 6 days ago

deep-learning, explainable-ai, pytorch (+2)

hiera

facebookresearch

🧡67

Hiera: A fast, powerful, and simple hierarchical vision transformer.

1.1k

Apache-2.0

Python

Updated 23 hours ago

GeoSeg

WangLibo1995

💛73

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

1.1k

148

GPL-3.0

Python

Updated 10 hours ago

cnn, deep-learning, pytorch (+6)

DAT

LeapLabTHU

💛72

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

929

Apache-2.0

Python

Updated 2 days ago

deep-learning, deformable-attention, image-classification (+2)

FasterViT

NVlabs

💛72

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

911

NOASSERTION

Python

Updated 4 days ago

ade20k, backbone, coco (+10)

PyTorch-Pretrained-ViT

lukemelas

🧡57

Vision Transformer (ViT) in PyTorch

853

127

Python

Updated 2 weeks ago

how-do-vits-work

xxxnell

❤️47

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

821

Apache-2.0

Python

Updated 1 month ago

loss-landscape, pytorch, self-attention (+2)

vision-transformers-cifar10

kentaroy47

💛72

Let's train vision transformers (ViT) for cifar 10 / cifar 100!

712

137

MIT

Python

Updated 16 hours ago
