Found 103,483 repositories(showing 30)
ultralytics
Ultralytics YOLO 🚀
huggingface
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
HumanSignal
Label Studio is a multi-type data labeling and annotation tool with standardized output format
WZMIAOMIAO
deep learning for image processing including classification and object-detection etc.
lucidrains
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
HumanSignal
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
microsoft
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
cvat-ai
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
albumentations-team
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
chenyuntc
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
jacobgil
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
extreme-assistant
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理
voxel51
Refine high-quality datasets and visual AI models
satellite-image-deep-learning
Techniques for deep learning with satellite & aerial imagery
OpenGVLab
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
microsoft
Best Practices, code samples, and documentation for Computer Vision.
roboflow
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM 3, and Qwen3-VL.
NirantK
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
CVHub520
Effortless data labeling with AI support from Segment Anything and other awesome models.
microsoft
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
dmlc
Gluon CV Toolkit
PaddlePaddle
A treasure chest for visual classification and recognition powered by PaddlePaddle
An absolute beginner's guide to Machine Learning and Image Classification with Neural Networks
weiaicunzai
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
quark0
Differentiable architecture search for convolutional and recurrent networks
open-mmlab
OpenMMLab Pre-training Toolbox and Benchmark
tensorflow
A library for transfer learning by reusing parts of TensorFlow models.
catalyst-team
Accelerated deep learning R&D
weiaicunzai
A curated list of deep learning image classification papers and codes
osmr
Sandbox for training deep learning networks