Found 301,262 repositories(showing 30)
Significant-Gravitas
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
huggingface
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
opencv
Open Source Computer Vision Library
oobabooga
The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
mudler
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
roboflow
We write your reusable computer vision tools. 💜
huggingface
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
XTLS
Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens. An open platform for various uses.
danny-avila
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.
BVLC
Caffe: a fast open framework for deep learning.
500 AI Machine learning Deep learning Computer vision NLP Projects with code
bytedance
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
lucidrains
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
microsoft
A simple screen parsing tool towards pure vision based GUI agent
OpenBMB
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
pytorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
jbhuang0604
A curated list of awesome computer vision resources
Skyvern-AI
Automate browser based workflows with AI
screenpipe
run agents that work for you in the background based on what you do
pytorch
Datasets, Transforms and Models specific to Computer Vision
microsoft
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
kmario23
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
jacobgil
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
web-infra-dev
AI-powered, vision-driven UI automation for every platform.
google-research
No description available
getomni-ai
OCR & Document Extraction using vision models
salesforce
LAVIS - A One-stop Library for Language-Vision Intelligence
kornia
🐍 Geometric Computer Vision Library for Spatial AI
kjw0612
A curated list of deep learning resources for computer vision
microsoft
Best Practices, code samples, and documentation for Computer Vision.