Found 7,788 repositories(showing 30)
hpcaitech
Making large AI models cheaper, faster and more accessible
myshell-ai
Instant voice cloning by MIT and MyShell. Audio foundation model.
musistudio
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
invoke-ai
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
haotian-liu
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
microsoft
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
deepseek-ai
Janus-Series: Unified Multimodal Understanding and Generation Models
google-research
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
shiyu-coder
Kronos: A Foundation Model for the Language of Financial Markets
facebookresearch
Foundational Models for State-of-the-Art Speech and Text Translation
kyutai-labs
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
yzhao062
A Python Library for Outlier and Anomaly Detection on Tabular, Text, and Image Data
roboflow
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM 3, and Qwen3-VL.
OptimalScale
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
LiheYoung
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
boson-ai
Text-audio foundation model from Boson AI
DepthAnything
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
QwenLM
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
NVIDIA
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
datajuicer
Data processing for and with foundation models! ๐ ๐ ๐ฝ โก๏ธ โก๏ธ๐ธ ๐น ๐ท
multimodal-art-projection
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
PriorLabs
โก TabPFN: Foundation Model for Tabular Data โก
amazon-science
Chronos: Pretrained Models for Time Series Forecasting
MoonshotAI
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
mosaicml
LLM training code for Databricks foundation models
joanrod
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
zai-org
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
ace-step
ACE-Step: A Step Towards Music Generation Foundation Model
metavoiceio
Foundational model for human-like, expressive TTS
deepseek-ai
DeepSeek-VL: Towards Real-World Vision-Language Understanding