Search Results

Found 7,788 repositories(showing 30)

ColossalAI

hpcaitech

💚100

Making large AI models cheaper, faster and more accessible

41.4k

4.5k

Apache-2.0

Python

Updated 4 hours ago

aibig-modeldata-parallelism+9

OpenVoice

myshell-ai

💚100

Instant voice cloning by MIT and MyShell. Audio foundation model.

36.2k

4.0k

MIT

Python

Updated 6 hours ago

text-to-speechttsvoice-clone+1

claude-code-router

musistudio

💚100

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

31.8k

2.5k

MIT

TypeScript

Updated 2 minutes ago

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

27.0k

2.8k

Apache-2.0

TypeScript

Updated 44 minutes ago

ai-artartificial-intelligencegenerative-art+10

LLaVA

haotian-liu

💚100

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

24.7k

2.8k

Apache-2.0

Python

Updated 57 minutes ago

chatbotchatgptfoundation-models+10

unilm

microsoft

💚100

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

22.1k

2.7k

MIT

Python

Updated 9 hours ago

beitbeit-3bitnet+17

Janus

deepseek-ai

💚95

Janus-Series: Unified Multimodal Understanding and Generation Models

17.7k

2.2k

MIT

Python

Updated 36 minutes ago

any-to-anyfoundation-modelsllm+3

timesfm

google-research

💚94

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

15.7k

1.4k

Apache-2.0

Python

Updated 7 minutes ago

Kronos

shiyu-coder

💚97

Kronos: A Foundation Model for the Language of Financial Markets

11.8k

2.5k

MIT

Python

Updated 1 minute ago

seamless_communication

facebookresearch

💛88

Foundational Models for State-of-the-Art Speech and Text Translation

11.8k

1.2k

NOASSERTION

Jupyter Notebook

Updated 9 hours ago

moshi

kyutai-labs

💛84

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

10.0k

929

Apache-2.0

Python

Updated 1 hour ago

pyod

yzhao062

💚94

A Python Library for Outlier and Anomaly Detection on Tabular, Text, and Image Data

9.8k

1.5k

BSD-2-Clause

Python

Updated 17 hours ago

anomalyanomaly-detectionautoencoder+16

notebooks

roboflow

💛89

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM 3, and Qwen3-VL.

9.3k

1.4k

Jupyter Notebook

Updated 6 hours ago

automatic-labeling-systemcomputer-visiondeep-learning+17

LMFlow

OptimalScale

💛82

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8.5k

831

Apache-2.0

Python

Updated 8 hours ago

chatgptdeep-learninginstruction-following+4

Depth-Anything

LiheYoung

💛84

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

8.1k

611

Apache-2.0

Python

Updated 14 hours ago

depth-estimationimage-synthesismetric-depth-estimation+1

higgs-audio

boson-ai

💛84

Text-audio foundation model from Boson AI

8.0k

616

Apache-2.0

Python

Updated 8 hours ago

Depth-Anything-V2

DepthAnything

💛86

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

7.9k

804

Apache-2.0

Python

Updated 8 hours ago

monocular-depth-estimation

Qwen-Image

QwenLM

💛78

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

7.7k

476

Apache-2.0

Python

Updated 3 hours ago

Isaac-GR00T

NVIDIA

💛88

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

6.6k

1.1k

NOASSERTION

Jupyter Notebook

Updated 1 hour ago

data-juicer

datajuicer

💛80

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

6.2k

356

Apache-2.0

Python

Updated 8 hours ago

datadata-analysisdata-pipeline+11

YuE

multimodal-art-projection

💛78

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

6.1k

726

Apache-2.0

Python

Updated 9 hours ago

aiaudio-generationdeep-learning+8

TabPFN

PriorLabs

💛82

⚡ TabPFN: Foundation Model for Tabular Data ⚡

6.0k

613

NOASSERTION

Python

Updated 3 hours ago

data-sciencefoundation-modelsmachine-learning+2

chronos-forecasting

amazon-science

💛76

Chronos: Pretrained Models for Time Series Forecasting

5.1k

606

Apache-2.0

Python

Updated 3 hours ago

artificial-intelligenceforecastingfoundation-models+10

Kimi-Audio

MoonshotAI

🧡68

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

4.6k

343

Python

Updated 1 hour ago

llm-foundry

mosaicml

💛80

LLM training code for Databricks foundation models

4.4k

587

Apache-2.0

Python

Updated 8 hours ago

deep-learningllmneural-networks+2

star-vector

joanrod

💛77

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.

4.3k

240

Apache-2.0

Python

Updated 9 hours ago

llmmultimodal-large-language-modelssvg+1

GLM-4.5

zai-org

💛74

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

4.3k

448

Apache-2.0

Python

Updated 5 hours ago

agentglmllm+2

ACE-Step

ace-step

💛75

ACE-Step: A Step Towards Music Generation Foundation Model

4.3k

538

Apache-2.0

Python

Updated 4 hours ago

metavoice-src

metavoiceio

💛81

Foundational model for human-like, expressive TTS

4.2k

691

Apache-2.0

Python

Updated 4 hours ago

aideep-learningpytorch+6

DeepSeek-VL

deepseek-ai

💛75

DeepSeek-VL: Towards Real-World Vision-Language Understanding

4.1k

585

MIT

Python

Updated 1 day ago

foundation-modelsvision-language-modelvision-language-pretraining

GitHub Explorer

Search Results

ColossalAI

OpenVoice

claude-code-router

InvokeAI

LLaVA

unilm

Janus

timesfm

Kronos

seamless_communication

moshi

pyod

notebooks

LMFlow

Depth-Anything

higgs-audio

Depth-Anything-V2

Qwen-Image

Isaac-GR00T

data-juicer

YuE

TabPFN

chronos-forecasting

Kimi-Audio

llm-foundry

star-vector

GLM-4.5

ACE-Step

metavoice-src

DeepSeek-VL

ColossalAI

OpenVoice

claude-code-router

InvokeAI

LLaVA

unilm

Janus

timesfm

Kronos

seamless_communication

moshi

pyod

notebooks

LMFlow

Depth-Anything

higgs-audio

Depth-Anything-V2

Qwen-Image

Isaac-GR00T

data-juicer

YuE

TabPFN

chronos-forecasting

Kimi-Audio

llm-foundry

star-vector

GLM-4.5

ACE-Step

metavoice-src

DeepSeek-VL