Found 76,785 repositories (showing 30)
huggingface
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
ggml-org
LLM inference in C/C++
vllm-project
A high-throughput and memory-efficient inference and serving engine for LLMs
meta-llama
Inference code for Llama models
facebookresearch
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ggml-org
Port of OpenAI's Whisper model in C/C++
colinhacks
TypeScript-first schema validation with static type inference
deepspeedai
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
hpcaitech
Making large AI models cheaper, faster and more accessible
microsoft
Official inference framework for 1-bit LLMs
huggingface
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
google-ai-edge
Cross-platform, customizable ML solutions for live and streaming media.
sgl-project
SGLang is a high-performance serving framework for large language models and multimodal models.
black-forest-labs
Official inference repo for FLUX.1 models
Tencent
ncnn is a high-performance neural network inference framework optimized for mobile platforms
SYSTRAN
Faster Whisper transcription with CTranslate2
FunAudioLLM
Multilingual large voice-generation model, providing full-stack inference, training, and deployment capabilities.
microsoft
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
karpathy
Inference Llama 2 in one file of pure C
facebookresearch
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
NVIDIA
Run OpenClaw more securely inside NVIDIA OpenShell with managed inference
meta-llama
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to run the models on various provider services
cheahjs
A list of free LLM inference resources accessible via API.
mlc-ai
High-performance in-browser LLM inference engine
stas00
Machine Learning Engineering Open Book
kvcache-ai
A flexible framework for experimenting with heterogeneous LLM inference and fine-tuning optimizations
meta-llama
Inference code for CodeLlama models
lyogavin
AirLLM: 70B-model inference on a single 4GB GPU
gvergnaud
🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.
alibaba
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.