Found 822 repositories (showing 30)
deepspeedai
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
huggingface
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
intel
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
InternLM
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
EleutherAI
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
deepspeedai
Example models using DeepSpeed
tencentmusic
cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. Full MLOps algorithm pipeline, compute-rental platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, automated data labeling, SFT finetuning / reward-model / reinforcement-learning training for large models such as DeepSeek, multi-node large-model inference via vLLM/Ollama/MindIE, private knowledge bases, and an AI model marketplace. Supports domestic CPU/GPU/NPU (Ascend ecosystem), RDMA, and distributed frameworks such as PyTorch/TF/MXNet/DeepSpeed/Paddle/ColossalAI/Horovod/Ray/Volcano.
erew123
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low-VRAM support, DeepSpeed, a narrator, model finetuning, custom models, and WAV file maintenance. It can also be used with third-party software via JSON calls.
deepspeedai
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
data-infra
cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform / MaaS / MLOps / training-and-inference platform. Full algorithm pipeline, compute-rental platform, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, vGPU virtualization, cloud-edge-device collaboration, edge computing, automated labeling platform, SFT finetuning / reward-model / reinforcement-learning training for large models such as DeepSeek, multi-node large-model inference via vLLM/Ollama/MindIE, private knowledge bases with LLMOps agents, and an AI model marketplace. Supports scheduling of domestic heterogeneous compute (Ascend / Cambricon / Hygon / Moore Threads / MetaX), IB/RoCE/RDMA, and distributed frameworks such as PyTorch/DeepSpeed/ColossalAI/Ray.
PKU-Alignment
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
bigscience-workshop
Ongoing research training transformer language models at scale, including: BERT & GPT-2
zjunlp
An open-source, knowledgeable large language model framework.
Coobiw
Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on an RTX 3090/4090 with 24GB.
LambdaLabsML
Best practices & guides on how to write distributed PyTorch training code
antgroup
GLake: optimizing GPU memory management and IO transmission.
shm007g
Large Language Models for All, 🦙 Cult and More, Stay in touch!
Xirider
Guide: Finetune GPT2-XL (1.5 billion parameters) and GPT-NEO (2.7B) on a single GPU with Huggingface Transformers using DeepSpeed
OpenMOSS
Collaborative Training of Large Language Models in an Efficient Way
liangwq
ChatGLM multi-GPU training with DeepSpeed
openpsi-project
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
X-jun-0130
DeepSpeed, LLM, Medical_Dialogue, medical large language models, pretraining, finetuning
sunzeyeah
Implementation of a Chinese ChatGPT
zv1131860787
No description available
stanleylsx
A large language model training and testing tool built on HuggingFace. Supports a web UI and terminal inference for each model, low-parameter and full-parameter training (pretraining, SFT, RM, PPO, DPO), model merging, and quantization.
HuangLK
Train LLaMA on a single A100 80GB node using 🤗 Transformers and 🚀 DeepSpeed pipeline parallelism
yongzhuo
Chinese large-model finetuning (LLM-SFT) with the math instruction dataset MWP-Instruct. Supported models: ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B. Supports LoRA, QLoRA, DeepSpeed, UI, TensorboardX; covers finetuning, inference, evaluation, APIs, etc.
bobo0810
DeepSpeed tutorial & annotated examples & study notes (efficient large-model training)
git-cloner
LLaMA-2 finetuning with DeepSpeed and LoRA
wei-potato
Train an LLM from scratch with DeepSpeed, through pretraining and SFT stages, to verify the model's ability to learn knowledge, understand language, and answer questions.