Search Results

Found 1,840 repositories(showing 30)

unsloth

unslothai

💚95

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

59.3k

5.0k

Apache-2.0

Python

Updated 5 minutes ago

agentdeepseekfine-tuning+16

llmfit

AlexsJones

💚97

Hundreds of models & providers. One command to find what runs on your hardware.

21.0k

1.2k

MIT

Rust

Updated 18 minutes ago

ggufllmlocalai+3

notebooks

unslothai

💛78

250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.

5.1k

825

LGPL-3.0

Jupyter Notebook

Updated 3 hours ago

unsloth

mlx-tune

ARahim3

🧡67

Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, and Vision fine-tuning — natively on MLX. Unsloth-compatible API.

968

Apache-2.0

Python

Updated 2 hours ago

apple-silicondeep-learninghuggingface+17

deepfabric

always-further

💛72

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

851

Apache-2.0

Python

Updated 1 day ago

agentsaidata-science+14

transformers-qwen3-moe-fused

woct0rdho

🧡65

Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bnb 4-bit quant, Unsloth. Also possible to train LoRA over GGUF

247

Apache-2.0

Python

Updated 14 hours ago

unsloth-zoo

unslothai

🧡53

Utils for Unsloth https://github.com/unslothai/unsloth

229

233

LGPL-3.0

Python

Updated 20 hours ago

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.

200

Python

Updated 20 hours ago

apple-siliconclaude-codedpo+10

Vodalus-Expert-LLM-Forge

severian42

🧡55

Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.

194

Jupyter Notebook

Updated 1 week ago

ToolBrain

🧡65

A framework for agentic tool use training with reinforcement learning

163

Python

Updated 5 days ago

agentic-aidpofine-tuning+7

TuneKit

riyanshibohra

🧡55

Upload your data → Get a fine-tuned SLM. Free.

142

MIT

Python

Updated 1 week ago

fine-tuningllmllms+4

Indic-gemma-7b-Navarasa

TeluguLLMLabs

🧡60

Repository for fine-tuning gemma models using unsloth for indic languages

Jupyter Notebook

Updated 1 day ago

llm-finetuning-resources

vossenwout

🧡65

Unsloth fine-tuning resources.

Jupyter Notebook

Updated 3 days ago

vlm-grpo

GAD-cell

❤️40

An implementation of GRPO for Unsloth's VLMs training

Python

Updated 1 month ago

grpogrpotrainerhuggingface+4

Unsloth-Windows-fineTuning-Qwen2

v3ucn

🧡65

Unsloth框架在Windows平台微调训练Qwen2大模型，非WSL

Python

Updated 11 hours ago

unsloth-5090-multiple

thad0ctor

🧡65

unsloth-5090-multiple

Python

Updated 3 days ago

MirrorFlow

qqqqqf-q

🧡65

从对话数据到训练:数字分身 + 模型蒸馏 From Dialogue Data to Training Closed-Loop: Digital Twin + Model Distillation

Apache-2.0

Python

Updated 3 days ago

4oaiai-avatar+13

grpo_unsloth_docker

ArturTanona

❤️25

No description available

MIT

Python

Updated 11 months ago

oreilly-pytorch-dl

sinanuozdemir

❤️45

Code for Deep Learning for Modern AI

Jupyter Notebook

Updated 2 months ago

bertclipdeep-learning+12

MedCoT-7B

Breeze648

💛70

本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调，通过 QLoRA 量化和 Unsloth 加速训练，显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势，实现高效、准确且具有解释性的医学问答系统。

NOASSERTION

Python

Updated 1 day ago

4-bit-quantizationaichain-of-thought+9

unsloth-llama3-alpaca-lora

Cre4T3Tiv3

🧡65

Custom model training using modern architectures. 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs. Published adapter on HuggingFace. From training pipeline to deployed model.

Apache-2.0

Jupyter Notebook

Updated 13 hours ago

4bitalpacacolab+12

deepgym

DeepGym

🧡65

RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.

Python

Updated 1 day ago

ai-agentscode-executioncoding-agents+15

CTune-MLX

Lt2023

🧡50

一个基于 MLX 的一键微调工具,主要用 Python和 Shell 实现unsloth在mac和cpu缺失的问题

NOASSERTION

Shell

Updated 2 months ago

Unsloth_Ollama

gdmuna

❤️35

基于Unsloth框架下，使用llama3大模型为基底的模型微调

Jupyter Notebook

Updated 5 months ago

unsloth-docker

eightBEC

❤️45

A Dockerfile for LLM training with Unsloth

Python

Updated 2 months ago

nlcli-wizard

pranavkumaarofficial

🧡50

Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)

MIT

Python

Updated 1 month ago

cli-toolsfine-tuninggemma+9

Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

shaheennabi

🧡50

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋

MIT

Jupyter Notebook

Updated 1 month ago

4bit-quantize4bitprecisionanthropic-hh-golden+17

Astor-AI

SrikarVeluvali

❤️45

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, ensuring privacy and security in every interaction. Designed for ease of use, AstorAI provides reliable health information on various topics 24/7.

Jupyter Notebook

Updated 1 month ago

flaskhuggingfacellama3+6

Make-AI-Clone-of-Yourself

Eviltr0N

❤️45

Cloning Yourself using your whatsapp chat history and training a model on it.

Jupyter Notebook

Updated 1 month ago

aiai-clonesai-project+12

Unsloth-VLLM-RTX5090-Ubuntu

oteroantoniogom

❤️45

Automated bash script to set up a high-performance environment on Ubuntu Linux with RTX5090, including installations of PyTorch, Unsloth, vLLM, Triton, Xformers. This script handles system dependencies, creates a Python virtual environment, compiles libraries from source, and verifies installations to ensure an optimal AI and deep learning setup.

MIT

Shell

Updated 1 month ago

GitHub Explorer

Search Results

unsloth

llmfit

notebooks

mlx-tune

deepfabric

transformers-qwen3-moe-fused

unsloth-zoo

unsloth-buddy

Vodalus-Expert-LLM-Forge

ToolBrain

TuneKit

Indic-gemma-7b-Navarasa

llm-finetuning-resources

vlm-grpo

Unsloth-Windows-fineTuning-Qwen2

unsloth-5090-multiple

MirrorFlow

grpo_unsloth_docker

oreilly-pytorch-dl

MedCoT-7B

unsloth-llama3-alpaca-lora

deepgym

CTune-MLX

Unsloth_Ollama

unsloth-docker

nlcli-wizard

Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Astor-AI

Make-AI-Clone-of-Yourself

Unsloth-VLLM-RTX5090-Ubuntu

unsloth

llmfit

notebooks

mlx-tune

deepfabric

transformers-qwen3-moe-fused

unsloth-zoo

unsloth-buddy

Vodalus-Expert-LLM-Forge

ToolBrain

TuneKit

Indic-gemma-7b-Navarasa

llm-finetuning-resources

vlm-grpo

Unsloth-Windows-fineTuning-Qwen2

unsloth-5090-multiple

MirrorFlow

grpo_unsloth_docker

oreilly-pytorch-dl

MedCoT-7B

unsloth-llama3-alpaca-lora

deepgym

CTune-MLX

Unsloth_Ollama

unsloth-docker

nlcli-wizard

Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Astor-AI

Make-AI-Clone-of-Yourself

Unsloth-VLLM-RTX5090-Ubuntu