Found 1,840 repositories(showing 30)
unslothai
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
AlexsJones
Hundreds of models & providers. One command to find what runs on your hardware.
unslothai
250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.
ARahim3
Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, and Vision fine-tuning — natively on MLX. Unsloth-compatible API.
always-further
Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline
woct0rdho
Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bnb 4-bit quant, Unsloth. Also possible to train LoRA over GGUF
unslothai
Utils for Unsloth https://github.com/unslothai/unsloth
TYH-labs
Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.
severian42
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.
ToolBrain
A framework for agentic tool use training with reinforcement learning
riyanshibohra
Upload your data → Get a fine-tuned SLM. Free.
TeluguLLMLabs
Repository for fine-tuning gemma models using unsloth for indic languages
vossenwout
Unsloth fine-tuning resources.
GAD-cell
An implementation of GRPO for Unsloth's VLMs training
Unsloth框架在Windows平台微调训练Qwen2大模型,非WSL
thad0ctor
unsloth-5090-multiple
qqqqqf-q
从对话数据到训练:数字分身 + 模型蒸馏 From Dialogue Data to Training Closed-Loop: Digital Twin + Model Distillation
ArturTanona
No description available
sinanuozdemir
Code for Deep Learning for Modern AI
Breeze648
本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性的医学问答系统。
Cre4T3Tiv3
Custom model training using modern architectures. 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs. Published adapter on HuggingFace. From training pipeline to deployed model.
DeepGym
RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.
Lt2023
一个基于 MLX 的一键微调工具,主要用 Python和 Shell 实现unsloth在mac和cpu缺失的问题
gdmuna
基于Unsloth框架下,使用llama3大模型为基底的模型微调
eightBEC
A Dockerfile for LLM training with Unsloth
pranavkumaarofficial
Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)
Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋
SrikarVeluvali
AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, ensuring privacy and security in every interaction. Designed for ease of use, AstorAI provides reliable health information on various topics 24/7.
Eviltr0N
Cloning Yourself using your whatsapp chat history and training a model on it.
oteroantoniogom
Automated bash script to set up a high-performance environment on Ubuntu Linux with RTX5090, including installations of PyTorch, Unsloth, vLLM, Triton, Xformers. This script handles system dependencies, creates a Python virtual environment, compiles libraries from source, and verifies installations to ensure an optimal AI and deep learning setup.