Found 185 repositories (showing 30)
SUFE-AIFLM-Lab
Fin-R1 is a large language model for complex financial reasoning developed and open-sourced with the joint efforts of the SUFE-AIFLM-Lab at the School of Statistics and Data Science, Shanghai University of Finance and Economics and FinStep.AI.
Fine-tuning and deployment of DeepSeek-R1-Distill-Qwen-32B on 2 million medical data records
jbarnes850
A step-by-step guide to fine-tuning the DeepSeek R1 Distilled models on Apple Silicon machines.
ben637482548
A comprehensive guide for fine-tuning DeepSeek-R1 (distilled Llama) locally on Apple Silicon Macs. Includes detailed step-by-step instructions, error resolution, and optimization techniques using UnslothAI and Ollama.
Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation
Fine-tuned DeepSeek 8B model for financial reasoning
siddharth-Kharche
No description available
This repository contains code for fine-tuning the DeepSeek-R1-Distill-Llama-8B model on financial data using LoRA (Low-Rank Adaptation). The implementation uses the Unsloth library for efficient training and inference.
JohnYehyo
Fine-tuning of the distilled DeepSeek version
Hann-Fu
No description available
mraihan-gmu
A Python notebook to prompt and fine-tune DeepSeek-R1
Ripan-Roy
No description available
Naominour
No description available
Amreldesouky
Adapted the DeepSeek model to a medical dataset to improve disease prediction accuracy through fine-tuning.
abdussahid26
General fine-tuning of the DeepSeek-R1-Distill-Qwen-1.5B model for code generation tasks.
This script demonstrates how to fine-tune a distilled Llama model using Unsloth for medical question answering.
Fine-tuned DeepSeek 1.5B model for financial reasoning
SUFE-AIFLM-Lab
No description available
junaidariie
No description available
Fine-tune DeepSeek
AhmadShayan1112
Fine-tune DeepSeek R1 using QLoRA for efficient, low-resource training on GPUs like NVIDIA P100. Leverages Hugging Face, PEFT, and bitsandbytes for 4-bit quantized LoRA training with mixed precision. Includes model saving, evaluation, and custom dataset support.
Efficient fine-tuning of DeepSeek-R1-Distill-Qwen-1.5B using LoRA and 8-bit quantization for optimized performance.
ZHOUcourier
FIN-R1 Empowered Adaptive Quantitative Investment Engine - FinLoom
eliotdgl
Machine Learning for Finance and Complex Systems: Semester Project
Laggers3301
No description available
J-aso-n
Fine-tune DeepSeek
Shaku-Med
Follow the given instructions to run it
Ko-Yin-Maung
No description available
M-Abdullah-Jutt
No description available
goddev420
Auto snipe tokens via twitter post - deepseekr1 - priv rpc