Found 33 repositories (showing 30)
PhoebusSi
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, p-tuning) for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM-related technologies as possible. We built a fine-tuning platform that makes it easy for researchers to get started with and use large models; we welcome open-source enthusiasts to open any meaningful PR!
No description available
wuhy68
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
juyongjiang
CodeUp: A Multilingual Code Generation Llama-X Model with Parameter-Efficient Instruction-Tuning
bigcode-project
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
subhendukhatuya
Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling
In this work, we address the dual challenge of accuracy and interpretability by adapting a multimodal large language model to perform both precise PMA prediction and clinically relevant explanation generation. We introduce a parameter-efficient fine-tuning strategy using instruction tuning and Low-Rank Adaptation applied to the Qwen2.5-VL-7B model.
SubinoyBera
End-to-end parameter-efficient domain adaptation and instruction fine-tuning of the Qwen3-1.7B LLM
MartinYuanNJU
Appendices of paper "Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction".
ChanukaWelagedara
This repository includes code and experiments for fine-tuning the microsoft/Phi-3-mini-4k-instruct model using Parameter-Efficient Fine-Tuning (PEFT) with LoRA. It is focused on instruction-based tuning and is optimized for generating natural language product advertisements.
AIST5030 Mini-Project: Parameter-Efficient Instruction Tuning via Orthogonal Finetuning
dk008652
Question-Guided Multi-Task Instruction Tuning for Parameter-Efficient Dialogue State Tracking
Parameter-efficient fine-tuning pipeline for domain-specific LLMs using QLoRA, synthetic data generation, and instruction tuning.
roshan-176
A comparative study of parameter-efficient fine-tuning for instruction-following large language models
andersonbcdefg
Using p-tuning (a form of parameter-efficient fine-tuning) to tune Pythia models on natural language instructions.
Khaled-Karam
Parameter-Efficient Fine-Tuning of FLAN-T5 using LoRA and IA³ on the Databricks Dolly 15K dataset for instruction tuning.
Abhisekh97
Parameter-efficient fine-tuning of the open-source FLAN-T5 LLM, with a comparison against instruction fine-tuning and evaluation using ROUGE score
shanayghag
Geometric Reprojection Instruction Tuning (GRIT): A parameter-efficient fine-tuning framework that combines LoRA with curvature-aware optimization and neural reprojection for efficient adaptation of vision-language models
adilzubair
A fine-tuned GPT-2 Medium model using LoRA (Low-Rank Adaptation) on the Alpaca instruction dataset. This project demonstrates parameter-efficient fine-tuning for instruction-following tasks.
ashishsamant2311
Instruction fine-tuning of a FLAN-T5-Base model on the SQuAD dataset, and parameter-efficient fine-tuning of the same model using LoRA on SQuAD
Isha-singh-01
Flan-T5 model for dialogue summarization, integrating instruction-based fine-tuning, parameter-efficient fine-tuning (PEFT), and Reinforcement Learning with Human Feedback (RLHF) for output detoxification.
deepdmk
Parameter-efficient instruction fine-tuning using LoRA to adapt OPT-350M for code generation. Trains <1% of parameters (1.7M/350M) on CodeAlpaca-20k with SACREBLEU evaluation. Demonstrates practical PEFT techniques for SLMs.
This repository contains a Jupyter Notebook for fine-tuning the TinyLlama-1.1B model using Parameter-Efficient Fine-Tuning (PEFT) techniques, specifically LoRA. The model is trained on the Alpaca dataset to improve its instruction-following capabilities.
MaryamAlipourH
Implemented parameter-efficient instruction tuning by applying LoRA to TinyLlama and Pythia on LIMA and benchmarking zero-shot performance on HellaSwag, then applied an improved LoRA variant and compared results.
Kaleemullah-Younas
This repository contains two self-contained fine-tuning workflows: Fine-tuning Llama 2: parameter-efficient fine-tuning of Llama-2-7B-Chat using QLoRA and PEFT on a small instruction dataset. Fine-tuning with Unsloth: fast, memory-efficient fine-tuning of Phi-3 Mini (4k instruct, 4-bit) using Unsloth with LoRA.
muhammadHussainRaza
Fine-tuning LLaMA 2 with Hugging Face Transformers: this repository contains code and configuration for fine-tuning Meta's LLaMA 2 model using the Hugging Face transformers and peft (Parameter-Efficient Fine-Tuning) libraries. It supports training on custom datasets for instruction tuning, Q&A, or domain-specific NLP tasks.
Transformer-based LLMs excel at abstractive summarization, but full fine-tuning is often infeasible with limited computing resources. This study assesses a Gemma-2B instruction-tuned model on the XSum summarization task, contrasting a baseline (no fine-tuning) with parameter-efficient fine-tuning using QLoRA.
Pavan-220405
This project instruction-fine-tunes the Phi-2 language model using QLoRA to generate Amazon product names and descriptions. It demonstrates parameter-efficient fine-tuning on custom text data using low-bit quantization and Hugging Face tools.
Manoj2409
Implemented transfer learning, Parameter-Efficient Fine-Tuning (PEFT), and Low-Rank Adaptation (LoRA) on the LLaMA 3.2-3B-Instruct model to generate RISC-V assembly instructions for load and store operations.
MayankSethi27
Fine-tuning Google’s Gemma 2B model using KerasNLP and the Dolly-15k dataset in Google Colab. This project covers data preparation, LoRA-based parameter-efficient fine-tuning, and generation using instruction-context prompts. Focused on low-resource environments and practical GenAI training.