Search Results

Found 11 repositories(showing 11)

alignment-handbook

huggingface

💛80

Robust recipes to align language models with human and AI preferences

5.6k

480

Apache-2.0

Python

Updated 17 hours ago

llmrlhftransformers

notus

argilla-io

🧡60

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

168

MIT

Python

Updated 1 week ago

alignment-handbookdpofine-tuning+4

alignment-handbook

yashiqxlabai

❤️30

No description available

Apache-2.0

Python

Updated 1 year ago

alignment-handbook

Red-Fairy

❤️30

No description available

Apache-2.0

Python

Updated 1 year ago

hf_alignment_handbook

spsanps

❤️30

No description available

Apache-2.0

Python

Updated 2 years ago

alignment-handbook-thesis

Ally-Ha

🧡50

No description available

Apache-2.0

Python

Updated 1 week ago

SFT_alignment_handbook

Ekanshqx

❤️35

This is the final code for SFT full finetuning on multiple GPU

Apache-2.0

Python

Updated 1 year ago

workflow-huggingface-alignment-handbook-sft-dpo-alignment-pipeline

leeroopedia

❤️35

No description available

Python

Updated 1 month ago

workflow-huggingface-alignment-handbook-orpo-single-stage-alignment

leeroopedia

❤️35

No description available

Python

Updated 1 month ago

workflow-huggingface-alignment-handbook-qlora-single-gpu-finetuning

leeroopedia

❤️35

No description available

Python

Updated 1 month ago

workflow-huggingface-alignment-handbook-multi-stage-post-training

leeroopedia

❤️35

No description available

Python

Updated 1 month ago

All 11 repositories loaded

GitHub Explorer

Search Results

alignment-handbook

notus

alignment-handbook

alignment-handbook

hf_alignment_handbook

alignment-handbook-thesis

SFT_alignment_handbook

workflow-huggingface-alignment-handbook-sft-dpo-alignment-pipeline

workflow-huggingface-alignment-handbook-orpo-single-stage-alignment

workflow-huggingface-alignment-handbook-qlora-single-gpu-finetuning

workflow-huggingface-alignment-handbook-multi-stage-post-training

alignment-handbook

notus

alignment-handbook

alignment-handbook

hf_alignment_handbook

alignment-handbook-thesis

SFT_alignment_handbook

workflow-huggingface-alignment-handbook-sft-dpo-alignment-pipeline

workflow-huggingface-alignment-handbook-orpo-single-stage-alignment

workflow-huggingface-alignment-handbook-qlora-single-gpu-finetuning

workflow-huggingface-alignment-handbook-multi-stage-post-training