Search Results

Found 24 repositories(showing 24)

distilabel

argilla-io

💛75

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

3.2k

232

Apache-2.0

Python

Updated 2 days ago

aihuggingfacellms+6

TimeHC-RL

ZJU-REAL

❤️35

This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).

Updated 3 months ago

Synthetic-Data-Generation-using-LLM

AIAnytime

🧡60

Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.

MIT

Jupyter Notebook

Updated 1 week ago

distilabel-spin-dibt

argilla-io

🧡50

Repository containing the SPIN experiments on the DIBT 10k ranked prompts

Apache-2.0

Python

Updated 2 months ago

distilabel-workbench

argilla-io

❤️20

A working repository for experimental pipelines in distilabel

Jupyter Notebook

Updated 1 year ago

Synthetic-Data-Generation-using-LLM

GURPREETKAURJETHRA

❤️40

Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.

MIT

Jupyter Notebook

Updated 1 year ago

argillachatgptgenerative-ai+2

distilabel

lightonai

🧡50

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Apache-2.0

Python

Updated 1 month ago

finetune-medical

497429018

❤️45

Distilabel DeepSeek-R1 模型蒸馏实战

Python

Updated 2 months ago

distilabel-helm-instruct-adaptable-evaluation-criteria

argilla-io

❤️40

A repo that implements Stanford CRFM their HELM Instruct with adaptable evaluation criteria

Apache-2.0

Jupyter Notebook

Updated 11 months ago

distilabel-cost-calculator

djellalmohamedaniss

❤️35

A custom Step for LLM API cost calculation for the distilabel library.

Python

Updated 1 year ago

synthetic-data

inclusion_dataset

johannhartmann

❤️40

A simple distilabel generation pipeline to create a dataset for inclusivity training for language models.

MIT

Python

Updated 9 months ago

alvaldes

❤️35

A simplified Distilabel-inspired pipeline for synthetic dataset creation using pandas DataFrames and local Ollama models.

Jupyter Notebook

Updated 4 months ago

synthetic-datasets

thibaud-perrin

❤️30

Generate synthetic datasets for instruction tuning and preference alignment using tools like `distilabel` for efficient and scalable data creation.

Jupyter Notebook

Updated 1 year ago

aiinstruction-tuningllm+2

Real-time-rag-pipeline-for-rag-research-papers

AreebAhmad-02

❤️35

the repo is for the real time rag pipeline for the research papers , extract all the rag research papers from the arxiv and the semantic chunkin is done on it , then the embedding model finetuning is done to make cluster for finetuning embedding model the distilabel is used for generating synthetic data set

Jupyter Notebook

Updated 1 year ago

All 24 repositories loaded

GitHub Explorer

Search Results

distilabel

TimeHC-RL

Synthetic-Data-Generation-using-LLM

distilabel-spin-dibt

distilabel-workbench

Synthetic-Data-Generation-using-LLM

distilabel

finetune-medical

distilabel-helm-instruct-adaptable-evaluation-criteria

distilabel-cost-calculator

inclusion_dataset

mockdata

Distilabel

distilabel

ml-distilabel

distilabel_trigger

distilabel-feedstock

distilabel_triggers

distilabel-credit-risk-applications

distilabel-steps-library

synthetic_data_generation_distilabel

synthetic_data_generation

synthetic-datasets

Real-time-rag-pipeline-for-rag-research-papers

distilabel

TimeHC-RL

Synthetic-Data-Generation-using-LLM

distilabel-spin-dibt

distilabel-workbench

Synthetic-Data-Generation-using-LLM

distilabel

finetune-medical

distilabel-helm-instruct-adaptable-evaluation-criteria

distilabel-cost-calculator

inclusion_dataset

mockdata

Distilabel

distilabel

ml-distilabel

distilabel_trigger

distilabel-feedstock

distilabel_triggers

distilabel-credit-risk-applications

distilabel-steps-library

synthetic_data_generation_distilabel

synthetic_data_generation

synthetic-datasets

Real-time-rag-pipeline-for-rag-research-papers