LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Stars: 240 · Forks: 24 · Watchers: 240 · Open Issues: 8
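The project's core recipe is RLHF-style PPO fine-tuning of LLaMA with LoRA adapters. The sketch below shows how that combination typically looks with the `trl` and `peft` libraries; the checkpoint name, hyperparameters, and fixed reward are placeholder assumptions rather than this repository's actual configuration, and it assumes the classic `PPOTrainer` API from older `trl` releases.

```python
# Minimal sketch of PPO fine-tuning with a LoRA adapter, using the classic
# trl PPOTrainer API together with peft. The checkpoint name, hyperparameters,
# and the hard-coded reward are placeholders, not this repo's actual setup.
import torch
from peft import LoraConfig
from transformers import AutoTokenizer
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer

model_name = "huggyllama/llama-7b"  # placeholder LLaMA checkpoint

# LoRA on the attention projections, a common choice for LLaMA.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    bias="none",
    task_type="CAUSAL_LM",
)

tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token

# Policy = base model + value head + LoRA adapter. With a peft model,
# ref_model=None lets trl reuse the frozen base weights as the reference.
model = AutoModelForCausalLMWithValueHead.from_pretrained(
    model_name, peft_config=lora_config
)

ppo_config = PPOConfig(learning_rate=1.4e-5, batch_size=1, mini_batch_size=1)
ppo_trainer = PPOTrainer(
    config=ppo_config, model=model, ref_model=None, tokenizer=tokenizer
)

# One PPO step: query -> sampled response -> scalar reward.
query = tokenizer("How do I bake bread?", return_tensors="pt").input_ids[0]
full = ppo_trainer.generate(query, max_new_tokens=32, do_sample=True)
response = full.squeeze()[query.shape[0]:]  # keep only the generated tokens
reward = torch.tensor(1.0)  # in practice, scored by the trained reward model
stats = ppo_trainer.step([query], [response], [reward])
```

In the full pipeline, the scalar reward would come from a separately trained reward model, which is the subject of commit f280755 in the history below.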
Recent commits:
- `53c7735` Merge pull request #8 from jasonvanf/codex/locate-and-fix-an-important-bug
- `f280755` Increase the flexibility of parameters in the training reward model
- `397e700` Keep the maximum length consistent with 'seq_length' when setting up sft_trainer
- `68f61bf` Support full weight fine-tuning with DeepSpeed stage-3 (offload)
- `ea9ee75` Add `warmup_ratio` and `save_total_limit` argument settings
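Two of these commits (`68f61bf`, `ea9ee75`) concern trainer configuration. As a rough illustration, assuming the standard `transformers` `TrainingArguments` API, the corresponding settings could look like the sketch below; all concrete values are placeholders, not the repository's actual config, and running it requires the `deepspeed` package.

```python
# Hypothetical settings illustrating the commit messages above; the repo's
# actual training scripts and config values may differ.
from transformers import TrainingArguments

# ZeRO stage-3 with CPU offload, enabling full-weight fine-tuning when the
# model does not fit in GPU memory (commit 68f61bf). TrainingArguments
# accepts this config dict directly via its `deepspeed` argument; "auto"
# fields are filled in from the TrainingArguments values at trainer init.
ds_zero3_offload = {
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {"device": "cpu"},
        "offload_param": {"device": "cpu"},
    },
    "train_batch_size": "auto",
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

training_args = TrainingArguments(
    output_dir="./checkpoints",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    warmup_ratio=0.03,    # LR warmup over the first 3% of steps (ea9ee75)
    save_total_limit=3,   # keep only the 3 most recent checkpoints (ea9ee75)
    deepspeed=ds_zero3_offload,
)
```

Commit `397e700` similarly suggests that the SFT stage passes the script's 'seq_length' setting through as the trainer's maximum sequence length, so that tokenization and training agree on a single length.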