GitHub Explorer

by Alexey Ratnikov

lmm-r1

TideDra•PUBLIC

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Apache License 2.0

Created on Feb 13, 2025

Updated on Apr 2, 2026

Stars

845

Forks

Watchers

845

Open Issues

Repository Health Score

💛

71/100

Good

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

Issues Analytics

Total Issues

All time

Open

19% of total

Closed

Recent Commits

Merge pull request #74 from ForJadeForest/dev

Geary.Z•11 months ago

f917f18

add special token of gemma3

yingzhepeng•11 months ago

5085cfd

Remove unused _register_to_autoclass method from Gemma3_Patch to streamline the class implementation.

yingzhepeng•11 months ago

cfd369b

Remove dataset argument from command line options in math verifier and update error handling for unknown prompt templates.

yingzhepeng•11 months ago

fef546e

Add handling for fake pixel values in Gemma3_Patch to ensure inputs_embeds are updated correctly when no valid image features are available.

yingzhepeng•11 months ago

657b10f

Refactor Gemma3_VLDataProcessor error handling and message formatting

yingzhepeng•11 months ago

aff8ea9

Add model parameter filtering in DeepspeedStrategy initialization

yingzhepeng•11 months ago

41dab70

Add command line option to enable/disable format reward calculation in math verifier

yingzhepeng•11 months ago

89d358c

Implement liger kernel support in Gemma3 by integrating apply_liger_kernel_to_gemma3 function

yingzhepeng•11 months ago

4f4fbc7

support gemma3

yingzhepeng•11 months ago

53f69a8

use label_key to get the answer in remote_rm

yingzhepeng•11 months ago

d48e367

use label_key to get the answer in remote_rm

yingzhepeng•11 months ago

2dbac2b

fix llm offset_split_position_ids

TideDra•11 months ago

eed5b5a

Merge branch 'dev'

TideDra•11 months ago

813b04d

Merge branch 'openrlhf_v0.7.3' into dev

TideDra•11 months ago

9006cec
