openai/evals - GitHub Explorer | GitHub Explorer | Trending

Stars

18.1k

Forks

2.9k

Watchers

18.1k

Open Issues

182

Repository Health Score

💚

100/100

Excellent

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

Recent Commits

[codex] Pin GitHub Actions workflow references (#1637)

Drew Hintz•6 hours ago

dbb1a20View on GitHub

Remove incontext_rl suite with defunct dependencies (#1605)

Max Burkhardt•5 months ago

4bfc1f5View on GitHub

Updating readme to link to OpenAI hosted evals experience (#1572)

dmitry-openai•1 year ago

cdb8ce9View on GitHub

20240930 steven exception handling usage tokens (#1560)

sjadler2004•1 year ago

a32c982View on GitHub

[eval] Add IMO problems with exact answers (#1528)

Justin W. Lin•1 year ago

234bcdeView on GitHub

Release 3.0.1 (#1525)

Erik Ritter•1 year ago

d3dc890View on GitHub

Make the torch dep optional (#1524)

Erik Ritter•1 year ago

1d3f11cView on GitHub

Release 3.0.0 (#1520)

Erik Ritter•1 year ago

778caa6View on GitHub

Unpin dependencies (#1519)

Shantanu•1 year ago

518a9a8View on GitHub

Remove citation prediction eval (#1512)

Oliver Jaffe•2 years ago

c124f98View on GitHub

Allow for evals with no args (#1517)

Giulio Starace•2 years ago

4ed2f6fView on GitHub

Relax version constraint for `playwright` module (#1516)

Dane•2 years ago

20de8c5View on GitHub

Switch from pyzstd to zstandard (#1514)

Josh Snyder•2 years ago

b5853ebView on GitHub

Updates on existing solvers and bugged tool eval (#1506)

Oliver Jaffe•2 years ago

2420c62View on GitHub

Update ReadMe with New Cookbook link (#1507)

royziv11•2 years ago

d9d2f5fView on GitHub

View all commits

GitHub Explorer

evals

Score Breakdown

Issues Activity: Last 6 months

Top Labels

Hottest Issues