GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

GOATnote-Inc/lostbench - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

lostbench

GOATnote-Inc•PUBLIC

View on GitHub

Standalone benchmark for multi-turn safety persistence in medical LLM conversations. Measures recommendation monotonicity under sustained patient pressure.

ai-safetybenchmarkevaluation-frameworkllm-evaluationmedical-aisafety-persistence

MIT License

Created on Feb 13, 2026

Updated on Mar 12, 2026

Stars

Forks

Watchers

Open Issues

Repository Health Score

🧡

60/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Regular updates - updated this month

20/30

67%

Recent Commits

fix: skip adversarial regression gracefully when API keys not configured

Brandon Dent, MD•3 weeks ago

c17030aView on GitHub

feat: physician adjudication integration (lightweight pointer to SG2)

Brandon Dent, MD•1 month ago

07c8494View on GitHub

fix: update stale OpenEM condition count 363 → 370

Brandon Dent, MD•1 month ago

7c16780View on GitHub

docs: surface campaign engine, methodology, governance, and attack taxonomy in README

Brandon Dent, MD•1 month ago

26a620fView on GitHub

Wire defer differentials RAG into defer runner (#1)

GOATnote•1 month ago

112d4afView on GitHub

ci: add CODEOWNERS for workflow review trail

Brandon Dent, MD•1 month ago

153c309View on GitHub

ci: add permissions block to adversarial-regression workflow

Brandon Dent, MD•1 month ago

1091c8cView on GitHub

docs: update exploit_families and CLAUDE.md with resolved judge agreement

Brandon Dent, MD•1 month ago

9cbceadView on GitHub

fix: judge JSON parse retry + regrade corrects κ=0.137 → 90.6% agreement

Brandon Dent, MD•1 month ago

fd434d1View on GitHub

EF-016: both-calibrated regrade reveals substantive judge disagreement (κ=0.137)

Brandon Dent, MD•1 month ago

a5a28acView on GitHub

EF-016: MCI calibration resolves GPT judge bias (agreement 69%→72%)

Brandon Dent, MD•1 month ago

c68d5ccView on GitHub

EF-016: calibrated judge regrade (κ=0.126→0.344)

Brandon Dent, MD•1 month ago

66bf469View on GitHub

judge calibration: Level 2 vs 1 boundary fix resolves 3/3 disagreements

Brandon Dent, MD•1 month ago

c4d4fccView on GitHub

MSTS dual-judge validation: κ=0.400 on non-MCI conditions

Brandon Dent, MD•1 month ago

c974a7dView on GitHub

EF-016: judge asymmetry validation + GPT preamble regression forensics

Brandon Dent, MD•1 month ago

bc2acfdView on GitHub

View all commits