A standardized benchmark framework for AI memory systems. Test Mem0, Graphiti, Letta, and more against LongMemEval, LoCoMo, and HaluMem.
Stars: 0
Forks: 0
Watchers: 0
Open Issues: 0
14 commits
bfe3cc9 Add S-split stratified testing, optimize Supermemory batch ingest
a74b448 Auto-detect fact-extraction systems, show guidance in all reports
6151b54 Fix Supermemory adapter: add processing poll, retest with results
7329063 Fix all code review issues (4 critical, 7 important, 3 suggestions)
bf3d12f Add BM25 full 500-question baseline (all 6 question types)
2dc36e6 v0.2: semantic eval, Chinese benchmark, continual learning, compliance tests
c5b3893 Add full test results: BM25, Hindsight, LangMem verified with QA Judge
8fc7717 Polish: ruff format, HTML report, cost tracking, hallucination LLM eval
d0fa95b Complete framework: QA pipeline, tests, CI, adapter fixes