Search Results

Found 31 repositories(showing 30)

LongMemEval

xiaowu0162

🧡66

Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)

549

MIT

Python

Updated 29 minutes ago

Zengram

ZenSystemAI

🧡65

A Multi Agent Memory MCP That Connect Agents Across Systems and Machines

MIT

JavaScript

Updated 5 hours ago

claudeclaude-aiclaude-code+17

agentmemory

JordanMcCann

💛70

Memory system for AI agents. #1 on LongMemEval — 96.2% (481/500). Beats every published system including Chronos, Mastra, Supermemory, and Emergence. Built solo in 16 days for $1,000.

MIT

Python

Updated 1 day ago

workshop-longmemeval

mastra-ai

❤️35

Memory examples for the Mastra Memory workshop on Jul 24, 2025

Updated 5 months ago

longmemeval-rlm

rawwerks

❤️45

No description available

Python

Updated 1 week ago

atlas-memory-releases

dddabtc

🧡60

Atlas Memory — self-hosted long-term memory for AI agents. LongMemEval (90.18%)

MIT

Updated 1 week ago

Backboard-longmemEval-results

Backboard-io

❤️40

No description available

HTML

Updated 2 weeks ago

sociomemory

B-Divyesh

🧡60

High-accuracy long-term memory for AI agents. 86.6% on LongMemEval with 10-step Hyper Search RAG pipeline.

MIT

Python

Updated 1 week ago

sdk-python

recallrai

🧡50

Official Python SDK for RecallrAI – a revolutionary contextual memory system that enables AI assistants to form meaningful connections between conversations, just like human memory.

Python

Updated 6 days ago

aicogneecontextual-memory+9

longmemeval-inspector

nicoloboschi

❤️45

Visual inspector for LongMemEval dataset

HTML

Updated 1 month ago

tribalmemory-bench

abbudjoe

❤️45

Benchmark suite for conversational memory systems (LongMemEval, ConvoMem)

Python

Updated 1 month ago

LongMemEval-Advanced-Deeplearning

kinsingo

❤️30

No description available

MIT

Python

Updated 4 months ago

lens-benchmark

marklubin

🧡50

LENS - AI Memory Benchmark - Memory as Experience, Not Facts

MIT

HTML

Updated 3 hours ago

agentbenchmarkcontext-engineering+5

Local-first AI memory that scores 100% on user fact recall. Open-source memory layer for LLM agents with hybrid search, middle-out compression, and local LLM support. Beats Mem0 (49%) and Zep (71%) on LongMemEval. Your data never leaves your machine.

MIT

Python

Updated 1 day ago

longmemeval

pdx97

❤️25

No description available

Python

Updated 10 months ago

longmemeval

lugmanhussainkhan

❤️25

No description available

TypeScript

Updated 4 months ago

longmemeval

josancamon19

❤️35

No description available

Python

Updated 2 months ago

longmemeval-results

Neutrally-app

🧡65

Neutrally's LongMemEval-S hypothesis file and reproduction instructions — 89.4% (447/500)

Updated 1 day ago

AtlasLongMemEval

hellen9527

🧡65

在longmemeval评估集上评测

Python

Updated 5 days ago

MemoryOS-LongMemEval

Yummytanmo

❤️45

This project provides a complete adaptation of MemoryOS for evaluating long-term memory capabilities on the LongMemEval benchmark, with all core MemoryOS functionality preserved and optimized for evaluation scenarios.

Python

Updated 1 month ago

A-mem-LongMemEval

Yummytanmo

❤️45

An evaluation adapter for benchmarking the A-mem memory system on LongMemEval, supporting retrieval metrics (Recall@k, NDCG@k) and QA evaluation with multiple LLM backends (OpenAI, SGLang, Ollama).

Python

Updated 1 month ago

longmemeval-nous-3b-evidence

AlpenglowAgents

🧡55

NOUSai LongMemEval benchmark evidence: 73% with Ollama 3B (local inference)

Updated 2 weeks ago

investigathon-NLP

juaneliascabrera

❤️35

Implementación de RAG sobre Gemma3:4b. Testeado con LongMemEval

Python

Updated 3 months ago

oltramem

Koushik1161

🧡50

Next-generation agent memory system with 70.4% QA accuracy on LongMemEval

MIT

JavaScript

Updated 1 month ago

openclaw

omega-memory

❤️30

OMEGA persistent memory plugin for OpenClaw — graph-based, local-first, #1 on LongMemEval

TypeScript

Updated 1 month ago

ai-agentmcpmemory+4

mem-bench

rivercrab26

🧡60

Standardized benchmark framework for AI memory systems. Test Mem0, Graphiti, Letta, and more against LongMemEval, LoCoMo, HaluMem.

NOASSERTION

Python

Updated 1 week ago

memorybench

Jinstronda

🧡60

Open source memory benchmark + RAG system. 82.8% on LongMemEval. Ships as MCP server for Claude Code.

MIT

TypeScript

Updated 3 weeks ago

rlm-memory

sayedRaheel

❤️45

Scalable conversational memory via recursive sub-agent delegation — 46% EM vs 5% truncation on LongMemEval-S, zero training

Python

Updated 1 month ago

gated-mem

VihAMBR

🧡60

What works for LLM long-term memory - tested on 2,040 questions across LoCoMo and LongMemEval. CoT prompting > fancy encoding.

MIT

Python

Updated 1 week ago

memory-benchmark-explorer

SidU

❤️45

A lightweight web app for humans to explore LongMemEval’s long‑context questions and see how the dataset is structured.

MIT

TypeScript

Updated 2 months ago

GitHub Explorer

Search Results

LongMemEval

Zengram

agentmemory

workshop-longmemeval

longmemeval-rlm

atlas-memory-releases

Backboard-longmemEval-results

sociomemory

sdk-python

longmemeval-inspector

tribalmemory-bench

LongMemEval-Advanced-Deeplearning

lens-benchmark

ReCall

longmemeval

longmemeval

longmemeval

longmemeval-results

AtlasLongMemEval

MemoryOS-LongMemEval

A-mem-LongMemEval

longmemeval-nous-3b-evidence

investigathon-NLP

oltramem

openclaw

mem-bench

memorybench

rlm-memory

gated-mem

memory-benchmark-explorer

LongMemEval

Zengram

agentmemory

workshop-longmemeval

longmemeval-rlm

atlas-memory-releases

Backboard-longmemEval-results

sociomemory

sdk-python

longmemeval-inspector

tribalmemory-bench

LongMemEval-Advanced-Deeplearning

lens-benchmark

ReCall

longmemeval

longmemeval

longmemeval

longmemeval-results

AtlasLongMemEval

MemoryOS-LongMemEval

A-mem-LongMemEval

longmemeval-nous-3b-evidence

investigathon-NLP

oltramem

openclaw

mem-bench

memorybench

rlm-memory

gated-mem

memory-benchmark-explorer