Evaluate memory layer candidates: Mem0, Zep, Letta, Cognee, custom

## Context

DESIGN_SPEC §15.2 lists the memory layer library as TBD. Before committing to a library, we need a comparative evaluation of the candidates below against the criteria in this issue.

**Candidates under evaluation:**
- **Mem0** — Universal memory layer for AI agents (open-source, production-proven)
- **Graphiti** — Temporal knowledge graph for AI agents (Zep's open-source successor)
- **Letta** — Stateful LLM agents with memory management (formerly MemGPT)
- **Cognee** — Knowledge engine for AI agents (graph-based)
- **Custom solution** — Neo4j + Qdrant + FastEmbed/LiteLLM composed stack
- Plus 12+ additional candidates discovered during open search

## Evaluation Criteria

For each candidate, assess:

- [x] Does it support all our memory types (working, episodic, semantic, procedural, social)?
- [x] Retrieval quality — published benchmarks and community reputation
- [x] Per-agent memory isolation support
- [x] Fully local operation (no cloud dependencies required)
- [x] Memory consolidation/summarization capabilities
- [x] API compatibility with our abstract memory interface (§7.1-7.3)
- [x] Compatibility with `OrgMemoryBackend` protocol (§7.4) — does the library provide graph memory capabilities that could serve as a backend for organizational memory?
- [x] Active maintenance, community size, license compatibility (must be compatible with BUSL-1.1)
- [x] Python 3.14+ compatibility
- [x] Async support quality
- [x] Container architecture and operational complexity
- [x] Embedding provider configurability (local + cloud)
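
The Python 3.14+ criterion can be checked mechanically against a package's declared `Requires-Python` metadata. The following is a minimal sketch, not the procedure used in the evaluation: `allows` is a hypothetical helper, the specifier strings are made-up examples rather than any candidate's real metadata, and the parser handles only plain numeric comparison clauses. The `packaging` library implements the full specifier grammar.

```python
import re

# One clause of a Requires-Python string, e.g. ">=3.10" or "<3.14".
_CLAUSE = re.compile(r"^(>=|<=|==|!=|<|>)\s*(\d+(?:\.\d+)*)$")


def _parse(version: str) -> tuple[int, ...]:
    """Turn '3.14.0' into (3, 14, 0)."""
    return tuple(int(part) for part in version.split("."))


def allows(requires_python: str, interpreter: str) -> bool:
    """Return True if every comma-separated clause admits the interpreter version.

    Simplified on purpose: no wildcards, no '~=', no pre-release handling.
    """
    target = _parse(interpreter)
    for raw in requires_python.split(","):
        clause = raw.strip()
        if not clause:
            continue
        match = _CLAUSE.match(clause)
        if match is None:
            raise ValueError(f"unsupported clause: {clause!r}")
        op, bound = match.group(1), _parse(match.group(2))
        # Pad with zeros so that 3.14 and 3.14.0 compare as equal.
        width = max(len(target), len(bound))
        t = target + (0,) * (width - len(target))
        b = bound + (0,) * (width - len(bound))
        ok = {
            ">=": t >= b,
            "<=": t <= b,
            "==": t == b,
            "!=": t != b,
            "<": t < b,
            ">": t > b,
        }[op]
        if not ok:
            return False
    return True


# Hypothetical specifiers, for illustration only.
capped = ">=3.10,<3.14"
uncapped = ">=3.10"
print(allows(capped, "3.14.0"), allows(uncapped, "3.14.0"))
```

A package that declares an upper bound like the first string fails this check even if its code would run on 3.14, which is why a capped bound can be a packaging decision rather than a technical limitation.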

## Acceptance Criteria

- [x] Open discovery: broad search beyond spec-listed candidates (16+ found)
- [x] Gate checks: pass/fail on local-first, license, Docker, Python 3.14, isolation, embeddings
- [x] Comparison table: all gate-passing candidates scored against weighted criteria (S1-S11, 100 points)
- [x] Decision documented with rationale: ADR-001 in `docs/decisions/ADR-001-memory-layer.md`
- [x] Evaluate whether selected candidate can serve §7.4 Shared Organizational Memory backends
- [x] Downstream impact documented for #32 (memory interface), #36 (persistence), #125 (org memory)
- [x] DESIGN_SPEC.md §15.2 updated to reflect decision
- [x] Backend swappability strategy: protocol-based design allows future backend swaps via config
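
The gate-then-score flow in the criteria above can be sketched as follows. This is an illustration under stated assumptions, not the scoring used in ADR-001: the gate keys, the per-criterion weights (chosen only so that S1-S11 sum to 100), the 0.0-1.0 rating scale, and the candidate names are all hypothetical.

```python
from dataclasses import dataclass, field

# Gate names mirror the acceptance criteria; the exact keys are assumptions.
GATES = ("local_first", "license", "docker", "python_314", "isolation", "embeddings")

# Hypothetical weights for S1..S11 that sum to 100 points.
WEIGHTS = {
    "S1": 15, "S2": 15, "S3": 10, "S4": 10, "S5": 10, "S6": 10,
    "S7": 10, "S8": 5, "S9": 5, "S10": 5, "S11": 5,
}


@dataclass
class Candidate:
    name: str
    gates: dict[str, bool]
    # Rating per criterion on a 0.0..1.0 scale; missing criteria count as 0.
    ratings: dict[str, float] = field(default_factory=dict)


def passes_gates(candidate: Candidate) -> bool:
    """A candidate must pass every gate; a missing gate counts as a failure."""
    return all(candidate.gates.get(gate, False) for gate in GATES)


def weighted_score(candidate: Candidate) -> float:
    """Sum of weight * rating over S1..S11, so the maximum is 100."""
    return sum(w * candidate.ratings.get(key, 0.0) for key, w in WEIGHTS.items())


def rank(candidates: list[Candidate]) -> list[Candidate]:
    """Drop gate failures, then order the rest by score, best first."""
    eligible = [c for c in candidates if passes_gates(c)]
    return sorted(eligible, key=weighted_score, reverse=True)


all_pass = {gate: True for gate in GATES}
candidates = [
    Candidate("candidate-a", all_pass, {key: 0.5 for key in WEIGHTS}),
    Candidate("candidate-b", {**all_pass, "python_314": False}, {key: 1.0 for key in WEIGHTS}),
    Candidate("candidate-c", all_pass, {key: 1.0 for key in WEIGHTS}),
]
for c in rank(candidates):
    print(c.name, weighted_score(c))
```

Gates are evaluated before scoring, so a candidate that fails a single gate never appears in the comparison table regardless of how well it would have scored.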

## Decision Summary

**Mem0** is the initial backend (in-process, embedded Qdrant + SQLite, persisted to a Docker volume), placed behind a pluggable `MemoryBackend` protocol. The custom stack (Neo4j + external Qdrant) is the planned future upgrade. Cognee and Letta stay on the watch list pending Python 3.14 support.

Key findings:
- 16+ candidates evaluated, 3 passed all gates (Mem0, Graphiti, Custom Stack)
- Letta and Cognee eliminated by the Python 3.14 gate: both carry a Python `<3.14` constraint, which is a conservative version bound rather than a known technical incompatibility, so both stay on the watch list
- Supermemory eliminated: proprietary engine, SDK-only open source
- Mem0 chosen as the initial backend for its production-readiness (v1.0+, 49k stars) and low setup cost
- Protocol architecture ensures any backend can be swapped in later via config
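
The protocol-plus-config idea from the last finding can be sketched like this. It is a minimal illustration, not the interface defined in §7.1-7.3: the `store` and `search` methods, the `backend` config key, the registry, and `InMemoryBackend` are all hypothetical, and the sketch is synchronous for brevity. A real Mem0 adapter would be registered under its own key and is not shown, since its calls are not described in this issue.

```python
from typing import Callable, Protocol, runtime_checkable


@runtime_checkable
class MemoryBackend(Protocol):
    """Hypothetical shape of the pluggable backend protocol."""

    def store(self, agent_id: str, content: str) -> str: ...

    def search(self, agent_id: str, query: str, limit: int = 5) -> list[str]: ...


class InMemoryBackend:
    """Toy backend that keeps one list of memories per agent (per-agent isolation)."""

    def __init__(self) -> None:
        self._items: dict[str, list[str]] = {}

    def store(self, agent_id: str, content: str) -> str:
        bucket = self._items.setdefault(agent_id, [])
        bucket.append(content)
        return f"{agent_id}:{len(bucket) - 1}"

    def search(self, agent_id: str, query: str, limit: int = 5) -> list[str]:
        # Case-insensitive substring match stands in for real retrieval.
        needle = query.lower()
        hits = [c for c in self._items.get(agent_id, []) if needle in c.lower()]
        return hits[:limit]


# Registry of backend factories, keyed by the name used in config.
_REGISTRY: dict[str, Callable[[], MemoryBackend]] = {"in_memory": InMemoryBackend}


def create_backend(config: dict[str, str]) -> MemoryBackend:
    """Pick a backend by name from config; callers depend only on the protocol."""
    name = config.get("backend", "in_memory")
    try:
        factory = _REGISTRY[name]
    except KeyError:
        raise ValueError(f"unknown memory backend: {name!r}") from None
    return factory()


backend = create_backend({"backend": "in_memory"})
backend.store("agent-1", "Prefers concise status updates")
backend.store("agent-2", "Owns the billing service")
print(backend.search("agent-1", "status"))
```

Because callers only see `MemoryBackend`, swapping backends means registering another factory and changing one config value; no calling code needs to change.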

## Dependencies

- Depends on memory system interface (#32)

## Design Spec Reference

- §7 — Memory System (§7.1-7.3 individual agent memory, §7.4 shared org memory)
- §15.2 — Technology Stack (updated with decision)

---
> **Updated 2026-03-08:** Evaluation complete. ADR-001 accepted. Acceptance criteria revised to match research-based evaluation approach (prototypes and latency benchmarks deferred — unnecessary given pragmatic decision to use Mem0 as initial backend with protocol-based swappability).
