-
Notifications
You must be signed in to change notification settings - Fork 0
feat: hybrid search (dense + BM25 sparse) for memory retrieval pipeline #694
Copy link
Copy link
Labels
prio:highImportant, should be prioritizedImportant, should be prioritizedscope:large3+ days of work3+ days of workspec:memoryDESIGN_SPEC Section 7 - Memory & PersistenceDESIGN_SPEC Section 7 - Memory & Persistencetype:featureNew feature implementationNew feature implementationv0.6Minor version v0.6Minor version v0.6v0.6.2Patch release v0.6.2Patch release v0.6.2
Description
Context
Two independent sources converge on the same architecture for next-gen agent memory retrieval:
- Agentic RAG with Hybrid Search (TDS) -- Dense + BM25 sparse retrieval with RRF fusion and agentic query reformulation
- NVIDIA NeMo Retriever -- ReACT loop with think/retrieve/final_results triad, RRF fallback. Add design specification and project setup #1 on ViDoRe v3.
RRF rank fusion is already noted as adopted in the research log. Qdrant natively supports sparse vectors + RRF.
Action Items
- Implement BM25/sparse search alongside dense retrieval in Mem0/Qdrant config
- Wire RRF fusion for merging retrieval rounds
- Add agentic query reformulation (query rewriting, sufficiency checking) for TOOL_BASED injection strategy
- Evaluate whether memory hot path should bypass MCP bridge (in-process retriever for latency)
References
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
prio:highImportant, should be prioritizedImportant, should be prioritizedscope:large3+ days of work3+ days of workspec:memoryDESIGN_SPEC Section 7 - Memory & PersistenceDESIGN_SPEC Section 7 - Memory & Persistencetype:featureNew feature implementationNew feature implementationv0.6Minor version v0.6Minor version v0.6v0.6.2Patch release v0.6.2Patch release v0.6.2