feat: hybrid search (dense + BM25 sparse) for memory retrieval pipeline

## Context

Two independent sources converge on the same architecture for next-gen agent memory retrieval:

1. **[Agentic RAG with Hybrid Search (TDS)](https://towardsdatascience.com/how-to-build-agentic-rag-with-hybrid-search/)** -- Dense + BM25 sparse retrieval with RRF fusion and agentic query reformulation
2. **[NVIDIA NeMo Retriever](https://huggingface.co/blog/nvidia/nemo-retriever-agentic-retrieval)** -- ReACT loop with think/retrieve/final_results triad, RRF fallback. #1 on ViDoRe v3.

RRF rank fusion is already noted as adopted in the research log. Qdrant natively supports sparse vectors + RRF.

## Action Items

- [ ] Implement BM25/sparse search alongside dense retrieval in Mem0/Qdrant config
- [ ] Wire RRF fusion for merging retrieval rounds
- [ ] Add agentic query reformulation (query rewriting, sufficiency checking) for TOOL_BASED injection strategy
- [ ] Evaluate whether memory hot path should bypass MCP bridge (in-process retriever for latency)

## References

- [TDS article](https://towardsdatascience.com/how-to-build-agentic-rag-with-hybrid-search/)
- [NVIDIA NeMo blog](https://huggingface.co/blog/nvidia/nemo-retriever-agentic-retrieval)
- Research log entries #25, #26 (2026-03-14)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: hybrid search (dense + BM25 sparse) for memory retrieval pipeline #694

Context

Action Items

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

feat: hybrid search (dense + BM25 sparse) for memory retrieval pipeline #694

Description

Context

Action Items

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions