Restructure usage examples#3385
Conversation
There was a problem hiding this comment.
Pull Request Overview
This PR adds documentation for implementing two-stage reranking in MTEB, explaining how to combine initial retrieval with subsequent reranking for improved search quality.
Key Changes:
- New documentation file explaining the two-stage reranking approach
- Code examples demonstrating the workflow: initial retrieval with an encoder model followed by reranking with a cross-encoder model
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Added comment for clarity on model usage.
Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
KennethEnevoldsen
left a comment
There was a problem hiding this comment.
Isn't this basically what is already here:
(We could restructure that section)
|
I didn't look though this doc. I think we can split it into multiple section for visibility |
|
Yeah agree. We could add an "advanced usage" section below "usage" with subheadings like "Late-interaction" and "Two-stage retrieval"? |
|
There was a problem hiding this comment.
Pull Request Overview
Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
| # stage 2: reranking | ||
| # if model implemented in mteb it's better to use `mteb.get_model` | ||
| # cross_encoder = mteb.get_model("jinaai/jina-reranker-v2-base-multilingual") | ||
| # or if models wasn't implemented you can pass CrossEncoder directly |
There was a problem hiding this comment.
Corrected grammar: 'wasn't' should be 'isn't' for proper present tense.
| # or if models wasn't implemented you can pass CrossEncoder directly | |
| # or if models aren't implemented you can pass CrossEncoder directly |
KennethEnevoldsen
left a comment
There was a problem hiding this comment.
ahh CLI alternatives are great - we could probably redo some of the CLI documentation to be more integrated with the rest
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Cached embeddingsandTwo stage rerankingto the advanced usage section