Make reads sequential in BatchedVectorReader #6508

Merged
generall merged 2 commits into batch-reader-for-segment-construction from sequential-batch-reader-seg-construction
May 9, 2025

JojiiOfficial (Contributor) commented May 8, 2025

Depends on #6487

Speeds up segment optimization by making vector reads sequential when points are copied from several source segments into a new one.
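The general idea can be illustrated with a minimal sketch. This is not the code from this PR: `VectorAddr`, `VectorSource`, `InMemorySource`, and `read_batch_sequential` are hypothetical names invented for the illustration, and the sketch does not reproduce the `get_many_sequential` method mentioned in the commit message. It assumes a batch of points that refer to vectors in several source storages; the reads are grouped per storage and sorted by offset so each storage is scanned in ascending order, and the results are returned in the order the caller asked for.

```rust
use std::cell::RefCell;
use std::collections::BTreeMap;

/// Hypothetical address of one vector: the source segment it lives in and
/// its offset (index) inside that segment's vector storage.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct VectorAddr {
    pub segment: usize,
    pub offset: usize,
}

/// Hypothetical stand-in for an on-disk vector storage.
pub trait VectorSource {
    fn read(&self, offset: usize) -> Vec<f32>;
}

/// In-memory source that records the order in which offsets are read,
/// so the access pattern can be inspected.
pub struct InMemorySource {
    vectors: Vec<Vec<f32>>,
    log: RefCell<Vec<usize>>,
}

impl InMemorySource {
    pub fn new(vectors: Vec<Vec<f32>>) -> Self {
        Self { vectors, log: RefCell::new(Vec::new()) }
    }

    /// Offsets in the order they were read.
    pub fn access_log(&self) -> Vec<usize> {
        self.log.borrow().clone()
    }
}

impl VectorSource for InMemorySource {
    fn read(&self, offset: usize) -> Vec<f32> {
        self.log.borrow_mut().push(offset);
        self.vectors[offset].clone()
    }
}

/// Reads every vector in `batch`, visiting each source once with ascending
/// offsets, and returns the vectors in the caller's original order.
pub fn read_batch_sequential<S: VectorSource>(
    sources: &[S],
    batch: &[VectorAddr],
) -> Vec<Vec<f32>> {
    // Group (offset, position in batch) pairs by source segment.
    // BTreeMap iterates segments in ascending order.
    let mut by_segment: BTreeMap<usize, Vec<(usize, usize)>> = BTreeMap::new();
    for (pos, addr) in batch.iter().enumerate() {
        by_segment.entry(addr.segment).or_default().push((addr.offset, pos));
    }

    let mut out: Vec<Vec<f32>> = vec![Vec::new(); batch.len()];
    for (segment, mut reads) in by_segment {
        // Ascending offsets turn scattered reads into a forward scan.
        reads.sort_unstable();
        for (offset, pos) in reads {
            out[pos] = sources[segment].read(offset);
        }
    }
    out
}
```

On storage with high seek or round-trip cost, such as a network disk or an HDD, a forward scan like this tends to be much cheaper than jumping between offsets, which is consistent with the benchmark numbers below.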

Benchmark:

On the same machine with storage on a Hetzner Network Disk (5M points, 20=>10 segments, vectors on disk)
Dev: 200s
PR: 3s

SSH-FS mounted Hetzner Network Disk (100k points, 10=>5 segments, vectors on disk)
Dev: ~450s
PR: 240s

generall (Member) commented May 9, 2025

Confirming improvement on my HDD:

Dev: 2025-05-09T09:46:08.203869Z - 2025-05-09T09:45:23.945362Z = 00:00:44.258507

PR: 2025-05-09T09:57:06.075739Z - 2025-05-09T09:56:36.425863Z = 00:00:29.649876

generall merged commit 19e5758 into batch-reader-for-segment-construction May 9, 2025
13 checks passed
generall deleted the sequential-batch-reader-seg-construction branch May 9, 2025 11:02
generall added a commit that referenced this pull request May 9, 2025
* naive implementation of batched reader for segment construction

* Make reads sequential in BatchedVectorReader (#6508)

* Add sequential reads and improve BatchedVectorReader

* implement get_many_sequential

---------

Co-authored-by: generall <andrey@vasnetsov.com>

* implement get_many_sequential

---------

Co-authored-by: Jojii <15957865+JojiiOfficial@users.noreply.github.com>
generall added a commit that referenced this pull request May 22, 2025