Conversation
Patch `tonic` and `hyper` crates to expose `max_local_error_reset_streams`, and *disable* it when creating internal gRPC connections
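For reference, a minimal sketch of the knob itself, assuming the patched builders forward to the underlying h2 setting (`h2::server::Builder::max_local_error_reset_streams`); the actual patched tonic/hyper surface may differ. Passing `None` removes the limit:

```rust
// Sketch only: the real patched tonic/hyper API may look different.
// This shows the underlying h2 setting that the patch exposes.
fn main() {
    let mut builder = h2::server::Builder::new();
    // `None` disables the cap on streams reset due to local errors; this
    // is acceptable here because both ends of the connection are
    // cluster-internal and managed by us.
    builder.max_local_error_reset_streams(None);
}
```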
Some more context for the future 🦾 🤖 🦿 if we hit this again: we hit a security feature here that helps us clean up borked connections. Since this is for internal cluster communication and we manage both sides of the connection, we can disable this security feature, which is what this PR takes care of. We mainly hit this because we drop connections before waiting on the result. An error is counted when we drop a connection before we receive its response, and eventually we hit the limit. This happens a lot when fanning out reads, which races against the local replica. Forcefully dropping these connections once we get a result is much easier and fits our implementation better than waiting for all responses and dropping connections gracefully.
How can this fix be validated? :)
```shell
bfb \
  --uri $QDRANT_HOST \
  -n 10M \
  -d 512 \
  --skip-setup \
  --search \
  --keywords 5000 \
  --rps 300
```

@generall used this collection setup (not sure if critical, though; search is what repros the bug):

```shell
bfb \
  --uri $QDRANT_HOST \
  -n 10M \
  -d 512 \
  --shards 9 \
  --replication-factor 2 \
  --on-disk-vectors true \
  --keywords 5000 \
  --hnsw-m 0 \
  --hnsw-payload-m 16 \
  --tenants true \
  -b 10 \
  --timeout 60 \
  --rps 100
```
|
An important detail is the fan-out factor.

Repro'd with the default setup, without any explicit fan-out factor setting, on my machine. Behavior with and without the fix is different:
It seems to fix the problem on my repro setup.
Patch `tonic` and `hyper` crates to expose `max_local_error_reset_streams`, and disable it when creating internal gRPC connections.

All Submissions:

- Contributions should target the `dev` branch. Did you create your branch from `dev`?

New Feature Submissions:

- Has your code been formatted with the `cargo +nightly fmt --all` command prior to submission?
- Have you checked your code using the `cargo clippy --workspace --all-features` command?

Changes to Core Features: