Fix checkPrefixCollisionsOrReply returning non-zero on self-overlap by beebs-systap · Pull Request #3583 · valkey-io/valkey

beebs-systap · 2026-04-28T21:57:24Z

The self-overlap branch in checkPrefixCollisionsOrReply() returns the
outer loop index i instead of 0. When i > 0, the truthy return causes
the caller in networking.c to fall through to enableTracking() and
addReply(c, shared.ok), producing a double reply (protocol violation)
and registering overlapping prefixes.

Change return i to return 0 to match the function's documented boolean
contract and the existing-prefix collision check above it.

Fixes #3582

The self-overlap branch in checkPrefixCollisionsOrReply() returns the outer loop index i instead of 0. When i > 0, the truthy return causes the caller in networking.c to fall through to enableTracking() and addReply(c, shared.ok), producing a double reply (protocol violation) and registering overlapping prefixes. Change return i to return 0 to match the function's documented boolean contract and the existing-prefix collision check above it. Fixes valkey-io#3582 Signed-off-by: Brad Bebee <beebs@amazon.com>

addReplyErrorFormat adds the client to server.clients_pending_write via prepareClientToWrite. When freeTrackingTestClient frees the client without unlinking it first, TearDown calls listRelease on the pending write list which dereferences the freed client memory. Initialize the embedded list node in createTrackingTestClient and unlink the client from clients_pending_write in freeTrackingTestClient before freeing, matching the pattern in networking.c freeClient. Signed-off-by: Brad Bebee <beebs@amazon.com>

Set reply_off on the test client so prepareClientToWrite returns early and the client is never added to server.clients_pending_write. This avoids the heap-use-after-free where freeTrackingTestClient freed the client struct containing an embedded listNode, and TearDown then called listRelease which dereferenced the freed node. Removes the clients_pending_write list from SetUp/TearDown and the manual unlink logic from freeTrackingTestClient since they are no longer needed. Signed-off-by: Brad Bebee <beebs@amazon.com>

murphyjacob4

Good catch!

murphyjacob4 · 2026-04-29T03:12:43Z

+    }
+
+    void TearDown() override {
+        raxFree(server.errors);


Think you need raxFreeWithCallback(server.errors, zfree); for ASAN

codecov · 2026-04-29T03:33:04Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 76.70%. Comparing base (8091c6c) to head (8406041).
⚠️ Report is 17 commits behind head on unstable.

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable    #3583      +/-   ##
============================================
+ Coverage     76.42%   76.70%   +0.28%     
============================================
  Files           159      162       +3     
  Lines         80113    80612     +499     
============================================
+ Hits          61225    61835     +610     
+ Misses        18888    18777     -111

Files with missing lines	Coverage Δ
src/tracking.c	`99.34% <100.00%> (ø)`

... and 38 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Use raxFreeWithCallback(server.errors, zfree) instead of raxFree() so that error stat structs allocated by afterErrorReply are properly freed. Addresses review feedback from murphyjacob4. Signed-off-by: Brad Bebee <beebs@amazon.com>

rainsupreme

LGTM! Nice set of UTs too 😁

madolson · 2026-04-29T23:41:32Z

These tests are good, but now there is quite a bit of overlap with the TCL testing we are doing in unit/tracking. I think in this instance there is quite a bit of whitebox testing, accessing local variables, so I think extending the integration tests is probably the better long term maintainable here. We probably need to add better guidance on when to write unit vs integration.

Specifically, I think we only need these three new end tests to cover the existing main permutations:

test {BCAST self-collision at later index is rejected} { set r [valkey_client] catch {$r CLIENT TRACKING ON BCAST PREFIX aaa PREFIX bbb PREFIX bbbc} output assert_match {ERR*Prefix*overlaps*} $output $r close } test {BCAST identical prefix at later index is rejected} { set r [valkey_client] catch {$r CLIENT TRACKING ON BCAST PREFIX xxx PREFIX yyy PREFIX yyy} output assert_match {ERR*Prefix*overlaps*} $output $r close } test {BCAST empty prefix collides with any prefix} { set r [valkey_client] catch {$r CLIENT TRACKING ON BCAST PREFIX {} PREFIX foo} output assert_match {ERR*Prefix*overlaps*} $output $r close }

@madolson Do you want me to update to remove the other tests? And/or make them tcl integration tests?

I think we can remove them. We have good coverage of them on TCL already, and it's a lot code that's effectively redundant. I don't want to discourage unit tests, but in this case it seems unecessary.

Replace the C++ unit tests in src/unit/test_tracking.cpp with 3 TCL integration tests in tests/unit/tracking.tcl that verify both the error reply and that tracking remains disabled after a prefix collision at index > 0. Fixes valkey-io#3582 Signed-off-by: Brad Bebee <beebs@amazon.com>

…3583)

github-actions Bot assigned beebs-systap Apr 28, 2026

beebs-systap added 3 commits April 28, 2026 22:20

murphyjacob4 approved these changes Apr 29, 2026

View reviewed changes

murphyjacob4 reviewed Apr 29, 2026

View reviewed changes

rainsupreme approved these changes Apr 29, 2026

View reviewed changes

madolson reviewed Apr 29, 2026

View reviewed changes

sarthakaggarwal97 mentioned this pull request May 3, 2026

[Shadow] Fix checkPrefixCollisionsOrReply returning non-zero on self-overlap sarthakaggarwal97/valkey#140

Closed

madolson approved these changes May 3, 2026

View reviewed changes

madolson merged commit 8891441 into valkey-io:unstable May 3, 2026
75 of 78 checks passed

lucasyonge pushed a commit that referenced this pull request May 11, 2026

Fix checkPrefixCollisionsOrReply returning non-zero on self-overlap (#…

6a3121b

…3583)

lucasyonge pushed a commit that referenced this pull request May 12, 2026

Fix checkPrefixCollisionsOrReply returning non-zero on self-overlap (#…

d6ae863

…3583)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix checkPrefixCollisionsOrReply returning non-zero on self-overlap#3583

Fix checkPrefixCollisionsOrReply returning non-zero on self-overlap#3583
madolson merged 6 commits into
valkey-io:unstablefrom
beebs-systap:issue_3582_prefix-collison

beebs-systap commented Apr 28, 2026

Uh oh!

murphyjacob4 left a comment

Uh oh!

murphyjacob4 Apr 29, 2026

Uh oh!

codecov Bot commented Apr 29, 2026 •

edited

Loading

Uh oh!

rainsupreme left a comment

Uh oh!

madolson Apr 29, 2026

Uh oh!

madolson Apr 30, 2026

Uh oh!

beebs-systap Apr 30, 2026

Uh oh!

madolson May 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

beebs-systap commented Apr 28, 2026

Uh oh!

murphyjacob4 left a comment

Choose a reason for hiding this comment

Uh oh!

murphyjacob4 Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

rainsupreme left a comment

Choose a reason for hiding this comment

Uh oh!

madolson Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

madolson Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

beebs-systap Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

madolson May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov Bot commented Apr 29, 2026 •

edited

Loading