Fix hang when using --reconnect-interval with --rate-limiting by fcostaoliveira · Pull Request #348 · redis/memtier_benchmark

fcostaoliveira · 2026-02-26T14:13:13Z

Summary

Fix memtier_benchmark hanging indefinitely when --reconnect-interval and --rate-limiting are used together (regression introduced in 2.2.1 via Issue #285, Fix rate-limiting with cluster-mode #286)
The rate-limiting timer was only created on the first connection. After --reconnect-interval triggered a disconnect/reconnect cycle, disconnect() properly freed the timer but handle_event() never recreated it, leaving m_request_per_cur_interval permanently at 0 and causing fill_pipeline() to return without sending any requests
Moved timer creation outside the first-connection guard so it is recreated on every successful connect/reconnect when m_event_timer is NULL
Added test_reconnect_interval_with_rate_limiting integration test matching the exact failing scenario (--reconnect-interval=1 --rate-limiting=1)

Test plan

New test test_reconnect_interval_with_rate_limiting passes (would hang forever without fix)
Existing test_short_reconnect_interval still passes (no regression)
CI passes

🤖 Generated with Claude Code

Note

Medium Risk
Touches core connection/event-loop behavior and rate limiting, so regressions could affect request pacing or reconnect stability. Change is small and covered by a new integration test that previously would hang.

Overview
Prevents memtier_benchmark from stalling after a reconnect when rate limiting is enabled by recreating the request-rate timer on successful (re)connect whenever m_event_timer is NULL (instead of only on the first connection).

Adds test_reconnect_interval_with_rate_limiting to exercise --reconnect-interval=1 --rate-limiting=1 and assert the run completes and produces non-zero ops in mb.json.

^{Written by Cursor Bugbot for commit d4e3296. This will update automatically on new commits. Configure here.}

The rate-limiting timer was only created on the first connection (when get_reqs_processed() == 0). After a reconnect triggered by --reconnect-interval, disconnect() properly freed the timer, but handle_event() never recreated it because requests had already been processed. This left m_request_per_cur_interval permanently at 0, causing fill_pipeline() to return immediately on every call. Move timer creation outside the first-connection guard so it is recreated on every successful connect/reconnect when m_event_timer is NULL. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fcostaoliveira · 2026-02-26T14:13:21Z

cursor review

cursor

✅ Bugbot reviewed your changes and found no new issues!

Comment @cursor review or bugbot run to trigger another review on this PR

jit-ci · 2026-02-26T14:19:11Z

❌ Jit Scanner failed - Our team is investigating

Jit Scanner failed - Our team has been notified and is working to resolve the issue. Please contact support if you have any questions.

💡 Need to bypass this check? Comment @sera bypass to override.

jit-ci · 2026-02-26T15:24:16Z

🛡️ Jit Security Scan Results

✅ No security findings were detected in this PR

^{Security scan by Jit}

The rate-limiting timer was only created on the first connection (when get_reqs_processed() == 0). After a reconnect triggered by --reconnect-interval, disconnect() properly freed the timer, but handle_event() never recreated it because requests had already been processed. This left m_request_per_cur_interval permanently at 0, causing fill_pipeline() to return immediately on every call. Move timer creation outside the first-connection guard so it is recreated on every successful connect/reconnect when m_event_timer is NULL. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

* Add clang-format for code style enforcement (#336) Cherry-picked and re-applied clang-format configuration, CI workflow, Makefile targets, and DEVELOPMENT.md docs from master. Source files reformatted against the 2.2.x branch codebase. * Concurrent ubuntu test jobs for faster CI (#337) * Concurrent ubuntu test jobs for faster CI * Prevent duplicate workflow runs on push+PR by filtering branches * Fixed coverage workflow * Concurrent ASAN, TSAN, and UBSAN test jobs using matrix strategy * Updated org from redislabs to redis (#339) * Fix rate-limit induced hanging at test completion (#340) * Fix rate-limit induced hanging at test completion * include a null check for conns in all_connections_idle * Remove crufty ctx reference * Extend CI with higher shard count scenario. * Extra logging on 99 shards scenario * Verbose was already defined on CI. Using 49 shards to expedite CI * Using RLTEST_DEBUG to avoid overriding old behaviour * Shard count 99 in rate-limiting test --------- Co-authored-by: fcostaoliveira <filipe@redis.com> * configure: Respect user-supplied CXXFLAGS (#342) * Add AGENTS.md and CLAUDE.md for AI assistant guidelines (#343) Add documentation to help AI assistants work effectively with the memtier_benchmark codebase, following the https://agents.md/ conventions. AGENTS.md includes: - Project overview and repository structure - Build system and commands (autotools) - Code style (clang-format) - Testing with RLTest (standalone, cluster, TLS, sanitizers) - Key technical details - Common development tasks - Debugging guide (GDB, crash handler, core dumps, sanitizers) - License header requirements CLAUDE.md points to AGENTS.md for shared guidelines. References: - https://agents.md/ - Standard for AI agent documentation - https://docs.anthropic.com/en/docs/agents - Anthropic agent guidelines * Use latest rltest and set cluster-start-timeout to accomodate large shard count (#345) * CI: trigger workflows on semver release branches Add branch pattern '[0-9]+.[0-9]+' to push/pull_request triggers for ci, code-style, asan, tsan, and ubsan workflows so CI runs on PRs targeting release branches like 2.2. * Fix hang when using --reconnect-interval with --rate-limiting (#348) The rate-limiting timer was only created on the first connection (when get_reqs_processed() == 0). After a reconnect triggered by --reconnect-interval, disconnect() properly freed the timer, but handle_event() never recreated it because requests had already been processed. This left m_request_per_cur_interval permanently at 0, causing fill_pipeline() to return immediately on every call. Move timer creation outside the first-connection guard so it is recreated on every successful connect/reconnect when m_event_timer is NULL. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> * Bumping version to 2.2.2 --------- Co-authored-by: Tristan Schneiter <tschneiter@figma.com> Co-authored-by: LINKIWI <LINKIWI@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

fcostaoliveira requested a review from paulorsousa February 26, 2026 14:13

cursor bot reviewed Feb 26, 2026

View reviewed changes

paulorsousa approved these changes Feb 26, 2026

View reviewed changes

fcostaoliveira merged commit 47a1e02 into master Feb 26, 2026
76 of 78 checks passed

fcostaoliveira mentioned this pull request Feb 26, 2026

Prepare for 2.2.2 version. #349

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix hang when using --reconnect-interval with --rate-limiting#348

Fix hang when using --reconnect-interval with --rate-limiting#348
fcostaoliveira merged 1 commit intomasterfrom
reconnect.fix

fcostaoliveira commented Feb 26, 2026 •

edited

Loading

Uh oh!

fcostaoliveira commented Feb 26, 2026

Uh oh!

cursor bot left a comment

Uh oh!

jit-ci bot commented Feb 26, 2026

Uh oh!

jit-ci bot commented Feb 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

fcostaoliveira commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

fcostaoliveira commented Feb 26, 2026

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

jit-ci bot commented Feb 26, 2026

❌ Jit Scanner failed - Our team is investigating

Uh oh!

jit-ci bot commented Feb 26, 2026

🛡️ Jit Security Scan Results

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fcostaoliveira commented Feb 26, 2026 •

edited

Loading