Disable flaky defrag tests affecting daily run #12672

hpatro · 2023-10-18T19:01:09Z

Temporarily disabling few of the defrag tests in cluster mode to make the daily run stable:

Active defrag eval scripts
Active defrag big keys
Active defrag big list
Active defrag edge case

Few scenarios observed and I'm investigating:

defrag not started - Either the defrag scan got completed real quick or never started.
Actual fragmentation is lower than the expected fragmentation - Might require us to lower the fragmentation expectation or have separate threshold for cluster mode.
Max latency exceeds from the current set threshold.
defrag didn't stop - Sometimes allocator_frag_ratio reaches 1.06 and the test fails. Expectation is 1.05.

Failure run: https://github.com/redis/redis/actions/runs/6527277037/job/17721912648

oranagra · 2023-10-19T06:36:51Z

Trueth be told, when you added the loop to run all the Defrag tests twice, I thought it was excessive, but I think the one about big keys, or big list is actually needed (we had some concerns about the DefragLater list in cluster mode)

hpatro · 2023-10-19T16:27:48Z

Trueth be told, when you added the loop to run all the Defrag tests twice, I thought it was excessive, but I think the one about big keys, or big list is actually needed (we had some concerns about the DefragLater list in cluster mode)

Wasn't expecting this amount of flakiness at each validation level. Seems the tests were particularly crafted for standalone setup. I'm trying to fix them here #12674

oranagra · 2023-10-19T18:10:45Z

I don't understand the changes of the other PR, i guess it's still WIP.
since it takes time and the diff here is trivial to revert, i'll merge it just to silence the errors for now.

hpatro · 2023-10-19T22:37:53Z

I don't understand the changes of the other PR, i guess it's still WIP. since it takes time and the diff here is trivial to revert, i'll merge it just to silence the errors for now.

@oranagra I've explained the issue with the current logic for defrag not started failure in this comment #12674 (comment). PTAL.

Fixing issues described in #12672, started after #11695 Related to #12674 Fixes the `defrag didn't stop' issue. In some cases of how the keys were stored in memory defrag_later_item_in_progress was not getting reset once we finish defragging the later items and we move to the next slot. This stopped the scan to happen in the later slots and did not get

Reverts the skipping defrag tests in cluster mode (done in #12672. instead it skips only some defrag tests that are relevant for cluster modes. The test now run well after investigating and making the changes in #12674 and #12694. Co-authored-by: Oran Agra <oran@redislabs.com>

Fixing issues described in redis#12672, started after redis#11695 Related to redis#12674 Fixes the `defrag didn't stop' issue. In some cases of how the keys were stored in memory defrag_later_item_in_progress was not getting reset once we finish defragging the later items and we move to the next slot. This stopped the scan to happen in the later slots and did not get

Disable flaky defrag tests affecting daily run

0f259cd

This was referenced Oct 18, 2023

Fix resize hash table dictionary iterator #12660

Merged

Fix defrag test #12674

Merged

oranagra approved these changes Oct 19, 2023

View reviewed changes

oranagra merged commit becd50d into redis:unstable Oct 19, 2023

roshkhatri mentioned this pull request Oct 25, 2023

Reset later item flag after defrag later is done #12694

Merged

roshkhatri mentioned this pull request Oct 31, 2023

re-enable defrag tests in cluster mode #12710

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Disable flaky defrag tests affecting daily run #12672

Disable flaky defrag tests affecting daily run #12672

Uh oh!

hpatro commented Oct 18, 2023 •

edited

Loading

Uh oh!

oranagra commented Oct 19, 2023

Uh oh!

hpatro commented Oct 19, 2023

Uh oh!

oranagra commented Oct 19, 2023

Uh oh!

hpatro commented Oct 19, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Disable flaky defrag tests affecting daily run #12672

Disable flaky defrag tests affecting daily run #12672

Uh oh!

Conversation

hpatro commented Oct 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oranagra commented Oct 19, 2023

Uh oh!

hpatro commented Oct 19, 2023

Uh oh!

oranagra commented Oct 19, 2023

Uh oh!

hpatro commented Oct 19, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hpatro commented Oct 18, 2023 •

edited

Loading