Fix test_dictionaries_update_and_reload::test_reload_after_fail_by_timer flakiness by azat · Pull Request #67754 · ClickHouse/ClickHouse

azat · 2024-08-04T14:32:12Z

Changelog category (leave one):

Not for changelog (changelog entry is not required)

…mer flakiness Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

robot-ch-test-poll2 · 2024-08-04T15:35:41Z

This is an automated comment for commit e6921bf with description of existing statuses. It's updated for the latest CI running

❌ Click here to open a full report in a separate page

Check name	Description	Status
Flaky tests	Checks if new added or modified tests are flaky by running them repeatedly, in parallel, with more randomization. Functional tests are run 100 times with address sanitizer, and additional randomization of thread scheduling. Integration tests are run up to 10 times. If at least once a new test has failed, or was too long, this check will be red. We don't allow flaky tests, read the doc	❌ failure

Successful checks

Check name	Description	Status
Builds	There's no description for the check yet, please add it to tests/ci/ci_config.py:CHECK_DESCRIPTIONS	✅ success
Docs check	Builds and tests the documentation	✅ success
Fast test	Normally this is the first check that is ran for a PR. It builds ClickHouse and runs most of stateless functional tests, omitting some. If it fails, further checks are not started until it is fixed. Look at the report to see which tests fail, then reproduce the failure locally as described here	✅ success
Integration tests	The integration tests report. In parenthesis the package type is given, and in square brackets are the optional part/total tests	✅ success
Style check	Runs a set of checks to keep the code style clean. If some of tests failed, see the related log from the report	✅ success
Unit tests	Runs the unit tests for different release types	✅ success
Upgrade check	Runs stress tests on server version from last release and then tries to upgrade it to the version from the PR. It checks if the new server can successfully startup without any errors, crashes or sanitizer asserts	✅ success

fm4v · 2024-08-04T16:30:03Z

I don't like this change because now we expect both statuses ["FAILED", "FAILED_AND_RELOADING"] even in release build. Original fix expect only "FAIL" status and in sanitizer builds "FAILED_AND_RELOADING" is expected for some time which is always changed to "FAILED".
I think it's better to increase retry_count and add same retries for second assert.

Anyway this code is wrong, since you still can get FAILED_AND_RELOADING two lines below, there should be else at least.

Original changes fixes problem in first assert. I didn't see fix for the second assert in this PR too.

According to fails for the last 90 days:
3 fails in second asserts:
1 2 3
2 fails in first assert:
1 2

Also flakiness rate is decreased after first fix and why it happens?

azat · 2024-08-04T19:47:12Z

I don't like this change because now we expect both statuses ["FAILED", "FAILED_AND_RELOADING"] even in release build

@fm4v

Why it is possible to have FAILED_AND_RELOADING under sanitizers but not release build? - my guess is that the sanitiazers builds are just slow (see comments), if so, then it may fail under release as well, and it is just a matter of time when it happens...
Why allowing FAILED_AND_RELOADING is bad? - personally I don't see any issue with this

azat · 2024-08-04T19:57:03Z

This test is not idempotent and should be rewritten completely

azat · 2024-08-04T20:06:09Z

@fm4v help is welcome

fm4v · 2024-08-05T08:14:09Z

Close in favor of #67793

qoega · 2024-08-05T12:36:04Z

This test is not idempotent and should be rewritten completely

+1 to this. Trivial ways to fix is to either drop all dictionaries in the end of a test or to have unique dictionary name in every test run. If we rely on logs, make it dependent on specific test run

Fix test_dictionaries_update_and_reload::test_reload_after_fail_by_ti…

e6921bf

…mer flakiness Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>

This was referenced Aug 4, 2024

Fix flaky tests 2 (stateless, integration) #61869

Merged

Fix possible data-race StorageKafka with statistics_interval_ms>0 #66311

Merged

azat added the 🍃 green ci 🌿 Fixing flaky tests in CI label Aug 4, 2024

robot-ch-test-poll added the pr-not-for-changelog This PR should not be mentioned in the changelog label Aug 4, 2024

alexey-milovidov approved these changes Aug 4, 2024

View reviewed changes

alexey-milovidov self-assigned this Aug 4, 2024

fm4v self-requested a review August 4, 2024 16:30

fm4v closed this Aug 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix test_dictionaries_update_and_reload::test_reload_after_fail_by_timer flakiness#67754

Fix test_dictionaries_update_and_reload::test_reload_after_fail_by_timer flakiness#67754
azat wants to merge 1 commit intoClickHouse:masterfrom
azat:tests/test_dictionaries_update_and_reload

azat commented Aug 4, 2024

Uh oh!

robot-ch-test-poll2 commented Aug 4, 2024 •

edited by robot-ch-test-poll3

Loading

Uh oh!

fm4v commented Aug 4, 2024

Uh oh!

azat commented Aug 4, 2024 •

edited

Loading

Uh oh!

azat commented Aug 4, 2024

Uh oh!

azat commented Aug 4, 2024

Uh oh!

fm4v commented Aug 5, 2024

Uh oh!

qoega commented Aug 5, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

azat commented Aug 4, 2024

Changelog category (leave one):

Uh oh!

robot-ch-test-poll2 commented Aug 4, 2024 • edited by robot-ch-test-poll3 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fm4v commented Aug 4, 2024

Uh oh!

azat commented Aug 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

azat commented Aug 4, 2024

Uh oh!

azat commented Aug 4, 2024

Uh oh!

fm4v commented Aug 5, 2024

Uh oh!

qoega commented Aug 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

robot-ch-test-poll2 commented Aug 4, 2024 •

edited by robot-ch-test-poll3

Loading

azat commented Aug 4, 2024 •

edited

Loading

qoega commented Aug 5, 2024 •

edited

Loading