Skip to content

Fix Flaky Test SpecificClusterManagerNodesIT.testElectOnlyBetweenClusterManagerNodes#16021

Merged
andrross merged 1 commit intoopensearch-project:mainfrom
kkewwei:fix_16015
Feb 5, 2025
Merged

Fix Flaky Test SpecificClusterManagerNodesIT.testElectOnlyBetweenClusterManagerNodes#16021
andrross merged 1 commit intoopensearch-project:mainfrom
kkewwei:fix_16015

Conversation

@kkewwei
Copy link
Copy Markdown
Contributor

@kkewwei kkewwei commented Sep 20, 2024

Description

The case is as follows:

  1. When the node_t1 is excluded from the vote config, and the cluster starts a new leader election, but the the node node_t2 hasn't been elected as the new leader.

  2. At the moment, we send request to get the ClusterManager, we first get ClusterManager name, and leads to the NullPointerException.

internalCluster().nonClusterManagerClient()-> ......->getClusterManagerName()

return client.admin().cluster().prepareState().get().getState().nodes().getClusterManagerNode().getName();

Related Issues

Resolves #15944 #16015

Check List

  • Functionality includes testing.
  • API changes companion pull request created, if applicable.
  • Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@github-actions
Copy link
Copy Markdown
Contributor

❕ Gradle check result for 1b3920b: UNSTABLE

  • TEST FAILURES:
      1 org.opensearch.cluster.MinimumClusterManagerNodesIT.testThreeNodesNoClusterManagerBlock

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

@msfroh
Copy link
Copy Markdown
Contributor

msfroh commented Sep 20, 2024

Checking the history of this test, has this been flaky for more than a year?

The only code related to cluster manager election that I've been able to find changed more recently than ~5 years ago (besides renaming) is the introduction of DecommisionService in 2023.

@github-actions
Copy link
Copy Markdown
Contributor

❕ Gradle check result for 40548ec: UNSTABLE

  • TEST FAILURES:
      1 org.opensearch.test.rest.ClientYamlTestSuiteIT.test {p0=search.aggregation/20_terms/string profiler via global ordinals}

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

@kkewwei
Copy link
Copy Markdown
Contributor Author

kkewwei commented Jan 21, 2025

@rajiv-kv, would you please help merge the pull request at your convenience.

@shwetathareja
Copy link
Copy Markdown
Member

Minor comment, thanks for picking it up @kkewwei

…terManagerNodes

Signed-off-by: kkewwei <kewei.11@bytedance.com>
Signed-off-by: kkewwei <kkewwei@163.com>
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Feb 5, 2025

❕ Gradle check result for 366bb52: UNSTABLE

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

@kkewwei
Copy link
Copy Markdown
Contributor Author

kkewwei commented Feb 5, 2025

❕ Gradle check result for 366bb52: UNSTABLE

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

org.opensearch.index.ShardIndexingPressureSettingsIT.testShardIndexingPressureEnforcedEnabledDisabledSetting #14331

@kkewwei
Copy link
Copy Markdown
Contributor Author

kkewwei commented Feb 5, 2025

@shwetathareja @andrross please have a look when you are free.

@andrross andrross merged commit 852011a into opensearch-project:main Feb 5, 2025
opensearch-trigger-bot bot pushed a commit that referenced this pull request Feb 5, 2025
…terManagerNodes (#16021)

Signed-off-by: kkewwei <kewei.11@bytedance.com>
Signed-off-by: kkewwei <kkewwei@163.com>
(cherry picked from commit 852011a)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
@kkewwei kkewwei deleted the fix_16015 branch February 6, 2025 08:32
andrross pushed a commit that referenced this pull request Feb 7, 2025
…terManagerNodes (#16021) (#17265)

(cherry picked from commit 852011a)

Signed-off-by: kkewwei <kewei.11@bytedance.com>
Signed-off-by: kkewwei <kkewwei@163.com>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
opensearch-trigger-bot bot pushed a commit that referenced this pull request May 16, 2025
…terManagerNodes (#16021)

Signed-off-by: kkewwei <kewei.11@bytedance.com>
Signed-off-by: kkewwei <kkewwei@163.com>
(cherry picked from commit 852011a)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
andrross pushed a commit that referenced this pull request May 16, 2025
…terManagerNodes (#16021) (#18320)

(cherry picked from commit 852011a)

Signed-off-by: kkewwei <kewei.11@bytedance.com>
Signed-off-by: kkewwei <kkewwei@163.com>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

autocut backport 2.x Backport to 2.x branch backport 2.19 bug Something isn't working Cluster Manager flaky-test Random test failure that succeeds on second run skip-changelog >test-failure Test failure from CI, local build, etc.

Projects

Status: ✅ Done

Development

Successfully merging this pull request may close these issues.

[AUTOCUT] Gradle Check Flaky Test Report for SpecificClusterManagerNodesIT

7 participants