drive_group: fix limit filter in drive_selection.selector#49969

Merged
guits merged 1 commit into ceph:main from guits:fix-drive-group-limit on Mar 3, 2023

guits commented Feb 2, 2023

When multiple OSD service specs with a 'limit' filter are applied, the current logic makes the second service spec
try to pick devices that are already used by the first service spec.
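
The intended behavior can be illustrated with a minimal sketch. This is not the code changed by this PR and does not use Ceph's real classes: `Device`, `select_with_limit`, and the `claimed` set are hypothetical names introduced only for illustration. The assumption is that a 'limit' filter should count only devices that are still free, so a second spec skips whatever the first spec already took.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class Device:
    # Hypothetical stand-in for an inventory device; not Ceph's real class.
    path: str


def select_with_limit(devices: Iterable[Device], limit: int,
                      claimed: Set[str]) -> List[Device]:
    """Pick up to `limit` devices, skipping any already claimed by another spec.

    A `limit` of 0 is treated here as "no limit" (an assumption of this sketch).
    """
    selected: List[Device] = []
    for dev in devices:
        if dev.path in claimed:
            # Already used by an earlier spec: skip it, and do not let it
            # count toward this spec's limit.
            continue
        if limit > 0 and len(selected) >= limit:
            break
        selected.append(dev)
    return selected


inventory = [Device(f"/dev/sd{c}") for c in "bcde"]
claimed: Set[str] = set()

# First spec takes two devices and records them as claimed.
first = select_with_limit(inventory, limit=2, claimed=claimed)
claimed.update(d.path for d in first)

# Second spec with the same limit only sees the remaining devices.
second = select_with_limit(inventory, limit=2, claimed=claimed)
```

With the claimed set ignored, both specs would select the same first two devices, which is the symptom described above.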

Fixes: https://tracker.ceph.com/issues/58626

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

@guits guits force-pushed the fix-drive-group-limit branch 3 times, most recently from 575c801 to 0a766d7 Compare February 14, 2023 06:40

adk3798 commented Feb 17, 2023

https://pulpito.ceph.com/adking-2023-02-13_21:57:03-orch:cephadm-wip-adk-testing-2023-02-13-1359-distro-default-smithi/

11 failures, one dead job

  • 8 of the failed jobs were on new tests added by other PRs in the run that aren't quite working yet (test_monitoring_stack_basic, rotate-ssh-keys, test_iscsi_container). Outside of iscsi- and monitoring-stack-specific changes and the PRs that introduced those tests, this shouldn't block merging.
  • 2 instances of https://tracker.ceph.com/issues/58535
  • 1 instance of https://tracker.ceph.com/issues/49287
  • 1 dead job was an error re-imaging machines, infra issue

adk3798 commented Feb 27, 2023

https://pulpito.ceph.com/adking-2023-02-21_05:38:18-orch:cephadm-wip-adk-testing-2023-02-20-1650-distro-default-smithi/

failed/dead job reruns: https://pulpito.ceph.com/adking-2023-02-24_17:44:54-orch:cephadm-wip-adk-testing-2023-02-20-1650-distro-default-smithi/

After reruns, 3 failures and 1 dead job

  • dead job was a failure pulling a podman package:
Failed to download packages: podman-docker-3:4.3.1-2.module_el8.8.0+1254+78119b6e.noarch:
  Cannot download, all mirrors were already tried without success

Another instance of this test passed in the original run, so I didn't bother with another rerun

  • 2 failures were staggered upgrade test issue tracked by https://tracker.ceph.com/issues/58535
  • last failure was in the currently flaky test_nfs task test. It passed when I tried an interactive rerun for debugging

Overall, nothing to block merging. Will note initial version of basic monitoring stack test passed.
