
cephfs_mirror, qa: fix test failure test_cephfs_mirror_cancel_mirroring_and_readd#56118

Merged
vshankar merged 2 commits into ceph:main from joscollin:wip-B64711-qa-failure_cancel-mirroring-and-readd
Oct 7, 2024


@joscollin
Member

@joscollin joscollin commented Mar 11, 2024

Fixes: https://tracker.ceph.com/issues/64711
Fixes: https://tracker.ceph.com/issues/65115



@github-actions github-actions bot added the cephfs (Ceph File System) and tests labels Mar 11, 2024
@joscollin joscollin changed the title from "cephfs_mirror: fix test failure: test_cephfs_mirror_cancel_mirroring_and_readd" to "cephfs_mirror: fix test failure test_cephfs_mirror_cancel_mirroring_and_readd" Mar 11, 2024
@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from cd8ceea to 6923235 Compare March 11, 2024 12:01
@joscollin joscollin marked this pull request as draft March 12, 2024 04:38
@github-actions

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from 6923235 to d532a11 Compare March 22, 2024 12:45
@joscollin joscollin marked this pull request as ready for review March 22, 2024 12:45
@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from d532a11 to 26cd7bc Compare March 25, 2024 03:51
@joscollin joscollin requested a review from vshankar March 25, 2024 14:20
@joscollin
Member Author

jenkins test make check arm64

@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from 26cd7bc to 7b283c7 Compare March 26, 2024 09:01
@vshankar
Contributor

Thanks for the update @joscollin. Could you run this through fs:mirror and share the results?

Suggest running with -N 10 to schedule more jobs.

@joscollin
Member Author

> Thanks for the update @joscollin. Could you run this through fs:mirror and share the results?
>
> Suggest running with -N 10 to schedule more jobs.

The shaman build is in progress (main rebased). Will run the fs:mirror when it's ready.

I ran fs:mirror on top of your failed sha 717ce3c. All jobs passed:
https://pulpito.ceph.com/jcollin-2024-03-26_03:56:22-fs:mirror-wip-jcollin-testing-25032024-distro-default-smithi/

@joscollin
Member Author

jenkins test make check arm64

@joscollin
Member Author

jenkins test make check

@joscollin
Member Author

> Thanks for the update @joscollin. Could you run this through fs:mirror and share the results?
>
> Suggest running with -N 10 to schedule more jobs.

@vshankar
Test test_cephfs_mirror_cancel_mirroring_and_readd passed 25 times with no 'Bad file descriptor' errors in the logs. The one failed job is unrelated.

Please check:
https://pulpito.ceph.com/jcollin-2024-03-27_00:57:31-fs:mirror-wip-jcollin-testing-26032024-distro-default-smithi/

@vshankar
Contributor

vshankar commented Apr 1, 2024

@joscollin changes under qa look good.

@joscollin
Member Author

jenkins test make check arm64

vshankar added a commit to vshankar/ceph that referenced this pull request Apr 8, 2024
* refs/pull/56118/head:
	qa: fix check_peer_snap_in_progress wait for 'syncing'
	cephfs_mirror: Fixes 'Bad file descriptor' showing during test_cephfs_mirror_cancel_mirroring_and_readd execution
Contributor

@vshankar vshankar left a comment


@joscollin PTAL at the build failure.

@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from 7b283c7 to 41dcd81 Compare April 9, 2024 05:34
@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from 41dcd81 to ad05017 Compare April 19, 2024 13:44
@joscollin
Member Author

rebased

@joscollin joscollin requested a review from vshankar August 13, 2024 15:09
@joscollin
Member Author

jenkins test api

@joscollin
Member Author

jenkins test make check

@joscollin
Member Author

jenkins test make check arm64

@joscollin
Member Author

@vshankar The test passed on top of your failed branch: https://pulpito.ceph.com/jcollin-2024-08-14_10:58:02-fs:mirror-wip-vshankar-testing-20240806.162829-debug-distro-default-smithi/

@joscollin
Member Author

jenkins test make check arm64

…_mirror_cancel_mirroring_and_readd execution

Avoid stale fh by calling pre_sync_check_and_open_handles inside the loop

Fixes: https://tracker.ceph.com/issues/65115
Signed-off-by: Jos Collin <jcollin@redhat.com>
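
The actual change lives in the C++ cephfs-mirror daemon and is not shown in this conversation. As a rough Python analogy only, the sketch below shows why a handle opened once outside a loop can go stale and produce 'Bad file descriptor' (EBADF), and how reopening it inside the loop avoids that. The helpers open_handles and sync_once are hypothetical stand-ins, not the daemon's API.

```python
import os


def open_handles(path):
    """Hypothetical stand-in for pre_sync_check_and_open_handles: open a fresh fd."""
    return os.open(path, os.O_RDONLY)


def sync_once(fd):
    """Hypothetical per-snapshot work: use the fd, then close it when done."""
    size = os.fstat(fd).st_size
    os.close(fd)
    return size


def sync_with_stale_handle(path, rounds):
    """Opens the handle once, outside the loop; the second round fails with EBADF."""
    fd = open_handles(path)
    return [sync_once(fd) for _ in range(rounds)]


def sync_with_fresh_handles(path, rounds):
    """Opens the handle inside the loop, so every round works on a valid fd."""
    sizes = []
    for _ in range(rounds):
        fd = open_handles(path)
        sizes.append(sync_once(fd))
    return sizes
```

Calling sync_with_stale_handle with two or more rounds raises OSError with errno EBADF on the second round, while sync_with_fresh_handles completes every round.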
qa: make all replayer threads busy and query 'syncing' status

* Make all replayer threads busy and then query for the 'syncing' state
instead of just fetching the current status.

* Drop the 'current_syncing_snap' check, as it is not required for
this test. The actual intention is to get the threads into 'syncing' status,
and the 'current_syncing_snap' check is not necessary for that.

* Drop the 'snaps_deleted' metrics check in test_cephfs_mirror_cancel_mirroring_and_readd.
The test primarily focuses on the synchronization of newly added directory
paths after removal of the previously added/syncing directory paths, so
checking the 'snaps_deleted' metrics is unnecessary here.

* Wait longer for the new snapshot creations and the sync backoff to finish.
The test needs more time because it makes all replayer threads busy.

Fixes: https://tracker.ceph.com/issues/64711
Signed-off-by: Jos Collin <jcollin@redhat.com>
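
The idea of querying for the 'syncing' state instead of reading the status once can be sketched as a polling helper. This is a minimal illustration, not the code in qa/tasks/cephfs/test_mirroring.py: the function name wait_for_state, the fetch_status callable, and the "state" key in the returned dict are all assumptions made for the example.

```python
import time


def wait_for_state(fetch_status, wanted="syncing", timeout=60.0, interval=1.0,
                   clock=time.monotonic, sleep=time.sleep):
    """Poll fetch_status() until it reports `wanted` or the timeout expires.

    fetch_status is a hypothetical callable returning a dict with a "state" key.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status.get("state") == wanted:
            return status
        if clock() >= deadline:
            raise TimeoutError(
                f"state never became {wanted!r}; last status: {status!r}")
        sleep(interval)
```

A single status fetch can race with the replayer threads and observe 'idle' just before a sync starts; polling with a deadline tolerates that window, which is also why a busier test needs a longer wait.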
@joscollin joscollin force-pushed the wip-B64711-qa-failure_cancel-mirroring-and-readd branch from 3986c36 to db9825d Compare August 20, 2024 07:50
@joscollin
Member Author

jenkins test make check arm64

@joscollin joscollin changed the title from "cephfs_mirror: fix test failure test_cephfs_mirror_cancel_mirroring_and_readd" to "cephfs_mirror, qa: fix test failure test_cephfs_mirror_cancel_mirroring_and_readd" Aug 27, 2024
@joscollin
Member Author

@rishabh-d-dave @vshankar Please add this PR to your next QA batch.

@joscollin
Member Author

jenkins test make check arm64

@vshankar
Contributor

This PR is under test in https://tracker.ceph.com/issues/68092.

@joscollin
Member Author

> This PR is under test in https://tracker.ceph.com/issues/68092.

@vshankar test_cephfs_mirror_cancel_mirroring_and_readd passed in https://tracker.ceph.com/issues/68092.

joscollin pushed a commit to joscollin/ceph that referenced this pull request Sep 23, 2024
* refs/pull/56118/head:
	qa: make all replayer threads busy and query 'syncing' status
	cephfs_mirror: Fixes 'Bad file descriptor' showing during test_cephfs_mirror_cancel_mirroring_and_readd execution

Reviewed-by: Rishabh Dave <ridave@redhat.com>
@joscollin
Member Author

Contributor

@vshankar vshankar left a comment


@vshankar vshankar dismissed rishabh-d-dave’s stale review October 7, 2024 07:08

Comment addressed.

@vshankar vshankar merged commit 635b554 into ceph:main Oct 7, 2024
@joscollin joscollin deleted the wip-B64711-qa-failure_cancel-mirroring-and-readd branch October 8, 2024 03:25
joscollin pushed a commit to joscollin/ceph that referenced this pull request Oct 11, 2024
* refs/pull/56118/head:
	qa: make all replayer threads busy and query 'syncing' status
	cephfs_mirror: Fixes 'Bad file descriptor' showing during test_cephfs_mirror_cancel_mirroring_and_readd execution

Reviewed-by: Rishabh Dave <ridave@redhat.com>
@joscollin
Member Author

2024-07-11T20:19:57.022 INFO:tasks.cephfs_test_runner:FAIL: test_cephfs_mirror_incremental_sync (tasks.cephfs.test_mirroring.TestMirroring)
2024-07-11T20:19:57.022 INFO:tasks.cephfs_test_runner:Test incremental snapshot synchronization (based on mtime differences).
2024-07-11T20:19:57.022 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2024-07-11T20:19:57.022 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2024-07-11T20:19:57.022 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_rishabh-d-dave_ceph_167d9c833410f9c01f499ea86eaca54be8d33d31/qa/tasks/cephfs/test_mirroring.py", line 1158, in test_cephfs_mirror_incremental_sync
2024-07-11T20:19:57.022 INFO:tasks.cephfs_test_runner:    self.verify_snapshot(repo_path, 'snap_c')
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/github.com_rishabh-d-dave_ceph_167d9c833410f9c01f499ea86eaca54be8d33d31/qa/tasks/cephfs/test_mirroring.py", line 248, in verify_snapshot
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:    self.assertTrue(source_res == dest_res)
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:AssertionError: False is not true
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:Ran 9 tests in 2845.658s
2024-07-11T20:19:57.023 INFO:tasks.cephfs_test_runner:
2024-07-11T20:19:57.024 INFO:tasks.cephfs_test_runner:FAILED (failures=1)

Unexpected failure here.

Fixed by ced1f6a
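
The "AssertionError: False is not true" line in the traceback above comes from assertTrue(source_res == dest_res), which reports only the boolean result. As a side note for reading such logs, the sketch below contrasts that with assertEqual, which includes both values in the message. The class name and the checksum strings are hypothetical placeholders, not values from the failed run.

```python
import unittest


class CompareChecksums(unittest.TestCase):
    # Hypothetical values standing in for source_res and dest_res.
    source_res = "abc123"
    dest_res = "def456"

    def runTest(self):
        pass


def failure_message(check):
    """Run one assertion and return the text of the AssertionError it raises."""
    case = CompareChecksums()
    try:
        check(case)
    except AssertionError as exc:
        return str(exc)
    return None


bare = failure_message(lambda t: t.assertTrue(t.source_res == t.dest_res))
rich = failure_message(lambda t: t.assertEqual(t.source_res, t.dest_res))
print(bare)  # the same uninformative text as in the log
print(rich)  # names both mismatching values
```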
