mgr/vol: handling the failed non-atomic operation#59676
Conversation
b992ec1 to
33f82c3
Compare
|
jenkins test api |
vshankar
left a comment
There was a problem hiding this comment.
Changes look fine. Possible to add a test real quick @neesingh-rh ?
b74ff1e to
5dd8ea5
Compare
|
jenkins test api |
|
jenkins test make check |
rishabh-d-dave
left a comment
There was a problem hiding this comment.
Will review it once more in some time.
BTW, have you run the test added on this PR locally?
7aec81b to
53fd9dc
Compare
qa/workunits/fs/full/subvol_rm_retained_snap_when_cluster_is_full.sh
Outdated
Show resolved
Hide resolved
|
@neesingh-rh Also please change the commit title of 020e7c6 to something more meaningful - Also in the commit message mention about subvolume removal with --retain-snapshot option. The patch is touching only that case and not the generic subvolume removal when there are no snapshots. |
@neesingh-rh ping |
qa/workunits/fs/full/subvol_rm_retained_snap_when_cluster_is_full.sh
Outdated
Show resolved
Hide resolved
qa/workunits/fs/full/subvol_rm_retained_snap_when_cluster_is_full.sh
Outdated
Show resolved
Hide resolved
qa/workunits/fs/full/subvol_rm_retained_snap_when_cluster_is_full.sh
Outdated
Show resolved
Hide resolved
qa/workunits/fs/full/subvol_rm_retained_snap_when_cluster_is_full.sh
Outdated
Show resolved
Hide resolved
qa/workunits/fs/full/subvol_rm_retained_snap_when_cluster_is_full.sh
Outdated
Show resolved
Hide resolved
53fd9dc to
b22ab21
Compare
Yes. |
Done, PTAL |
rishabh-d-dave
left a comment
There was a problem hiding this comment.
@neesingh-rh I see that PR has gone through some changes since I reviewed it last. Please ping me when changes requested from me have been incorporated and also when PR goes through changes substantial enough.
Current set of changes looks better than last one, I'll do an-depth review shortly.
|
ceph API tests failed for unrelated reasons. |
|
jenkins test api |
3ed0cc0 to
29f9141
Compare
|
Rebased! |
OK. I will have a look at the test failures since they are new failures. |
I think this is related to this PR. Since the |
It definitely is -- I reran fs:volumes with and without the change and the issue reproduces. |
vshankar
left a comment
There was a problem hiding this comment.
@neesingh-rh - PTAL at the failed test.
The order of operation done for subvolume removal with retain-snapshot option set to true, is reversed. The metadata is updated first followed by a rename operation on the uuid directory. If the metadata update operation fails, then the remove operations is failed thereby, keeping the subvolume metadata consistent with the uuid path. Fixes: https://tracker.ceph.com/issues/67330 Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
… full Fixes: https://tracker.ceph.com/issues/67330 Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com> Conflicts: qa/suites/fs/full/tasks/mgr-osd-full.yaml
29f9141 to
ea624e4
Compare
|
Here's the run link: Failures unrelated. |
|
@neesingh-rh - could you give a gist about what the issue was with the last revision that caused the test to fail and what has been done to fix it? |
The tests were failing because of saving the self.path value at the wrong place due to which the auto cleanup was getting affected/failing. It should be saved just before the deletion of path from .meta, to have the correct path info. |
|
This PR is under test in https://tracker.ceph.com/issues/73162. |
|
@neesingh-rh - I am seeing a failures in fs:volumes: https://pulpito.ceph.com/vshankar-2025-09-26_04:42:20-fs:volumes-wip-vshankar-testing-20250925.124305-debug-testing-default-smithi/8521305/ In tautology.log, the error happens when removing a subvolume I want to be doubly sure that this is not sue to this change, especially since its the subvolume part that this change modifies and the test failure is too in that code path. |
@vshankar The first failure is the |
You are right. Thx for checking. I will link up a tracker for the above issue. |
The order of operation done for subvolume removal is reversed. The metadata is updated first followed by a rename operation on the uuid directory. If the metadata update operation fails, then the remove operations is failed thereby, keeping the subvolume metadata consistent with the uuid path.
Fixes: https://tracker.ceph.com/issues/67330
Signed-off-by: Neeraj Pratap Singh neesingh@redhat.com
Contribution Guidelines
To sign and title your commits, please refer to Submitting Patches to Ceph.
If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.
When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an
xbetween the brackets:[x]. Spaces and capitalization matter when checking off items this way.Checklist
Show available Jenkins commands
jenkins retest this pleasejenkins test classic perfjenkins test crimson perfjenkins test signedjenkins test make checkjenkins test make check arm64jenkins test submodulesjenkins test dashboardjenkins test dashboard cephadmjenkins test apijenkins test docsjenkins render docsjenkins test ceph-volume alljenkins test ceph-volume toxjenkins test windowsjenkins test rook e2e