Skip to content

mds: defer trim() until after the last cache_rejoin ack being received#52648

Merged
rishabh-d-dave merged 1 commit intoceph:mainfrom
lxbsz:wip-62036
Apr 4, 2024
Merged

mds: defer trim() until after the last cache_rejoin ack being received#52648
rishabh-d-dave merged 1 commit intoceph:mainfrom
lxbsz:wip-62036

Conversation

@lxbsz
Copy link
Member

@lxbsz lxbsz commented Jul 26, 2023

Just before the last cache_rejoin ack being received the entire subtree, together with the inode subtree root belongs to, were trimmed the isolated_inodes list couldn't be correctly erased. We should defer calling the trim() until the last cache_rejoin ack being received.

Fixes: https://tracker.ceph.com/issues/62036

Contribution Guidelines

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard cephadm
  • jenkins test api
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox
  • jenkins test windows

@lxbsz lxbsz requested a review from a team July 26, 2023 08:43
@github-actions github-actions bot added the cephfs Ceph File System label Jul 26, 2023
@lxbsz lxbsz force-pushed the wip-62036 branch 2 times, most recently from 27a5b63 to 7db99a7 Compare August 3, 2023 04:11
Copy link
Member

@batrick batrick left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's move mds: remove unused C_MDC_OpenSnapRealms to another PR.

Rest looks good to me but I think you forgot to add MDCache.h changes?

@lxbsz
Copy link
Member Author

lxbsz commented Aug 3, 2023

Let's move mds: remove unused C_MDC_OpenSnapRealms to another PR.

Sure.

Rest looks good to me but I think you forgot to add MDCache.h changes?

Yeah, still in the stash.

@rishabh-d-dave
Copy link
Contributor

@lxbsz @vshankar Does the PR need QA?

@vshankar
Copy link
Contributor

@lxbsz @vshankar Does the PR need QA?

yes.

@rishabh-d-dave
Copy link
Contributor

@lxbsz @vshankar Does the PR need QA?

yes.

Okay, I have collected enough PRs for this one, I'll pick this in next batch. Hopefully, in a couple of days.

@vshankar
Copy link
Contributor

@lxbsz @vshankar Does the PR need QA?

yes.

Okay, I have collected enough PRs for this one, I'll pick this in next batch. Hopefully, in a couple of days.

Seen: https://pulpito.ceph.com/vshankar-2023-10-18_10:34:52-fs-wip-vshankar-testing-20231018.065603-testing-default-smithi/7431467/

@rishabh-d-dave Please include it in your next integration run.

@github-actions
Copy link

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

@github-actions
Copy link

This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
If you are a maintainer or core committer, please follow-up on this pull request to identify what steps should be taken by the author to move this proposed change forward.
If you are the author of this pull request, thank you for your proposed contribution. If you believe this change is still appropriate, please ensure that any feedback has been addressed and ask for a code review.

@github-actions github-actions bot added the stale label Jan 28, 2024
@rishabh-d-dave
Copy link
Contributor

@lxbsz Please rebase and let me know, I'll put it through QA.

@github-actions github-actions bot removed the stale label Feb 2, 2024
Just before the last cache_rejoin ack being received the entire
subtree, together with the inode subtree root belongs to, were
trimmed the isolated_inodes list couldn't be correctly erased. We
should defer calling the trim() until the last cache_rejoin ack
being received.

Fixes: https://tracker.ceph.com/issues/62036
Signed-off-by: Xiubo Li <xiubli@redhat.com>
@lxbsz
Copy link
Member Author

lxbsz commented Feb 2, 2024

@lxbsz Please rebase and let me know, I'll put it through QA.

Done. Thanks.

@rishabh-d-dave rishabh-d-dave added needs-qa wip-rishabh-testing Rishabh's testing label and removed needs-qa labels Feb 22, 2024
Copy link
Contributor

@rishabh-d-dave rishabh-d-dave left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I had to do multiple QA runs and re-runs due to build issues (due to one of the PRs in testing branch), unrelated issues in FS test suite, unrelated issues in testing kernel and new failures in every run that needed to ensured that they were unrelated to PRs in testing branch.

I finally have a run that is better than others - https://tracker.ceph.com/projects/cephfs/wiki/Main#4-Apr-2024. Sorry for the delay.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cephfs Ceph File System wip-rishabh-testing Rishabh's testing label

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants