mds: batch backtrace updates by pool-id when expiring a log segment #55421
Conversation
@vshankar I think the
batrick
left a comment
It's not clear to me from reading the patch how "thereby causing the MDS to go read-only when the error (-ENOENT) is trickled up for a backtrace update for the metadata pool or an undeleted data pool." is avoided by this change. Could you explain in a comment?
Sure. Will update the change explaining the fix.

Dropped from testing till #55421 (comment) gets fixed.

jenkins test api
batrick
left a comment
I think a test should be somewhat trivial to synthesize, no?

- create dir with `ceph.dir.layout.pool == some-new-pool`
- create empty file in that dir
- flush mds journal
- set `ceph.file.layout.pool == some-new-pool2` on the file
- restore default layout for dir
- delete `some-new-pool`
- flush the mds log (would fail before but now does not)?

I don't think a failover is required?
You can use ceph-dencoder of course to verify the backtrace updates.
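For reference, the steps above could be sketched as a manual reproduction against a live test cluster. This is only an illustrative sketch, not a tested script: it assumes a filesystem named `cephfs` mounted at `/mnt/cephfs`, an MDS named `a`, and `mon_allow_pool_delete` enabled; the pool names mirror the hypothetical ones used above.

```shell
# create the first pool and make it a data pool of the fs
ceph osd pool create some-new-pool
ceph fs add_data_pool cephfs some-new-pool

# dir with a layout pointing at the new pool, plus an empty file
mkdir /mnt/cephfs/dir
setfattr -n ceph.dir.layout.pool -v some-new-pool /mnt/cephfs/dir
touch /mnt/cephfs/dir/file
ceph tell mds.a flush journal        # flush mds journal

# move the (still empty) file to a second pool, restore the dir layout
ceph osd pool create some-new-pool2
ceph fs add_data_pool cephfs some-new-pool2
setfattr -n ceph.file.layout.pool -v some-new-pool2 /mnt/cephfs/dir/file
setfattr -x ceph.dir.layout /mnt/cephfs/dir

# delete the first pool and expire the log segment
ceph fs rm_data_pool cephfs some-new-pool
ceph osd pool rm some-new-pool some-new-pool --yes-i-really-really-mean-it
ceph tell mds.a flush journal        # would fail before; now should not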
src/mds/journal.cc (outdated):

```cpp
// dispatch separate ops for backtrace updates for old pools
in->store_backtrace(ops_vec_map[pool_id].back(), op_prio, true);
for (auto p : in->get_inode()->old_pools) {
  in->store_backtrace(ops_vec_map[p].back(), op_prio, true);
```
Suggested change:

```diff
-  in->store_backtrace(ops_vec_map[p].back(), op_prio, true);
+  ops_vec_map[p].push_back(CInodeCommitOperations());
+  in->store_backtrace(ops_vec_map[p].back(), op_prio, true);
```

?
There was a problem hiding this comment.
I see what you mean to suggest. It's done for the current pool - can be the same for old_pools.
* refs/pull/55421/head: mds: batch backtrace updates by pool-id when expiring a log segment Reviewed-by: Xiubo Li <xiubli@redhat.com>
Makes sense. I didn't bother adding a test since this issue was pretty much getting reproduced on fs suite test branches. I'll add a test and update.
Correct.
This change is ready for re-review. I changed the implementation a bit since the old implementation was buggy - backtrace updates to old pools were not dispatched. So, now, journal.cc prepares a separate CInodeCommitOperations() instance for each old pool of an inode and uses the backtrace generated for the default data pool. Also, I added a check for STATE_DIRTYPOOL (as done in CInode::_store_backtrace) since layout and backtraces to old pools should not be updated under certain circumstances (parent changing, etc.). Note: the idea of dispatching per-pool backtrace updates still remains the same.

@lxbsz fixed and updated.

jenkins test make check

jenkins test make check arm64
batrick
left a comment
base for this branch is weird, it's not on main?
otherwise lgtm
It should be. I fixed and refreshed the change.

This PR is under test in https://tracker.ceph.com/issues/66521.
This change seems to be a bit buggy and causing failures like https://pulpito.ceph.com/vshankar-2024-06-30_16:43:29-fs-wip-vshankar-testing-20240628.170835-debug-testing-default-smithi/7779994/

I take this back. The issue is in another change. See: #54725 (comment) (back into test branch)

I see one failed test job with a similar effect - mds going read-only. Deferring merge till it's investigated. /a/vshankar-2024-07-08_07:21:13-fs-wip-vshankar-testing-20240705.150505-debug-testing-default-smithi/7791798

Some osd_ops are returning -2 (ENOENT) for commit operations sent by the MDS.

This looks like a new bug, unrelated to this change. I don't see any backtrace-related error during segment expiry.
Signed-off-by: Venky Shankar <vshankar@redhat.com>
LogSegment::try_to_expire() batches backtrace updates for inodes in the dirty_parent_inodes list. If a backtrace update operation fails for one inode due to a missing (removed) data pool, the failure is specially handled by treating the operation as a success; however, the errno (-ENOENT) is stored by the gather context and passed on as the return value to subsequent operations (even for successful backtrace update operations in the same gather context).

Fixes: http://tracker.ceph.com/issues/63259
Signed-off-by: Venky Shankar <vshankar@redhat.com>
…a pool Signed-off-by: Venky Shankar <vshankar@redhat.com>
This PR is under test in https://tracker.ceph.com/issues/67711.
Otherwise, a backtrace update failure due to a removed data pool would cause the entire batch to be considered a failed backtrace update (depending on when the first failure happens), thereby causing the MDS to go read-only when the error (-ENOENT) is trickled up for a backtrace update for the metadata pool or an undeleted data pool.
Fixes: http://tracker.ceph.com/issues/63259
NOTE - this hasn't been tested yet - will get to that in a while.
@lxbsz - this change does away with the vector resize bits; I think we can bring that back in. Question: was that (vector resize) done since you expected the MDS to spend much time in that (holding the mds_lock), so preallocating space would lower the time spent?