Reread sync marker and objv after acquiring the lease by soumyakoduri · Pull Request #48397 · ceph/ceph

soumyakoduri · 2022-10-07T18:37:47Z

In RGWDataSyncShardCR, after acquiring the lease, reread sync status shard object to fetch the latest marker & objv stored.

This fix is on top of PR##47682

Signed-off-by: Soumya Koduri skoduri@redhat.com

Checklist

Tracker (select at least one)
- References tracker ticket
- Very recent bug; references commit where it was introduced
- New feature (ticket optional)
- Doc update (no ticket needed)
- Code cleanup (no ticket needed)
Component impact
- Affects Dashboard, opened tracker ticket
- Affects Orchestrator, opened tracker ticket
- No impact that needs to be tracked
Documentation (select at least one)
- Updates relevant documentation
- No doc update is appropriate
Tests (select at least one)
- Includes unit test(s)
- Includes integration test(s)
- Includes bug reproducer
- No tests

Show available Jenkins commands

jenkins retest this please
jenkins test classic perf
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test dashboard cephadm
jenkins test api
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox
jenkins test windows

…ject and store it in a vector Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>

cbodley · 2022-10-10T13:22:45Z

src/rgw/rgw_data_sync.cc

+      /* Reread data sync status to fech latest marker and objv */
+      yield call(new RGWSimpleRadosReadCR<rgw_data_sync_marker>(sync_env->dpp, sync_env->async_rados, sync_env->svc->sysobj,
+                                                             rgw_raw_obj(pool, status_oid),
+                                                             &sync_marker, true, &objv));


needs to check for error with a block like this:

if (retcode < 0) { lease_cr->go_down(); drain_all(); return set_cr_error(retcode); }

yes addressed it.. thanks!
Also observed in the tests that by reading into sync_marker directly here, we loose the in-memory sync_marker updated at the end of the do{} loop in the incremental_sync(). So the rgw server is repeatedly trying to read remote data log from this sync_marker in an endless loop. So I am planning to check version before updating sync_marker.. Does it seem right?

tmp_objv.clear(); /* Reread data sync status to fech the latest marker and update objv */ yield call(new RGWSimpleRadosReadCR<rgw_data_sync_marker>(sync_env->dpp, sync_env->store, rgw_raw_obj(pool, status_oid), &tmp_sync_marker, true, &tmp_objv)); if (retcode < 0) { lease_cr->go_down(); drain_all(); return set_cr_error(retcode); } if (tmp_objv.read_version.ver > objv.read_version.ver) { sync_marker = tmp_sync_marker; objv = tmp_objv; }

<<<<

Also observed in the tests that by reading into sync_marker directly here, we loose the in-memory sync_marker updated at the end of the do{} loop in the incremental_sync().

hmm, i think i see what you mean. RGWDataIncSyncShardCR is using sync_marker.marker as a local variable to track its current position in the listing, separately from RGWDataSyncShardMarkerTrack::sync_marker whose updates are written to the sync status object

if RGWDataIncSyncShardCR bails out with an error, we'd retry this call to RGWDataSyncShardCR with the same sync_marker.marker as before. but we really should start over at the position recorded in our sync status object, so i think this RGWSimpleRadosReadCR is doing the right thing

can you tell why RGWDataIncSyncShardCR is failing? it should run forever unless the lease times out, so i guess that's why? if that's happening consistently, we wouldn't expect to make much progress

I think I found the issue here.. objv should be cleared before calling "RGWSimpleRadosReadCR" otherwise it may fail in the cls_version_check() in the read_op. This and retcode error check should fix the loop we had observed in the tests earlier.

In RGWDataSyncShardCR, after acquiring the lease, reread sync status shard object to fetch the latest marker & objv stored. Signed-off-by: Soumya Koduri <skoduri@redhat.com>

github-actions · 2022-11-30T17:31:06Z

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

cbodley · 2023-02-09T15:27:16Z

@soumyakoduri @smanjara is this still needed?

soumyakoduri · 2023-02-13T05:42:59Z

@soumyakoduri @smanjara is this still needed?

These changes are already merged as part of #48898.

rgw/multisite: add cls versioning for tracking data sync per shard ob…

565c3ca

…ject and store it in a vector Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>

github-actions bot added the rgw label Oct 7, 2022

soumyakoduri force-pushed the rgw-cls-version-sync-shard branch 2 times, most recently from 18c0744 to e778c6a Compare October 7, 2022 19:13

smanjara self-requested a review October 7, 2022 20:46

cbodley reviewed Oct 10, 2022

View reviewed changes

rgw/multisite: Update marker and objv after acquiring the lease

6396c26

In RGWDataSyncShardCR, after acquiring the lease, reread sync status shard object to fetch the latest marker & objv stored. Signed-off-by: Soumya Koduri <skoduri@redhat.com>

soumyakoduri force-pushed the rgw-cls-version-sync-shard branch from e778c6a to 6396c26 Compare October 12, 2022 05:49

cbodley approved these changes Oct 12, 2022

View reviewed changes

github-actions bot added the needs-rebase label Nov 30, 2022

soumyakoduri closed this Feb 13, 2023

soumyakoduri deleted the rgw-cls-version-sync-shard branch March 6, 2026 09:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reread sync marker and objv after acquiring the lease#48397

Reread sync marker and objv after acquiring the lease#48397
soumyakoduri wants to merge 2 commits intoceph:mainfrom
soumyakoduri:rgw-cls-version-sync-shard

soumyakoduri commented Oct 7, 2022 •

edited

Loading

Uh oh!

cbodley Oct 10, 2022

Uh oh!

soumyakoduri Oct 10, 2022

Uh oh!

cbodley Oct 10, 2022

Uh oh!

soumyakoduri Oct 12, 2022

Uh oh!

github-actions bot commented Nov 30, 2022

Uh oh!

cbodley commented Feb 9, 2023

Uh oh!

soumyakoduri commented Feb 13, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

soumyakoduri commented Oct 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Uh oh!

cbodley Oct 10, 2022

Choose a reason for hiding this comment

Uh oh!

soumyakoduri Oct 10, 2022

Choose a reason for hiding this comment

Uh oh!

cbodley Oct 10, 2022

Choose a reason for hiding this comment

Uh oh!

soumyakoduri Oct 12, 2022

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Nov 30, 2022

Uh oh!

cbodley commented Feb 9, 2023

Uh oh!

soumyakoduri commented Feb 13, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

soumyakoduri commented Oct 7, 2022 •

edited

Loading