rgw: multisite stabilization for reef #48898
Conversation
@adamemerson @smanjara @soumyakoduri @yuvalif i could use your help tracking down all of the multisite commits we've taken to 5.3 for testing that haven't made it upstream yet. feel free to push commits directly to this ceph:wip-rgw-multisite-reshard-reef branch. regarding the fifo stuff in #48632, it's probably best to keep it separate while we're still iterating on it?
I have added a couple of commits to #48936. Will merge them into this branch after a few sanity tests. Meanwhile, please review the changes. Thanks!
local multisite test results:
This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved
once #49179 merges to main, i'll rebase this PR on top |
…ject and store it in a vector Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
In RGWDataSyncShardCR, after acquiring the lease, reread sync status shard object to fetch the latest marker & objv stored. Signed-off-by: Soumya Koduri <skoduri@redhat.com>
…will report that it's behind the remote's max-marker even if there are no more entries to sync for each behind shard. if we get an empty listing, remove that shard from behind_shards. Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
Also clear objv before reading the bucket sync status. Signed-off-by: Soumya Koduri <skoduri@redhat.com>
Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
this can be useful to prevent long-lived connections from being dropped due to inactivity Fixes: https://tracker.ceph.com/issues/48402 Signed-off-by: Casey Bodley <cbodley@redhat.com>
Sticking random #defines everywhere is just atrocious style. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Since we were taking them by reference and copying before, this is strictly better. Callers that give us an RValue can skip the copy. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
RGWDataSyncCR manages the lock instead, holding it through StateInit and StateBuildingFullSyncMaps but releasing it by StateSync. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
If someone else got there first, we won't smash their work. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
The `radosgw-admin data sync init` command does *not* use `cls_version` and just overwrites. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Don't go through the 'system object' cache. This also saves us the use of the RADOS async completion processor. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Don't go through the 'system object' cache. This also saves us the use of the RADOS async completion processor. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Don't go through the 'system object' cache. This also saves us the use of the RADOS async completion processor. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Don't go through the 'system object' cache. This also saves us the use of the RADOS async completion processor. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Lock latency in RGWContinuousLeaseCR gets high enough under load that the locks end up timing out, leading to incorrect behavior. Monitor lock latency and cut concurrent operations in half if it goes above ten seconds. Cut concurrency to one if it goes above twenty seconds. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Limited to only warn every five minutes. Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
Fixes: https://tracker.ceph.com/issues/48416 bucket was passed in without bucket_id, now reading entrypoint info if needed. Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
Added test cases for the various use-cases of multisite sync policy feature listed in https://docs.ceph.com/en/latest/radosgw/multisite-sync-policy/ Signed-off-by: Soumya Koduri <skoduri@redhat.com>
…s enabled/disabled Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
…n when versioning is disabled on primary Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
…ucket modifications Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
Flush the marker tracker and abort if we don't still have it. Resolves: rhbz#2129718 Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
…r update failures Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
If any data log entries are missing for the older generations, the sync server may not mark those shards as done and can get stuck in that old gen forever. To avoid that, whenever a future-gen entry is read, write the undone (shard, gen) entry to the error repo so that it can be processed and marked as done, and sync can progress eventually. Signed-off-by: Soumya Koduri <skoduri@redhat.com>
@cbodley We think this has all the things from 5.3. Do we want to put it through QA and merge it, or do we want to try doing load tests against it?

@adamemerson i'd like to see it qa'd and merged. we can continue testing on the main branch
we'll need to clean up the multisite functional tests. i won't block this merge due to the multisite failures, but i'd really like the tests to be green for the reef release. that way we can actually validate future multisite changes and their reef backports
tracks multisite stabilization work that hasn't yet merged to main, so we can validate all of it together in workload testing