osd/PG.cc: handle removal of pgmeta object #40993
Conversation
In 7f04700, we made the pg removal code much more efficient. But it started marking the pgmeta object as an unexpected onode, which in reality is expected to be removed after all the other objects. This behavior is very easily reproducible in a vstart cluster:

ceph osd pool create test 1 1
rados -p test bench 10 write --no-cleanup
ceph osd pool delete test test --yes-i-really-really-mean-it

Before this patch, "do_delete_work additional unexpected onode list (new onodes has appeared since PG removal started[#2:00000000::::head#]" is seen in the OSD logs. After this patch, "do_delete_work removing pgmeta object #2:00000000::::head#" is seen.

Related to: https://tracker.ceph.com/issues/50466
Signed-off-by: Neha Ojha <nojha@redhat.com>
@neha-ojha this will suppress the warning, but it does not solve the performance cost of the full collection list just above. Now that we understand the extra leftover object, can we just delete it directly instead of listing the entire collection?
I think we're keeping the pgmeta object around so we can still open up the pg and continue if the osd crashes at this point - similar to not removing a directory until all files in it are gone |
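The behavior discussed above can be sketched as a small simulation. This is not the actual PG.cc code; `Onode` and `classify_leftovers` are hypothetical names standing in for `ghobject_t` and the post-deletion collection scan, to illustrate how the patch distinguishes the expected pgmeta leftover from genuinely unexpected onodes:

```cpp
#include <cassert>
#include <string>
#include <vector>

// Illustrative stand-in for a ghobject_t seen by the final collection scan.
struct Onode {
  std::string oid;
  bool is_pgmeta = false;
};

// Sketch of the post-patch logic: after the bulk deletion, a final scan may
// still see the pgmeta object. Instead of flagging it as an unexpected
// leftover, recognize it and queue it for removal last.
std::vector<std::string> classify_leftovers(const std::vector<Onode>& olist,
                                            std::vector<std::string>& to_remove) {
  std::vector<std::string> unexpected;
  for (const auto& o : olist) {
    if (o.is_pgmeta) {
      // Expected: pgmeta is kept until every other object is gone, so the
      // PG can be reopened and deletion resumed if the OSD crashes mid-way.
      to_remove.push_back(o.oid);
    } else {
      // Genuinely new onode that appeared since PG removal started.
      unexpected.push_back(o.oid);
    }
  }
  return unexpected;
}
```

With this split, only the `unexpected` list would warrant the `dout(0)` warning, while the pgmeta object is simply removed.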
    dout(0) << __func__ << " additional unexpected onode list"
            << " (new onodes has appeared since PG removal started"
            << olist << dendl;
    for (auto& oid : olist) {
nit: I'd suggest dropping the `if (!olist.empty())` check at line 2671.
In that case, do we still need the entire block from line 2660 to 2682? AFAIU the unexpected leftover onode is now understood and is cleaned up elsewhere, so we can drop this expensive collection_list from the beginning.
I think @dvanders is talking about the latency spike and the possibility of being marked down by the mon. Log is on the tracker.
@dvanders @k0ste I will verify this and address it in a follow-up PR. |