Actions
Bug #57618
closedrados/test.sh hang and pkilled (LibRadosWatchNotifyEC.WatchNotify)
% Done:
100%
Source:
Community (dev)
Backport:
pacific
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Tags (freeform):
Merge Commit:
Fixed In:
v18.0.0-1508-ga55987be804
Released In:
v18.2.0~799
Upkeep Timestamp:
2025-07-13T00:40:25+00:00
Description
Job stopped with
2022-09-15T12:49:22.055 INFO:tasks.workunit.client.0.smithi150.stdout: api_tier_pp: [==========] 77 tests from 4 test suites ran. (1973701 ms total) 2022-09-15T12:49:22.056 INFO:tasks.workunit.client.0.smithi150.stdout: api_tier_pp: [ PASSED ] 77 tests. 2022-09-15T12:49:22.056 INFO:tasks.workunit.client.0.smithi150.stderr:+ exit 1 2022-09-15T12:49:22.057 INFO:tasks.workunit.client.0.smithi150.stderr:+ cleanup 2022-09-15T12:49:22.057 INFO:tasks.workunit.client.0.smithi150.stderr:+ pkill -P 120635 2022-09-15T12:49:22.058 DEBUG:teuthology.orchestra.run:got remote process result: 1
since we error in:
2022-09-15T12:37:48.615 DEBUG:teuthology.orchestra.run.smithi150:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 30 ceph --cluster ceph --admin-daemon /var/run/ceph/ceph-osd.1.asok dump_ops_in_flight 2022-09-15T12:37:48.653 INFO:tasks.workunit.client.0.smithi150.stdout: api_watch_notify_pp: Running main() from gmock_main.cc 2022-09-15T12:37:48.654 INFO:tasks.workunit.client.0.smithi150.stdout: api_watch_notify_pp: [==========] Running 16 tests from 2 test suites. 2022-09-15T12:37:48.654 INFO:tasks.workunit.client.0.smithi150.stdout: api_watch_notify_pp: [----------] Global test environment set-up. 2022-09-15T12:37:48.654 INFO:tasks.workunit.client.0.smithi150.stdout: api_watch_notify_pp: [----------] 2 tests from LibRadosWatchNotifyECPP 2022-09-15T12:37:48.655 INFO:tasks.workunit.client.0.smithi150.stdout: api_watch_notify_pp: [ RUN ] LibRadosWatchNotifyECPP.WatchNotify 2022-09-15T12:37:48.656 INFO:tasks.workunit.client.0.smithi150.stderr:bash: line 1: 120883 Alarm clock ceph_test_rados_api_watch_notify_pp 2>&1 2022-09-15T12:37:48.656 INFO:tasks.workunit.client.0.smithi150.stderr: 120884 Done | tee ceph_test_rados_api_watch_notify_pp.log 2022-09-15T12:37:48.656 INFO:tasks.workunit.client.0.smithi150.stderr: 120885 Done | sed "s/^/ api_watch_notify_pp: /" 2022-09-15T12:37:48.656 INFO:tasks.workunit.client.0.smithi150.stderr:+ echo 'error in api_watch_notify_pp (120879)' 2022-09-15T12:37:48.657 INFO:tasks.workunit.client.0.smithi150.stdout:error in api_watch_notify_pp (120879)
Alarm clock raised after 1200 seconds, the printing log was delay until the process killed.
/a/nmordech-2022-09-15_08:35:17-rados:verify-wip-nm-51282-distro-default-smithi/7033827
Updated by Nitzan Mordechai over 3 years ago
It will only happen with EC pools, the hang will happen when not all osd are up, but still, i'm not sure if we suppose to wait
Updated by Radoslaw Zarzynski over 3 years ago
Note from a scrub: might we worth talking about.
Updated by Nitzan Mordechai over 3 years ago
Some of the OSDs stopped due to valgrind errors. This is duplicate of other bug
Updated by Nitzan Mordechai over 3 years ago
- Status changed from New to Fix Under Review
Updated by Nitzan Mordechai over 3 years ago
- Related to Bug #52136: Valgrind reports memory "Leak_DefinitelyLost" errors. added
Updated by Nitzan Mordechai over 3 years ago
- Related to Bug #57751: LibRadosAio.SimpleWritePP hang and pkill added
Updated by Nitzan Mordechai over 3 years ago
- Related to Bug #53575: Valgrind reports memory "Leak_PossiblyLost" errors concerning lib64 added
Updated by Laura Flores over 3 years ago
/a/yuriw-2022-11-29_22:29:58-rados-wip-yuri10-testing-2022-11-29-1005-pacific-distro-default-smithi/7097464/
Updated by Laura Flores almost 3 years ago
- Status changed from Fix Under Review to Pending Backport
Updated by Upkeep Bot almost 3 years ago
- Copied to Backport #59627: quincy: rados/test.sh hang and pkilled (LibRadosWatchNotifyEC.WatchNotify) added
Updated by Upkeep Bot almost 3 years ago
- Copied to Backport #59628: pacific: rados/test.sh hang and pkilled (LibRadosWatchNotifyEC.WatchNotify) added
Updated by Nitzan Mordechai almost 3 years ago
- Backport changed from pacific,quincy to pacific
Updated by Konstantin Shalygin about 2 years ago
- Status changed from Pending Backport to Resolved
- % Done changed from 0 to 100
- Source set to Community (dev)
Updated by Upkeep Bot 8 months ago
- Merge Commit set to a55987be8044c84718beac942f4f1582012f2964
- Fixed In set to v18.0.0-1508-ga55987be804
- Released In set to v18.2.0~799
- Upkeep Timestamp set to 2025-07-13T00:40:25+00:00
Actions