
Optimized Erasure Coding - Fixpack 2#64501

Merged
aainscow merged 29 commits into ceph:main from aainscow:ec_fixpack2_pr
Aug 12, 2025

Conversation

@aainscow
Contributor

@aainscow aainscow commented Jul 15, 2025

This is a fixpack for general bugs which do not (knowingly) affect performance.

A minor tracker fixed by this PR:
https://tracker.ceph.com/issues/72077

Contribution Guidelines

  • To sign and title your commits, please refer to Submitting Patches to Ceph.

  • If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.

  • When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)

@aainscow aainscow requested a review from a team as a code owner July 15, 2025 07:54
@aainscow aainscow marked this pull request as draft July 16, 2025 13:34
@github-actions

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

@aainscow aainscow force-pushed the ec_fixpack2_pr branch 2 times, most recently from b2774c8 to 16ea594 Compare July 21, 2025 06:46
1. Refactor the code that applies pwlc to update info and log so that there
is one function rather than multiple copies of the code.

2. pwlc information is used to track shards that have not been updated by
partial writes. It is used to advance last_complete (and last_update and
the log head) to account for log entries that the shard missed. It was
only being applied if last_complete matched the range of partial writes
recorded in pwlc. When a shard has missing objects, last_complete is
deliberately held back before the oldest need, which stops pwlc being
applied. This is wrong: pwlc can still update last_update and the log
head even if it cannot update last_complete.

3. When the primary receives info (and pwlc) information from OSD x(y),
it uses the pwlc information to update x(y)'s info. During backfill
there may be other shards z(y) which should also be updated using the
pwlc information.
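
As an illustrative sketch only (hypothetical names, not the real PeeringState code), the rule in point (2) can be expressed as: advance last_update whenever it falls inside the pwlc range, but let last_complete move only if it had already caught up with last_update.

```python
# Hypothetical sketch of applying a pwlc range to a shard's info.
# Versions are simplified to plain integers for illustration.

def apply_pwlc(pwlc_from, pwlc_to, last_complete, last_update):
    """Advance last_update (and the log head) when it lies inside the
    pwlc range, even if last_complete is held back by missing objects."""
    caught_up = (last_complete == last_update)
    if pwlc_from <= last_update < pwlc_to:
        last_update = pwlc_to
    # last_complete may only be advanced when the shard had no
    # outstanding updates (it was equal to last_update).
    if caught_up and pwlc_from <= last_complete < pwlc_to:
        last_complete = last_update
    return last_complete, last_update
```

With this shape, a shard holding last_complete back (e.g. at 3 with pwlc range 5..10 and last_update 7) still gets last_update advanced to 10 while last_complete stays put.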

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
@github-actions

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

aainscow and others added 11 commits July 25, 2025 08:42
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
When recovering attributes, we read them from the first potential primary, then,
if that read fails, attempt to read from another potential primary.

The problem is that the code which calculates which shards to read for a recovery
only takes into account *data* and not where the attributes are.  As such, if the
second read only required a non-primary, then the attribute read fails and the
OSD panics.

The fix is to detect this scenario and perform an empty read to that shard, which
the attribute-read code can use for attribute reads.

Code was incorrectly interpreting a failed attribute read on recovery as
meaning a "fast_read". Also, no attribute recovery would occur in this case.
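
A minimal sketch of the fix described above (hypothetical helper names, not the actual OSD code): the recovery read plan is checked for a potential primary and, if none is present, a zero-length read is added so the attribute-read code has somewhere to read attrs from.

```python
# Illustrative only: plan recovery reads so that at least one potential
# primary shard is read, issuing an empty read if data reads alone
# would only touch non-primary shards.

def plan_recovery_reads(data_shards, primary_shards):
    """Return a map of shard -> read kind ('data' or 'empty')."""
    reads = {shard: "data" for shard in data_shards}
    if not any(shard in primary_shards for shard in reads):
        # Data can be rebuilt without a primary, but attributes live on
        # primaries: add an empty read for the attribute-read code.
        reads[min(primary_shards)] = "empty"
    return reads
```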

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
When the primary shard of an optimized EC pool does not have
a copy of the log it may need to repeat the GetLog peering
step twice, the first time to get a full copy of a log from
a shard that sees all log entries and then a second time
to get the "best" log from a non-primary shard which may
have a partial log due to partial writes.

A side effect of repeating GetLog is that the missing
list is collected for both the "best" shard and the
shard that provides a full copy of the log. This latter
missing list confuses later steps in the peering
process and may cause this shard to complete writes
and end up diverging from the primary. Discarding
this missing list causes Peering to behave the same as if
the GetLog step did not need to be repeated.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Fix 1:

These are multiple fixes that affected the same code. To simplify review
and understanding of the changes, they have been merged into a single
commit.

What happened in the defect (k=6, m=4):

1. State is: fast_reads = true, shards 0,4,5,6,7,8 are available. Shard 1 is missing this object.
2. Shard 5 only needs zeros, so read is dropped. Other sub read message sent.
3. Object on shard 1 completes recovery (so becomes not-missing)
4. The read completes; the completion logic notices that it only has 5 reads, so it calculates what it needs to re-read.
5. It calculates it needs 0,1,4,5,6,7 - and so wants to read shard 1.
6. The code assumes that enough reads should have been performed, so it refuses to do another read and instead generates an EIO.

The problem here is some "lazy" code in step (4). What it should be doing is working out that it
can use the zero buffers and not calling get_remaining_reads(). Instead, what it attempts to do is
call get_remaining_reads() and, if there is no work to do, it assumes it has everything
already and completes the read with success. This assumption mostly works - but the
combination of fast_reads picking fewer than k shards to read from AND an object completing
recovery in parallel causes an issue.

The solution is to wait for all reads to complete and then assume that any remaining zero buffers
count as completed reads. This should then cause the plugin to declare "success".
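
The completion check for steps (4)-(6) can be sketched as follows (illustrative names, not the EC backend's real code): wait for all in-flight reads, then count known-zero shards as completed.

```python
# Illustrative sketch of Fix 1: once every issued read has completed,
# shards whose payload is known to be all zeros count as completed
# reads, so no re-read (and no spurious EIO) is needed.

def have_enough_reads(k, completed, zero_shards, inflight):
    """True when k shards' worth of data is available for decode."""
    if inflight:
        return False  # wait for all issued reads to complete first
    return len(set(completed) | set(zero_shards)) >= k
```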

Fix 2:

There are decodes in new EC which can occur when fewer than k
shards have been read. These reads are in the last stripe, where,
for decoding purposes, the data past the end of the shard can
be considered zeros. EC does not read these zeros, but instead relies
on the decode function inventing the zero buffers.

This was working correctly when fast reads were turned off, but
if such an IO was encountered with fast reads turned on, the
logic was disabled and the IO returned an EIO.

This commit fixes that logic, so that if all reads have completed
and send_all_remaining_reads conveys that no new reads were
requested, then decode will still be possible.

FIX 3:

When reading the end of an object whose shards are unequally sized,
we pad out the end of the decode with zeros, to provide
the correct data to the plugin.

Previously, the code decided not to add the zeros to "needed"
shards.  This caused a problem where for some parity-only
decodes, an incomplete set of zeros was generated, fooling the
_decode function into thinking that the entire shard was zeros.

In the fix, we need to cope with the case where the only data
needed from the shard is the padding itself.

The comments around the new code describe the logic behind
the change.

This makes the encode-specific use case of padding out the
to-be-decoded shards unnecessary, as this is performed by the
pad_on_shards function below.

Also fixed some logic where the need_set being passed
to the decode function did not contain the extra shards needed
for the decode. This need_set is actually ignored by all the
plugins as far as I know, but having it wrong does not seem
helpful if it is needed in the future.

Fix 4: Extend reads when recovering parity

Here is an example use case which was going wrong:
1. Start with 3+2 EC, shards 0,3,4 are 8k; shards 1,2 are 4k
2. Perform a recovery, where we recover 2 and 4.  2 is missing, 4 can be copied from another OSD.
3. Recovery works out that it can do the whole recovery with shards 0,1,3. (note not 4)
4. So the "need" set is 0,1,3, the "want" set is 2,4 and the "have" set is 0,1,3,4,5
5. The logic in get_all_avail_shards then tries to work out the extents it needs - it only looks at 2, because we "have" 4
6. Result is that we end up reading 4k on 0,1,3, then attempt to recover 8k on shard 4 from this... which clearly does not work.
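
The read-sizing rule for Fix 4 can be sketched like this (hypothetical helper, not the real get_all_avail_shards): size the reads by everything in the "want" set, including shards we could copy from elsewhere, not just the shards that are missing outright.

```python
# Illustrative sketch of Fix 4: read enough from each needed shard to
# rebuild the *largest* wanted shard, even if a copy of that shard
# already exists ("have") on another OSD.

def read_length(shard_sizes, want):
    """Bytes to read from each needed shard for this recovery."""
    return max(shard_sizes[s] for s in want)
```

In the example above, want = {2, 4} with shard 4 at 8k forces 8k reads on shards 0, 1 and 3, instead of the 4k reads that caused the failure.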

Fix 5: Round up padding to 4k alignment in EC

The pad_on_shards function was not aligning to 4k. However, the decode/encode functions were. This meant that
we assigned a new buffer, then appended another afterwards - aligning in pad_on_shards avoids the extra buffer and should be faster.
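
A minimal sketch of the 4 KiB round-up (hypothetical names; the real pad_on_shards works on Ceph bufferlists):

```python
# Illustrative only: pad a shard buffer with zeros up to the next
# 4 KiB boundary, so the encode/decode paths see aligned buffers and
# only one buffer needs to be appended.

ALIGNMENT = 4096

def pad_to_alignment(buf: bytes) -> bytes:
    padded_len = (len(buf) + ALIGNMENT - 1) // ALIGNMENT * ALIGNMENT
    return buf + b"\x00" * (padded_len - len(buf))
```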

Fix 6: Do not invent encode buffers before doing decode.

In this bug, during recovery, we could potentially be creating
unwanted encode buffers and using them to decode data buffers.

This fix simply removes the bad code, as there is new code above
which is already doing the correct action.

Fix 7: Fix miscompare with missing decodes.

In this case, two OSDs failed at once. One was replaced and the other was not.

This caused us to attempt to encode a missing shard while another shard was missing, which
caused a miscompare because the recovery failed to do the decode properly before doing an encode.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
PeeringState::proc_replica_log needs to apply pwlc before
calling PGLog so that any partial writes that have occurred
are taken into account when working out where a replica/stray
has diverged from the primary.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
This was a failed test, where the primary concluded that all objects were present
despite one missing object on the non primary shard.

The problem was caused because the log entries are sent to the unwritten shards if that
shard is missing in order to update the version number in the missing object. However,
the log entry should not actually be added to the log.

Further testing showed there are other scenarios where log entries are sent to
unwritten shards (for example a clone + partial_write in the same transaction),
these scenarios do not want to add the log entry either.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
The CRCs were being invalidated at the wrong point, so the last CRC was
not being invalidated.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
The clone logic in the truncate was only cloning from the truncate
to the end of the pre-truncate object. If the next shard was being
truncated to a shorter length (which is common), then this shard
has a larger clone.

The rollback, however, can only be given a single range, so it was
given a range which covers all clones.  The problem is that if shard
0 is rolled back, then some empty space from the clone was copied
to shard 0.

Fix is easy - calculate the full clone length and apply to all shards, so it matches the rollback.
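
As a sketch of that calculation (illustrative names only): every shard clones the same range, starting from the smallest per-shard truncate offset, so it matches the single rollback range.

```python
# Illustrative sketch: one clone range covering every shard's truncate
# point, applied identically to all shards, matching the single range
# the rollback can be given.

def full_clone_range(truncate_offsets, old_length):
    """Return (start, length) of the clone applied to every shard."""
    start = min(truncate_offsets)
    return (start, old_length - start)
```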

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
rollback_info_trimmed_to changes

PGLog::rewind_divergent_log was only causing the log to be marked
dirty and checkpointed if there were divergent entries. However
after a PG split it is possible that the log can be rewound
modifying crt and/or rollback_info_trimmed_to without creating
divergent entries because the entries being rolled back were
all split into the other PG.

Failing to checkpoint the log generates a window where if the OSD
is reset you can end up with crt (and rollback_info_trimmed_to) > head.
One consequence of this is asserts like
ceph_assert(rollback_info_trimmed_to == head); firing.

Fixes: https://tracker.ceph.com/issues/55141

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
…issing

Non-primary shards may not be updated because of partial writes. This means
that the OI version for an object on these shards may be stale. An assert
in read_log_and_missing was checking that the OI version matched the have
version in a missing entry. The missing entry calculates the have version
using the prior_version from a log entry, this does not take into account
partial writes so can be ahead of the stale OI version.

Relax the assert for optimized pools to require have >= oi.version
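
The relaxed check amounts to the following (hypothetical names, sketching the rule rather than quoting read_log_and_missing):

```python
# Illustrative sketch of the relaxed assert: on optimized pools a
# partial write can leave the on-disk OI version stale on non-primary
# shards, so `have` may legitimately be ahead of oi.version.

def check_have_version(have, oi_version, optimized_pool):
    if optimized_pool:
        assert have >= oi_version
    else:
        assert have == oi_version
```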

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Optimized EC pools were blocking clean_temps from clearing pg_temp
when up == acting but up_primary != acting_primary because optimized
pools sometimes use pg_temp to force a change of primary shard.

However this can block merges which require the two PGs being
merged to have the same primary. Relax clean_temps to permit
pg_temp to be cleared so long as the new primary is not a
non-primary shard.
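
The relaxed condition can be sketched as (hypothetical names, not the real clean_temps):

```python
# Illustrative sketch: pg_temp may be cleared once up == acting, as
# long as the primary that would result is not a non-primary shard
# (which optimized EC pools must avoid).

def can_clear_pg_temp(up, acting, new_primary_shard, nonprimary_shards):
    return up == acting and new_primary_shard not in nonprimary_shards
```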

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
aainscow and others added 12 commits July 25, 2025 08:43
If multiple objects are being read as part of the same recovery (this happens
when recovering some snapshots) and a read fails, then some reads from other
shards will be necessary. However, some objects may not need a re-read. In this
case it is only important that at least one read message is sent, rather than
one read message per object.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
…ent to an object with the whiteout flag set.

Signed-off-by: Jon Bailey <jonathan.bailey1@ibm.com>
1. proc_master_log can roll forward full-writes that have
been applied to all shards but not yet completed. Add a
new function consider_adjusting_pwlc to roll-forward
pwlc. Later partial_write can be called to process the
same writes again. This can result in pwlc being rolled
backwards. Modify partial_write so it does not undo pwlc.

2. At the end of proc_master_log we want the new
authoritative view of pwlc to persist - this may be
better or worse than the stale view of pwlc held by
other shards. consider_rollback_pwlc sometimes
updated the epoch in the toversion (second value of the
range fromversion-toversion). We now always do this.
Updating toversion.epoch causes problems because this
version sometimes gets copied to last_update and
last_complete - using the wrong epoch here messes
everything up in later peering cycles. Instead we
now update fromversion.epoch. This requires changes
to apply_pwlc and an assert in Stray::react(const MInfoRec&)

3. Calling apply_pwlc at the end of proc_master_log is
too early - updating last_update and last_complete here
breaks GetMissing. We need to do this later when activating
(change to search_missing and activate)

4. proc_master_log is calling partial_write with the
wrong previous version - this causes problems after a
split when the log is sparsely populated.

5. merging PGs is not setting up pwlc correctly which
can cause issues in future peering cycles. The
pwlc can simply be reset, we need to update the epoch
to make sure this view of pwlc persists vs stale
pwlc from other shards.
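
Point (1) boils down to making pwlc monotonic; as a minimal sketch (hypothetical name, not the real partial_write):

```python
# Illustrative sketch: pwlc only moves forward. Reprocessing a write
# that proc_master_log has already rolled forward must not roll the
# recorded completion point backwards.

def update_pwlc_to(current_to, write_to):
    return max(current_to, write_to)
```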

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
…p as nop

Optimized EC pools store pgtemp with primary shards first, this was not
being taken into account by OSDMonitor::preprocess_pgtemp which meant
that the change of pgtemp from [None,2,4] to [None,4,2] for a 2+1 pool
was being rejected as a nop because the primary first encoded version
of [None,2,4] is [None,4,2].

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
When a shard is backfilling it gets given log entries
for partial writes even if they do not apply to the
shard. The code was updating the missing list but
discarding the log entry. This is wrong because the
update can be rolled backwards and the log entry is
required to revert the update to the missing list.
Keeping the log entry has a small but insignificant
performance impact.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
There were some bugs in attribute reads during recovery in optimised
EC where the attribute read failed. There were two scenarios:

1. It was not necessary to do any further reads to recover the data. This
can happen during recovery of many shards.
2. The re-read could be honoured from non-primary shards. There are
sometimes multiple copies of the shard which can be used, so a failed read
on one OSD can be replaced by a read from another.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
Explanation of bug which is being fixed:

Log entry 204'784 is an error - "_rollback_to deleting head on smithi17019940-573 because
got ENOENT|whiteout on find_object_context" so this log entry is generated outside of EC by
PrimaryLogPG. It should be applied to all shards, however osd 13(2) was a little slow and
the update got interrupted by a new epoch so it didn't apply it. All the other shards
marked it as applied and completed (there isn't the usual interlock that EC has of making
sure all shards apply the update before any complete it).

We then processed 4 partial writes applying and completing them (they didn't update osd
13(2)), then we have a new epoch and go through peering. Peering says osd 13(2) didn't see
update 204'784 (it didn't) and therefore the error log entry and the 4 partial writes need
to be rolled back. The other shards had completed those 4 partial writes so we end up with
4 missing objects on all the shards which become unfound objects.

I think the underlying bug means that log entry 204'784 isn't really complete and may
"disappear" from the log in a subsequent peering cycle. Trying to forcefully rollback a
logged error doesn't generate a missing object or a miscompare, so the consequences of the
bug are hidden. It is however tripping up the new EC code where proc_master_log is being
much stricter about what a completed write means.

Fix:
After generating a logged error we could force the next write to EC to update metadata on
all shards even if it's a partial write. This means this write won't complete unless all
shards see the logged error. This will make new EC behave the same as old EC. There is
already an interlock with EC (call_write_ordered) which is called just before generating
the log error that ensures that any in-flight writes complete before submitting the log
error. We could set a boolean flag here (at the point call_write_ordered is called is fine,
don't need to wait for the callback) to say the next write has to be to all shards. The
flag can be cleared if we generate the transactions for the next write, or we get an
on_change notification (peering will then clear up the mess)

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
…_complete

Fix bug in apply_pwlc where the primary was advancing last_complete for a
shard doing async recovery so that last_complete became equal to last_update
and it then thought that recovery had completed. It is only valid to advance
last_complete if it is equal to last_update.

Tidy up the logging in this function as consecutive calls to this function
often logged that it could advance on the 1st call and then that it could not
on the 2nd call. We only want one log message.
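
The corrected guard can be summarized as (illustrative names only):

```python
# Illustrative sketch: pwlc may only advance last_complete when the
# shard has no outstanding updates (last_complete == last_update);
# an async-recovering shard keeps last_complete behind until recovery
# actually finishes.

def may_advance_last_complete(last_complete, last_update):
    return last_complete == last_update
```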

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Undo a previous attempt at a fix that made add_log_entry skip adding partial
writes to the log if the write did not update this shard. The only case where
this code path executed is when a partial write was to an object that needs
backfilling or async recovery. For async recovery we need to keep the
log entry because it is needed to update the missing list. For backfill it
does no harm to keep the log entry.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Shards performing backfill or async recovery receive log entries
(but not transactions) for updates to missing/yet to be backfilled
objects. These log entries get applied and completed immediately
because there is nothing that can be rolled back. This causes
pwlc to advance too early and causes problems if other shards
do not complete the update and end up rolling it backwards.

This fix sets pwlc to be invalid when such a log entry is
applied and completed and it then remains invalid until the
next interval when peering runs again. Other shards will
continue to update pwlc, and any complete subset of shards
in a future interval will include at least one shard that
has continued to update pwlc.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
@rzarzynski rzarzynski self-requested a review July 29, 2025 22:52
@JonBailey1993 JonBailey1993 marked this pull request as ready for review July 30, 2025 15:16
@JonBailey1993
Contributor

jenkins test make check

Member

@ljflores ljflores left a comment


@aainscow I found an issue in teuthology that looks like it could be related to this PR. Can you take a look? The same test passed in a rerun, so it seems nondeterministic. But I flagged it since there seemed to be a problem getting the pg acting set for an EC pool, which is an area this PR is changing.

Testing ref: https://tracker.ceph.com/issues/72299

/a/skanta-2025-07-26_06:22:18-rados-wip-bharath9-testing-2025-07-26-0628-distro-default-smithi/8407536

2025-07-26T09:09:34.349 INFO:tasks.thrashosds.thrasher:No pgs; trying again
2025-07-26T09:09:34.351 INFO:tasks.thrashosds.thrasher:Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 192, in wrapper
    return func(self)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1502, in _do_thrash
    self.choose_action()()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1047, in test_pool_min_size
    acting_set = self.get_rand_pg_acting_set(pool_id)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 954, in get_rand_pg_acting_set
    while proceed():
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/contextutil.py", line 134, in __call__
    raise MaxWhileTries(error_msg)
teuthology.exceptions.MaxWhileTries: 'get_pg_stats' reached maximum tries (3) after waiting for 15 seconds

2025-07-26T09:09:34.351 ERROR:tasks.thrashosds.thrasher:exception:
Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1373, in do_thrash
    self._do_thrash()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 192, in wrapper
    return func(self)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1502, in _do_thrash
    self.choose_action()()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1047, in test_pool_min_size
    acting_set = self.get_rand_pg_acting_set(pool_id)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 954, in get_rand_pg_acting_set
    while proceed():
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/contextutil.py", line 134, in __call__
    raise MaxWhileTries(error_msg)
teuthology.exceptions.MaxWhileTries: 'get_pg_stats' reached maximum tries (3) after waiting for 15 seconds
...
2025-07-26T09:09:41.301 INFO:tasks.ceph.mon.b.smithi019.stderr:2025-07-26T09:09:41.277+0000 7fe57926d640 -1 mon.b@1(peon) e1 *** Got Signal Terminated ***
2025-07-26T09:09:41.491 INFO:tasks.ceph.mgr.x.smithi072.stderr:daemon-helper: command crashed with signal 15
2025-07-26T09:11:41.153 DEBUG:teuthology.orchestra.run:got remote process result: 124
2025-07-26T09:11:41.156 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/contextutil.py", line 32, in nested
    yield vars
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph.py", line 2012, in task
    osd_scrub_pgs(ctx, config)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph.py", line 1324, in osd_scrub_pgs
    osd_dump = manager.get_osd_dump_json()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 2651, in get_osd_dump_json
    out = self.raw_cluster_cmd('osd', 'dump', '--format=json')
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1702, in raw_cluster_cmd
    return self.run_cluster_cmd(**kwargs).stdout.getvalue()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1693, in run_cluster_cmd
    return self.controller.run(**kwargs)
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/remote.py", line 575, in run
    r = self._runner(client=self.ssh, name=self.shortname, **kwargs)
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/run.py", line 461, in run
    r.wait()
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/run.py", line 161, in wait
    self._raise_for_status()
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/run.py", line 181, in _raise_for_status
    raise CommandFailedError(
teuthology.exceptions.CommandFailedError: Command failed on smithi001 with status 124: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph osd dump --format=json'
...
Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph.py", line 1526, in run_daemon
    yield
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/contextutil.py", line 32, in nested
    yield vars
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph.py", line 2012, in task
    osd_scrub_pgs(ctx, config)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph.py", line 1324, in osd_scrub_pgs
    osd_dump = manager.get_osd_dump_json()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 2651, in get_osd_dump_json
    out = self.raw_cluster_cmd('osd', 'dump', '--format=json')
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1702, in raw_cluster_cmd
    return self.run_cluster_cmd(**kwargs).stdout.getvalue()
  File "/home/teuthworker/src/github.com_ceph_ceph-c_f5a11d7f8fa0e8642a3bd53c2803cba1227712ad/qa/tasks/ceph_manager.py", line 1693, in run_cluster_cmd
    return self.controller.run(**kwargs)
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/remote.py", line 575, in run
    r = self._runner(client=self.ssh, name=self.shortname, **kwargs)
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/run.py", line 461, in run
    r.wait()
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/run.py", line 161, in wait
    self._raise_for_status()
  File "/home/teuthworker/src/git.ceph.com_teuthology_bb07fbf8b821d133ffd961d10a72c4813e0d7a11/teuthology/orchestra/run.py", line 181, in _raise_for_status
    raise CommandFailedError(
teuthology.exceptions.CommandFailedError: Command failed on smithi001 with status 124: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph osd dump --format=json'

@aainscow
Contributor Author

aainscow commented Aug 4, 2025

Looks like a test issue to me. A pool delete is racing the thrasher. See ticket for details. No tests were changed in this PR, so I don't think it can be a regression from this PR.

@aainscow
Contributor Author

aainscow commented Aug 6, 2025

@ljflores Can we get test approval on this?

@aainscow aainscow removed the needs-qa label Aug 11, 2025
@aainscow
Contributor Author

Assuming RADOS approved - we have run QA and had a single test failure, which we concluded was not a regression.

@aainscow aainscow merged commit bc26409 into ceph:main Aug 12, 2025
16 checks passed
@aainscow
Contributor Author

Test results are here: https://pulpito.ceph.com/?branch=wip-bharath9-testing-2025-08-05-0506

They show as failed, but the only flagged problem was https://tracker.ceph.com/issues/72299 - this was shown not to be an issue.

This test was re-run without issue. (Although I now cannot find a reference to that)

@rzarzynski
Contributor

@aainscow: is there a tentacle backport for this?

@aainscow
Copy link
Contributor Author

aainscow commented Sep 5, 2025

@aainscow: is there a tentacle backport for this?

There is now...
#65411

Not sure how I missed this one, thanks for spotting. I have reviewed all my PRs and all have backports in progress if they need it.

