osd/rados/rgw/cephfs: Modernize cls interface with compile time safety by aainscow · Pull Request #66258 · ceph/ceph

aainscow · 2025-11-14T13:56:05Z

This is a preparation PR for EC direct reads, but is also provides protection for balanced reads.

This PR does the following:

Adds a new C++ interface, with compile time safety, which prevents any code from attempting to call an exec which may do a write without marking the OP as write.
Deletes any C++ interface which can be used unsafely.
Enforces that IOCtx::exec() and IOCtx::aio_exec() are read only.
Refactors all clients within Ceph to use the new interface.
Deletes unused clients (cls_acl, cls_crypto and key_value_store) + associated tests

Following reviews of earlier versions and discussion at the CDM, we are NOT:

Deprecating the C interfaces and associated bindings to other languages.
Providing any safety in the client for the C / other languages interfaces

A later PR will add policing to the OSD:

Umbrella clients sending an illegal exec (i.e. one where a read contains a write) will be failed by the OSD.
Pre-Umbrella clients sending an illegal exec will cause a health warning to be generated, encouraging the user to update their client.

Tracker: https://tracker.ceph.com/issues/73986

Contribution Guidelines

To sign and title your commits, please refer to Submitting Patches to Ceph.
If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.
When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

Tracker (select at least one)
- References tracker ticket
- Very recent bug; references commit where it was introduced
- New feature (ticket optional)
- Doc update (no ticket needed)
- Code cleanup (no ticket needed)
Component impact
- Affects Dashboard, opened tracker ticket
- Affects Orchestrator, opened tracker ticket
- No impact that needs to be tracked
Documentation (select at least one)
- Updates relevant documentation
- No doc update is appropriate
Tests (select at least one)
- Includes unit test(s)
- Includes integration test(s)
- Includes bug reproducer
- No tests

Show available Jenkins commands

jenkins test classic perf Jenkins Job | Jenkins Job Definition
jenkins test crimson perf Jenkins Job | Jenkins Job Definition
jenkins test signed Jenkins Job | Jenkins Job Definition
jenkins test make check Jenkins Job | Jenkins Job Definition
jenkins test make check arm64 Jenkins Job | Jenkins Job Definition
jenkins test submodules Jenkins Job | Jenkins Job Definition
jenkins test dashboard Jenkins Job | Jenkins Job Definition
jenkins test dashboard cephadm Jenkins Job | Jenkins Job Definition
jenkins test api Jenkins Job | Jenkins Job Definition
jenkins test docs ReadTheDocs | Github Workflow Definition
jenkins test ceph-volume all Jenkins Jobs | Jenkins Jobs Definition
jenkins test windows Jenkins Job | Jenkins Job Definition
jenkins test rook e2e Jenkins Job | Jenkins Job Definition

You must only issue one Jenkins command per-comment. Jenkins does not understand
comments with more than one command.

bill-scales

How are you going to identify the set of class methods that need to be annotated as read only to get the performance benefit of direct reads?

Testing different types of I/O and looking for ops that don't get treated as direct reads is one method, but unless you have very good test coverage you might miss something, especially if an exec call is only added to ops when a specific feature (e.g. rbd mirroring) is in use.

I suspect it might be easier to just modify as many of the cls client code functions (and implementations in neorados) to add the read only flag rather than try and find cases via testing. Maybe that should be a 2nd PR?

src/test/librados/misc_cxx.cc

aainscow · 2025-11-17T11:30:38Z

How are you going to identify the set of class methods that need to be annotated as read only to get the performance benefit of direct reads?

Testing different types of I/O and looking for ops that don't get treated as direct reads is one method, but unless you have very good test coverage you might miss something, especially if an exec call is only added to ops when a specific feature (e.g. rbd mirroring) is in use.

I suspect it might be easier to just modify as many of the cls client code functions (and implementations in neorados) to add the read only flag rather than try and find cases via testing. Maybe that should be a 2nd PR?

There is a balance between these two approaches. The first is conservative and relies on testing identifying the necessary commands. Since we only need balanced reads to work for "most" reads for any one user, I was not worrying about edge cases. I was going with the test-and-see approach as a result.

The other method involves code inspection, then updating the cls clients in both src/cls and src/neorados/cls. An API exists for a client to manually specify these class calls, so there might be holes there too. Since this is a manual process and there are significant numbers of read only calls, I was worried about making mistakes. Since human error is a possibility, regression testing of every cls method is a requirement - this seemed like significant amounts of work and I am not sure we have the time and resources for this.

Another approach could be to look for any class method which has a decent unit test and only update the exec call for code paths which are tested. Clearly there is still room for human error in identifying which tests exist, but I think this might be a good half-way approach.

aainscow · 2025-11-17T11:32:13Z

jenkins test api

aainscow · 2025-11-17T13:39:33Z

jenkins test api

cbodley · 2025-11-17T16:29:41Z

is there some way the rados client library can base this direct-read decision solely on the use of ObjectWriteOperation vs ObjectReadOperation, without requiring applications to mark their calls to exec?

that wouldn't help with IoCtx::exec(), but https://tracker.ceph.com/issues/65889 led to two commits af17631 and 4612195 that replaced some IoCtx::exec() with calls to ObjectWriteOperation::exec(). if this implies IoCtx::exec() should be used only for reads, then that could be flagged as read-only and enforced on the osd with EIO

aainscow · 2025-11-17T17:04:15Z

is there some way the rados client library can base this direct-read decision solely on the use of ObjectWriteOperation vs ObjectReadOperation, without requiring applications to mark their calls to exec?

that wouldn't help with IoCtx::exec(), but https://tracker.ceph.com/issues/65889 led to two commits af17631 and 4612195 that replaced some IoCtx::exec() with calls to ObjectWriteOperation::exec(). if this implies IoCtx::exec() should be used only for reads, then that could be flagged as read-only and enforced on the osd with EIO

There is currently no policing in librados, or the OSD that the client correctly used ObjectReadOperation vs ObjectWriteOperation. I did consider simply overriding the ObjectOperation::exec() functions in ObjectReadOperation and assuming they are all read only. However, if any client has mistakenly used an ObjectReadOperation::exec() incorrectly, then this change would prevent that client from working. I considered this to be too high risk.

I am open to arguments to the contrary on this, as overrides would be a really neat way of implementing this.

Unfortunately, this would not work for NEORADOS, where I can't see a way of being explicit about read vs write.

cbodley · 2025-11-17T17:49:32Z

thanks @aainscow,

I did consider simply overriding the ObjectOperation::exec() functions in ObjectReadOperation and assuming they are all read only.

i was hoping we could handle this in the ObjectReadOperation overloads for librados::IoCtx::operate()/aio_operate() (and neorados::RADOS::execute() for ReadOp) without needing to treat exec() as a special case

However, if any client has mistakenly used an ObjectReadOperation::exec() incorrectly, then this change would prevent that client from working. I considered this to be too high risk.

i think it would be nice to catch any misuse*. ceph components may not have enough test coverage to rely on osd enforcement to catch everything, but a code audit probably could. worth discussing, at least?

edit: *especially now that #56180 relies on this for local/balanced read affinity

aainscow · 2025-11-18T16:06:20Z

@cbodley In response to your last comment, I would be interested in your take on my new templates, in the second commit.

There is quite a bit of boiler plate there in order to keep the legacy API/ABI in place, but I think this is a nice approach to policing that the client correctly marks class call/methods as read only or not.

When using the new interface:

It is a compile-time error to use a read-only class call with a write method.
The same mechanism defines the client and OSD definition of read/write/promote.
The cls/method strings are defined in one place only. (note I have only converted cls_version so far).

I think this should make balanced reads much safer, as well as providing a safe interface for my split-reads work.

aainscow · 2025-11-18T16:06:56Z

@adamemerson - I have included you on the review, as you were last to modify the exec code in neorados.

aainscow · 2025-11-18T16:08:06Z

@SrinivasaBharath I have removed the needs_qa. If you have not started a run, no need to include this PR. If you have started a run, do not abort it. The old code should not cause any regressions.

cbodley · 2025-11-18T16:28:58Z

@aainscow i like the idea of compile-time checks, but i wonder how much of this is already covered by the "cls client" interfaces like cls_version_client.h:

void cls_version_set(librados::ObjectWriteOperation& op, obj_version& ver);
void cls_version_inc(librados::ObjectWriteOperation& op);
void cls_version_inc(librados::ObjectWriteOperation& op, obj_version& ver, VersionCond cond);
void cls_version_read(librados::ObjectReadOperation& op, obj_version *objv);
void cls_version_check(librados::ObjectOperation& op, obj_version& ver, VersionCond cond);

assuming ceph code is calling these functions instead of making raw calls to exec(), i'm not sure we're gaining much from the new templates - for example, this already fails to compile:

librados::ObjectReadOperation op;
cls_version_inc(op);

the fact that cls_version_check() takes the base librados::ObjectOperation& does make it difficult to flag its exec as read-only, but does ec-direct-read really need special handling for exec? once we submit the op to Objecter, it'll be flagged as CEPH_OSD_FLAG_READ and/or CEPH_OSD_FLAG_WRITE - can't the direct-read decision rely on that alone?

aainscow · 2025-11-18T16:36:18Z

@cbodley Clients which call cls_* code are protected, so this gives safety, assuming the developer is well behaved and sticks to these functions.

The cls_* code itself, however, is unprotected. With balanced reads and EC split ops, data-integrity is at risk if a class call is marked as read only when it is not, so I wanted to add paranoid levels of compile-time protection.

If we are going to refactor all read-only execs to use the new read-only class call, then I think this code provides protection for the refactoring that we will require.

IMO, the new interface is much cleaner (in C++ at least). Having class and method strings hanging around in multiple places is "code smell".

cbodley · 2025-11-18T17:42:24Z

i agree that the exec interface isn't great, but as long as we're committed to librados api/abi compatibility there will always be an unsafe way to use it. so i think the osd's enforcement is critical to making this safe. you're currently relying on a new CEPH_OSD_CLS_FLAG_READ_ONLY flag for that, so need a reliable way for the client to mark read-only execs

but does the osd not already have some logic to reject client writes that were misdirected at a non-primary osd? if not, maybe that would be a more general solution to this class of problems?

src/osd/osd_op_util.cc

aainscow · 2025-11-18T17:51:52Z

That is a good point, so perhaps "data integrity" is going too far. The OSD will never action a write for a read-only op and only a read-only op would ever be executed. Any call would still, however, be broken (the op will be failed back to the client) and as such, a more robust interface is still important.

So I think we have three ways to go:

Stick with commit one only + maybe hunt down some more use-cases we need to split ops in RGW.
Allow through all calls which are marked as balanced reads and trust the client. I would need to find a new way of policing this in the OSD, but maybe thats OK.
The refactor I have proposed (or something similar).

I am going to dwell on this... Option (2) makes me uneasy....

src/include/rados/librados.hpp

aainscow · 2026-02-04T11:57:42Z

@adamemerson I have addressed your reviews with explicit commits. Please verify:

These latest commits are compile-only changes and do not need a new teuthology qa run.
You are happy with the changes - I wondered if you would prefer that I carry the string_view deeper into the interfaces

adamemerson

Looks good to me! Drop the string_view change. Everything else is fine and should be good compile-only.

src/include/neorados/RADOS.hpp

aainscow · 2026-02-05T20:34:21Z

jenkins test make check

aainscow · 2026-02-05T20:34:59Z

Latest push was to fill in a missing signed-off statement

aainscow · 2026-02-05T20:37:15Z

jenkins test make check

adamemerson · 2026-02-11T17:16:36Z

jenkins test make check

aainscow · 2026-02-18T15:30:22Z

jenkins test make check

src/include/rados/objclass.h

debian/libradospp-dev.install

src/include/rados/cls_flags.h

src/include/rados/cls_traits.hpp

src/cls/version/cls_version_types.h

aainscow · 2026-03-06T14:40:31Z

Recent changes mean I need to request a new review

ronen-fr · 2026-03-12T12:14:47Z

@aainscow (and @SrinivasaBharath ) - the only new failure in the test run is a CLS compilation
issue, and as such - may well be related to the PR.
See skanta-2026-03-08_04:40:13-rados-wip-bharath7-testing-2026-03-07-1317-distro-default-trial/94050/

aainscow · 2026-03-16T09:35:08Z

@batrick I have rebased this to squash all the correcionss following their review. Only new addition is addressing the compile failure in the test run and new release notes as requested.

Status Quo: Librados currently uses string-based identifiers for RADOS class method calls (exec) in both ObjectReadOperation and ObjectWriteOperation. The API does not distinguish between methods that modify object state and those that are read-only. Problem: This lack of semantic enforcement has become a critical safety gap with the introduction of EC Direct Reads. Because the EC direct read path is optimized for performance, it may process the same read-only operation multiple times (e.g., across different shards or during retries). Previously, while technically a violation of API semantics, a "read-that-writes" would often function correctly because operations were processed in-order and without retries on the read path. However, in the context of EC direct reads, executing a non-idempotent write through the read path poses a silent and critical risk to data integrity. Solution: Introduce a template-based trait system to encode method semantics into the API. By tagging methods with traits (MethodRead or MethodWrite), the compiler now prevents state-modifying methods from being added to ObjectReadOperation. This ensures that only side-effect-free methods reach the EC direct read path, eliminating the risk of accidental double-writes at the source. Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

Migrate existing internal RADOS classes and the associated test infrastructure to utilize the new template-based exec() interface. While the previous commit introduced the mechanism, this change applies it across the codebase to eliminate legacy string-based calls in internal logic. Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

The osd code was not present, the radosacl tool did nothing useful and the scratchtoolpp was testing something that doesn't exist. key_value_store is optionally compiled into the key store, but seems to serve no useful purpose. Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

There much mocking of exec going on in these tests. An extensive search and replace was required to find them all. Note that the failure looks like memory corruption when manipulating the ops array - but the actually issue is the mocking not intercepting the exec_impl call (and similar) correctly. Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

Update the Rados Gateway driver to utilize the new trait-based class method interface. This replaces legacy string-based exec() calls with the typesafe template-based alternatives. Key Changes: Updated reset_stats in buckets.cc to use cls::user::method::reset_user_stats2. Updated RGWRadosBILogTrimCR in rgw_cr_rados.cc to use cls::rgw::method::bi_log_trim. This change ensures that RGW operations are correctly categorized as reads or writes at compile-time, aligning with the broader effort to improve RADOS class method safety and OSD routing efficiency. Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

aainscow · 2026-03-19T08:02:48Z

jenkins test api

aainscow · 2026-03-19T16:35:28Z

jenkins test api

aainscow · 2026-03-20T11:49:13Z

Re-run of failed tests worked:
https://pulpito-ng.ceph.com/runs/cjames-2026-03-20_11:30:30-rados-read_only_execs-distro-default-trial/jobs/111854

Re-scheduling for QA

aainscow requested review from a team as code owners November 14, 2025 13:56

github-actions bot added core tests labels Nov 14, 2025

bill-scales reviewed Nov 17, 2025

View reviewed changes

src/test/librados/misc_cxx.cc Outdated Show resolved Hide resolved

src/test/librados/misc_cxx.cc Show resolved Hide resolved

aainscow added the needs-qa label Nov 17, 2025

SrinivasaBharath added the wip-bharath10-testing label Nov 18, 2025

aainscow removed the needs-qa label Nov 18, 2025

aainscow requested review from adamemerson and cbodley November 18, 2025 15:59

cbodley reviewed Nov 18, 2025

View reviewed changes

src/osd/osd_op_util.cc Outdated Show resolved Hide resolved

cbodley reviewed Nov 18, 2025

View reviewed changes

src/include/rados/librados.hpp Outdated Show resolved Hide resolved

src/include/rados/librados.hpp Outdated Show resolved Hide resolved

aainscow requested a review from idryomov November 19, 2025 14:22

adamemerson self-assigned this Nov 21, 2025

aainscow changed the title ~~osd: Introduce read only class calls and associated APIs~~ osd/rados/rgw/cephfs: Modernise cls interface with compile time safety Nov 24, 2025

aainscow force-pushed the read_only_execs branch from 4fdf9ea to 98f02ad Compare November 24, 2025 21:29

aainscow requested a review from a team as a code owner November 24, 2025 21:29

adamemerson approved these changes Feb 4, 2026

View reviewed changes

src/include/neorados/RADOS.hpp Outdated Show resolved Hide resolved

aainscow force-pushed the read_only_execs branch 2 times, most recently from fdf4293 to 7481e15 Compare February 5, 2026 19:33

bill-scales approved these changes Feb 9, 2026

View reviewed changes

aainscow force-pushed the read_only_execs branch from 7481e15 to dca0c73 Compare February 11, 2026 22:40

rzarzynski reviewed Mar 3, 2026

View reviewed changes

src/include/rados/objclass.h Show resolved Hide resolved

debian/libradospp-dev.install Outdated Show resolved Hide resolved

src/include/rados/cls_flags.h Outdated Show resolved Hide resolved

src/include/rados/cls_traits.hpp Show resolved Hide resolved

rzarzynski approved these changes Mar 4, 2026

View reviewed changes

src/cls/version/cls_version_types.h Outdated Show resolved Hide resolved

aainscow removed the wip-bharath15-testing label Mar 6, 2026

SrinivasaBharath added the wip-bharath7-testing label Mar 7, 2026

aainscow force-pushed the read_only_execs branch from 54fff2f to c7e95c8 Compare March 16, 2026 09:33

github-actions bot added the documentation label Mar 16, 2026

aainscow added 6 commits March 18, 2026 13:28

*: Update PendingReleaseNotes for librados.hpp CLS breaking API change

3faf8da

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>

aainscow force-pushed the read_only_execs branch from c7e95c8 to 3faf8da Compare March 18, 2026 13:29

Conversation

aainscow commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Contribution Guidelines

Checklist

Uh oh!

bill-scales left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

aainscow commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aainscow commented Nov 17, 2025

Uh oh!

aainscow commented Nov 17, 2025

Uh oh!

cbodley commented Nov 17, 2025

Uh oh!

aainscow commented Nov 17, 2025

Uh oh!

cbodley commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aainscow commented Nov 18, 2025

Uh oh!

aainscow commented Nov 18, 2025

Uh oh!

aainscow commented Nov 18, 2025

Uh oh!

cbodley commented Nov 18, 2025

Uh oh!

aainscow commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cbodley commented Nov 18, 2025

Uh oh!

Uh oh!

aainscow commented Nov 18, 2025

Uh oh!

Uh oh!

Uh oh!

aainscow commented Feb 4, 2026

Uh oh!

adamemerson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aainscow commented Feb 5, 2026

Uh oh!

aainscow commented Feb 5, 2026

Uh oh!

aainscow commented Feb 5, 2026

Uh oh!

adamemerson commented Feb 11, 2026

Uh oh!

aainscow commented Feb 18, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aainscow commented Mar 6, 2026

Uh oh!

ronen-fr commented Mar 12, 2026

Uh oh!

aainscow commented Mar 16, 2026

Uh oh!

aainscow commented Mar 19, 2026

Uh oh!

aainscow commented Mar 19, 2026

Uh oh!

aainscow commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

aainscow commented Nov 14, 2025 •

edited

Loading

aainscow commented Nov 17, 2025 •

edited

Loading

cbodley commented Nov 17, 2025 •

edited

Loading

aainscow commented Nov 18, 2025 •

edited

Loading