Conversation
|
We just tried to boostrap a new Ceph cluster in our company via cephadm using Ceph Quincy, EL9 and Docker and this broke our cluster every time we tried the process from scratch. ping @travisn |
|
This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days. |
|
I believe this PR is still relevant, but I'm not sure what I can do to move it along. |
|
@rzarzynski @ljflores Looks like this is waiting for a review from Core. |
|
This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days. |
|
Is there anything blocking this getting merged? |
|
@rzarzynski it's in a batch! |
|
@edef1c this a suspect to have broken the build: 19.0.0-466-g15215d01/src/common/SubProcess.cc @ljflores fyi |
Hmm, okay. Seems like focal's linux-libc-dev package doesn't include close_range.h. We don't actually need the header (it just supplies the |
|
I've dropped the header include. I think things should be fine now unless |
Fix rook/rook#10110, which occurs when _SC_OPEN_MAX/RLIMIT_NOFILE is set to very large values (2^30), leaving fork_function pegging a core busylooping. The glibc wrappers closefrom(3)/close_range(3) are not available before glibc 2.34, so we invoke the syscall directly. When glibc 2.34 is old enough to be a reasonable hard minimum dependency, we should switch to using closefrom. If we're not running on (recent enough) Linux, we fall back to the existing approach. Fixes: https://tracker.ceph.com/issues/59125 Signed-off-by: edef <edef@edef.eu>
|
I ended up taking a closer look, and found that neither @ljflores @yuriw Could we get another run? It should be fully baked wrt backward compat at this point, as far as I can tell. |
|
The failed API tests seem to go to a Jenkins 404 page, so I'm not sure what's up there. |
|
jenkins test api |
|
It seems like this is done and approved. Is this waiting for some milestone to merge? |
|
We need rerun the QA per #50622 (comment). |
|
See the same on Reef v18.2.4, definitely required backport to reef |
Hrm, this hasn't been backported to squid either. @prazumovsky Can you create a squid backport as well? |
It would go under https://tracker.ceph.com/issues/69778. |
Fix rook/rook#10110, which occurs when _SC_OPEN_MAX/RLIMIT_NOFILE is set to very large values (2^30), leaving fork_function pegging a core busylooping.
The glibc wrappers closefrom(3)/close_range(3) are not available before glibc 2.34, so we invoke the syscall directly. When glibc 2.34 is old enough to be a reasonable hard minimum dependency, we should switch to using closefrom.
If we're not running on (recent enough) Linux, we fall back to the existing approach.
Checklist
Show available Jenkins commands
jenkins retest this pleasejenkins test classic perfjenkins test crimson perfjenkins test signedjenkins test make checkjenkins test make check arm64jenkins test submodulesjenkins test dashboardjenkins test dashboard cephadmjenkins test apijenkins test docsjenkins render docsjenkins test ceph-volume alljenkins test ceph-volume toxjenkins test windows