Skip to content

[core] (cgroups 4/n) adding constraint bounds checking to the CgroupManager.#56246

Merged
edoakes merged 78 commits intomasterfrom
irabbani/cgroups-4
Sep 6, 2025
Merged

[core] (cgroups 4/n) adding constraint bounds checking to the CgroupManager.#56246
edoakes merged 78 commits intomasterfrom
irabbani/cgroups-4

Conversation

@israbbani
Copy link
Copy Markdown
Contributor

This PR continues to implement the CgroupManager. CgroupManager will be used by the Raylet to manage the cgroup hierarchy. The implementation will be completed in subsequent PRs.

This PR stacks on #56186.

For more details about the resource isolation project see #54703.

In this PR:

  • CgroupManager now bound checks constraints (e.g. cpu.weight is within [1,10000].
  • CgroupDriver no longer bound checks constraints.

irabbani and others added 30 commits July 24, 2025 20:39
to perform cgroup operations.

Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
instead of clone for older kernel headers < 5.7 (which is what we have
in CI)

Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: Ibrahim Rabbani <israbbani@gmail.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: Ibrahim Rabbani <israbbani@gmail.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: Ibrahim Rabbani <israbbani@gmail.com>
bug
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
fix CI.

Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
up
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
@israbbani israbbani changed the title [core] (cgroups 4/n) adding constraint bounds checking to the CgroupManager. [core] (cgroups 4/n) adding constraint bounds checking to the CgroupManager. Sep 4, 2025
Ibrahim Rabbani added 3 commits September 5, 2025 15:02
Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Signed-off-by: irabbani <irabbani@anyscale.com>
@edoakes edoakes enabled auto-merge (squash) September 5, 2025 22:58
@edoakes edoakes merged commit 704c579 into master Sep 6, 2025
6 checks passed
@edoakes edoakes deleted the irabbani/cgroups-4 branch September 6, 2025 00:05
sampan-s-nayak pushed a commit to sampan-s-nayak/ray that referenced this pull request Sep 8, 2025
…anager. (ray-project#56246)

This PR continues to implement the CgroupManager. CgroupManager will be
used by the Raylet to manage the cgroup hierarchy. The implementation
will be completed in subsequent PRs.

This PR stacks on ray-project#56186.

For more details about the resource isolation project see
ray-project#54703.

In this PR:
* CgroupManager now bound checks constraints (e.g. cpu.weight is within
[1,10000].
* CgroupDriver no longer bound checks constraints.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: sampan <sampan@anyscale.com>
edoakes added a commit that referenced this pull request Sep 8, 2025
… and SysFsCgroupDriver (#56255)

This PR stacks on #56246.

For more details about the resource isolation project see
#54703.

This PR adds the DeleteCgroup method (along with unit and integration
tests) to the CgroupDriverInterface and the SysFsCgroupDriver to allow
CgroupManager to clean up the cgroup hierarchy is graceful shutdown.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
jugalshah291 pushed a commit to jugalshah291/ray_fork that referenced this pull request Sep 11, 2025
…anager. (ray-project#56246)

This PR continues to implement the CgroupManager. CgroupManager will be
used by the Raylet to manage the cgroup hierarchy. The implementation
will be completed in subsequent PRs.

This PR stacks on ray-project#56186.

For more details about the resource isolation project see
ray-project#54703.

In this PR:
* CgroupManager now bound checks constraints (e.g. cpu.weight is within
[1,10000].
* CgroupDriver no longer bound checks constraints.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: jugalshah291 <shah.jugal291@gmail.com>
jugalshah291 pushed a commit to jugalshah291/ray_fork that referenced this pull request Sep 11, 2025
… and SysFsCgroupDriver (ray-project#56255)

This PR stacks on ray-project#56246.

For more details about the resource isolation project see
ray-project#54703.

This PR adds the DeleteCgroup method (along with unit and integration
tests) to the CgroupDriverInterface and the SysFsCgroupDriver to allow
CgroupManager to clean up the cgroup hierarchy is graceful shutdown.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: jugalshah291 <shah.jugal291@gmail.com>
wyhong3103 pushed a commit to wyhong3103/ray that referenced this pull request Sep 12, 2025
…anager. (ray-project#56246)

This PR continues to implement the CgroupManager. CgroupManager will be
used by the Raylet to manage the cgroup hierarchy. The implementation
will be completed in subsequent PRs.

This PR stacks on ray-project#56186.

For more details about the resource isolation project see
ray-project#54703.

In this PR:
* CgroupManager now bound checks constraints (e.g. cpu.weight is within
[1,10000].
* CgroupDriver no longer bound checks constraints.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: yenhong.wong <yenhong.wong@grabtaxi.com>
ZacAttack pushed a commit to ZacAttack/ray that referenced this pull request Sep 24, 2025
…anager. (ray-project#56246)

This PR continues to implement the CgroupManager. CgroupManager will be
used by the Raylet to manage the cgroup hierarchy. The implementation
will be completed in subsequent PRs.

This PR stacks on ray-project#56186.

For more details about the resource isolation project see
ray-project#54703.

In this PR:
* CgroupManager now bound checks constraints (e.g. cpu.weight is within
[1,10000].
* CgroupDriver no longer bound checks constraints.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: zac <zac@anyscale.com>
ZacAttack pushed a commit to ZacAttack/ray that referenced this pull request Sep 24, 2025
… and SysFsCgroupDriver (ray-project#56255)

This PR stacks on ray-project#56246.

For more details about the resource isolation project see
ray-project#54703.

This PR adds the DeleteCgroup method (along with unit and integration
tests) to the CgroupDriverInterface and the SysFsCgroupDriver to allow
CgroupManager to clean up the cgroup hierarchy is graceful shutdown.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: zac <zac@anyscale.com>
dstrodtman pushed a commit that referenced this pull request Oct 6, 2025
…anager. (#56246)

This PR continues to implement the CgroupManager. CgroupManager will be
used by the Raylet to manage the cgroup hierarchy. The implementation
will be completed in subsequent PRs.

This PR stacks on #56186.

For more details about the resource isolation project see
#54703.

In this PR:
* CgroupManager now bound checks constraints (e.g. cpu.weight is within
[1,10000].
* CgroupDriver no longer bound checks constraints.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: Douglas Strodtman <douglas@anyscale.com>
dstrodtman pushed a commit to dstrodtman/ray that referenced this pull request Oct 6, 2025
… and SysFsCgroupDriver (ray-project#56255)

This PR stacks on ray-project#56246.

For more details about the resource isolation project see
ray-project#54703.

This PR adds the DeleteCgroup method (along with unit and integration
tests) to the CgroupDriverInterface and the SysFsCgroupDriver to allow
CgroupManager to clean up the cgroup hierarchy is graceful shutdown.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: Douglas Strodtman <douglas@anyscale.com>
justinyeh1995 pushed a commit to justinyeh1995/ray that referenced this pull request Oct 20, 2025
… and SysFsCgroupDriver (ray-project#56255)

This PR stacks on ray-project#56246.

For more details about the resource isolation project see
ray-project#54703.

This PR adds the DeleteCgroup method (along with unit and integration
tests) to the CgroupDriverInterface and the SysFsCgroupDriver to allow
CgroupManager to clean up the cgroup hierarchy is graceful shutdown.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025
…anager. (ray-project#56246)

This PR continues to implement the CgroupManager. CgroupManager will be
used by the Raylet to manage the cgroup hierarchy. The implementation
will be completed in subsequent PRs.

This PR stacks on ray-project#56186.

For more details about the resource isolation project see
ray-project#54703.

In this PR:
* CgroupManager now bound checks constraints (e.g. cpu.weight is within
[1,10000].
* CgroupDriver no longer bound checks constraints.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025
… and SysFsCgroupDriver (ray-project#56255)

This PR stacks on ray-project#56246.

For more details about the resource isolation project see
ray-project#54703.

This PR adds the DeleteCgroup method (along with unit and integration
tests) to the CgroupDriverInterface and the SysFsCgroupDriver to allow
CgroupManager to clean up the cgroup hierarchy is graceful shutdown.

---------

Signed-off-by: Ibrahim Rabbani <irabbani@anyscale.com>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

core Issues that should be addressed in Ray Core go add ONLY when ready to merge, run all tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants