
Add option to disable pod affinity #235

Closed

Wielewout wants to merge 2 commits into actions:main from Wielewout:optional-pod-affinity

Conversation

@Wielewout
Contributor

In #212 pod affinity was added when the kube scheduler is enabled. While a much better default, it makes less optimal use of resources in the cluster (and can even break some setups, see #201 (comment)).

By default pod affinity remains set, but setting ACTIONS_RUNNER_USE_POD_AFFINITY=false skips the pod affinity rules.

When it is disabled, the runner and workflow pod can be scheduled on different nodes again. It is then up to the user to provide RWX volumes in the cluster, a node selector for architecture when using a multi-arch cluster (on both the runner and workflow pod so they match), and so on.
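To make the behaviour concrete, here is a minimal sketch of how the hook could gate the affinity on this variable. The helper names, label key, and pod-spec handling are illustrative assumptions, not the actual diff in this PR:

```typescript
// Sketch only: names and the label selector below are illustrative assumptions.
import type { V1Affinity, V1PodSpec } from '@kubernetes/client-node'

// Anything other than an explicit "false" keeps the current default of
// co-scheduling the workflow pod with the runner pod.
function usePodAffinity(): boolean {
  return (process.env.ACTIONS_RUNNER_USE_POD_AFFINITY ?? 'true').toLowerCase() !== 'false'
}

// Pin the workflow pod to the node running the runner pod by matching a
// label on the runner pod (hypothetical label key for this sketch).
function buildRunnerPodAffinity(runnerPodName: string): V1Affinity {
  return {
    podAffinity: {
      requiredDuringSchedulingIgnoredDuringExecution: [
        {
          labelSelector: {
            matchExpressions: [
              { key: 'runner-pod-name', operator: 'In', values: [runnerPodName] }
            ]
          },
          topologyKey: 'kubernetes.io/hostname'
        }
      ]
    }
  }
}

// Attach the affinity only when the opt-out variable is not set to "false".
function applyPodAffinity(spec: V1PodSpec, runnerPodName: string): void {
  if (usePodAffinity()) {
    spec.affinity = buildRunnerPodAffinity(runnerPodName)
  }
}
```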

@zchenyu

zchenyu commented Jul 16, 2025

Thanks! fwiw, I had the exact same implementation in my fork :)

@gigabyte132

gigabyte132 commented Aug 6, 2025

This would help us a lot in our setup as well; without it, our GPU runners break whenever no GPUs are available on the node where the job pod runs (even though GPUs are available on other nodes).

@oed-lipphausent

This is exactly what we need! :D

Currently, we have the problem that the runner pods are created as expected, but we always have to wait for the workflow pods. Since the introduction of node affinity, our nodes do not have enough resources to process every workflow pod for every runner pod.
This is why we originally switched to the Kube Scheduler, but since the change, we have the problem again.

The change with node affinity makes using the Kube Scheduler pointless for us. We deliberately chose this path so that we could use smaller nodes and scale the number of nodes to save costs.

However, since this PR has been open for more than a month, I wonder if it is realistic to expect this change to be considered in the near future.

I don't know who is responsible for this, maybe @nikola-jokic, but it would be nice to see some feedback on this PR.

@nikola-jokic
Collaborator

Hey everyone, we are currently working on a PR that will disable the affinity and volume mounts completely.

@oed-lipphausent

@nikola-jokic, thank you for the update. I didn't know you were working on something like this, and I'm very excited to see how it develops. I think it's a nice idea.

@jennyluciav

@nikola-jokic, thanks for working on this. Do you have a rough timeline for when the fix will be implemented?

@nikola-jokic
Collaborator

Hey @jennyluciav,

The target date is Oct 13th, but it might be sooner. Most of the work has been done, and I'd love to test it a bit more to make sure everything works at least for most cases. I'm a bit worried about permissions: with volume mounts we could easily mount into any directory, while with copy there are certain folders where it might result in an error, but that mostly affects user volume mounts, which are likely not frequent.
There are a few things we are working on right now, not just this feature, but if we finish that work sooner, we will issue a release as soon as we are done and will not wait for the target date.

@LeonoreMangold

LeonoreMangold commented Oct 6, 2025

@nikola-jokic @Wielewout Why was this pull request closed? Is the work being moved to another PR? I also have this problem: I'm using an RWX work volume and I need the runner and workflow pods to be able to schedule to different nodes to fit the resource requests they are given.

@Wielewout
Contributor Author

Wielewout commented Oct 6, 2025

@nikola-jokic @Wielewout Why was this pull request closed? Is the work being moved to another PR?

This PR became obsolete because of #244. Instead of using a volume in both the runner and workflow pod, the runner will copy files to the workflow pod. This also removes the requirement to keep both pods on the same node.
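As a rough illustration of the copy-based approach (not the actual #244 implementation; the namespace, pod name, container name, and paths below are assumptions), copying the workspace with the Kubernetes JavaScript client could look like this:

```typescript
import * as k8s from '@kubernetes/client-node'

// Sketch only: all names and paths are illustrative assumptions.
async function copyWorkspaceToWorkflowPod(): Promise<void> {
  const kc = new k8s.KubeConfig()
  kc.loadFromDefault()

  const cp = new k8s.Cp(kc)
  // cpToPod streams a tar of the local directory into the target container,
  // so the runner and workflow pods no longer need a shared RWX volume or
  // affinity rules that keep them on the same node.
  await cp.cpToPod(
    'arc-runners',        // namespace (assumed)
    'workflow-pod',       // workflow pod name (assumed)
    'job',                // container name (assumed)
    '/home/runner/_work', // source path on the runner
    '/__w'                // target path in the workflow pod (assumed)
  )
}
```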

@vvanouytsel
Contributor

Did anyone get #244 in a working state?
When using container-hooks version 0.8.0 my Initialize Containers step always fails with:

```
Run '/home/runner/k8s/index.js'
(node:66) [DEP0005] DeprecationWarning: Buffer() is deprecated due to security and usability issues. Please use the Buffer.alloc(), Buffer.allocUnsafe(), or Buffer.from() methods instead.
(Use `node --trace-deprecation ...` to show where the warning was created)
Error: Error: cpToPod failed after 30 attempts: {}
Error: Process completed with exit code 1.
Error: Executing the custom container implementation failed. Please contact your self hosted runner administrator.
```

@vvanouytsel
Contributor

I still think an option to disable nodeAffinity should be added.

The solution in #244 does not support all use cases.

For example:

Any chance we can get this PR reopened?

cc @Wielewout @nikola-jokic

@nikola-jokic
Collaborator

Hey @vvanouytsel, we can't keep two parallel versions of the hook at the same time. Since the 0.8.0 implementation is not heavily tested in every environment, we include both versions on the runner, so you can fall back to the 0.7.0 version of the hook.
If you are blocked, you can rebuild the hook and put your own version inside the container.
In the meantime, we will be fixing the reported issues in the 0.8.0 version, which should eventually fully replace the 0.7.0 version.
