unit: add jobs that should be dispatched later back to run_queue by msekletar · Pull Request #21524 · systemd/systemd

msekletar · 2021-11-25T17:36:41Z

Assumption in edc027b was that job we first skipped because of active
ratelimit is still in run_queue. Hence we trigger the queue and dispatch
it in the next iteration. Actually we remove jobs from run_queue in
job_run_and_invalidate() before we call unit_start(). Hence if we want
to attempt to run the job again in the future we need to add it back
to run_queue.

Fixes #21458

src/core/unit.c

msekletar · 2021-11-26T10:06:39Z

Silly mistake, I should have checked once more before pushing. Sorry about that.

Anyway, should be fixed now. @poettering PTAL.

src/core/unit.c

poettering · 2021-11-26T10:21:25Z

lgtm, just one nit

bluca

Thanks for fixing this!

bluca · 2021-11-26T13:41:18Z

Bad news, this doesn't fix the issue :-(

focal-s390x:

TEST-37-RUNTIMEDIRECTORYPRESERVE:   FAIL     (1201 s)

Assumption in edc027b was that job we first skipped because of active ratelimit is still in run_queue. Hence we trigger the queue and dispatch it in the next iteration. Actually we remove jobs from run_queue in job_run_and_invalidate() before we call unit_start(). Hence if we want to attempt to run the job again in the future we need to add it back to run_queue. Fixes systemd#21458

bluca · 2021-11-26T18:24:54Z

looks like the ubuntu CI infrastructure is having issues

mrc0mmand · 2021-11-26T18:28:58Z

I'll give it a try locally, for now, just to see if it indeed helps.

mrc0mmand · 2021-11-26T18:52:55Z

It hasn't failed after 200+ iterations (whereas before it failed in <10 iterations), so it looks like the patch indeed works. But let's wait for the Ubuntu CIs to come back to life for extra measure.

poettering · 2021-11-26T20:48:42Z

src/core/mount.c

+                        continue;
+
+                job_add_to_run_queue(j);
+        }


I think we can safely dumb this down: simply enqueue any job for a mount unit again, regardless what it is. Adding something to the queue can basically be done on a hunch, there's no need to guarantee that anything really changed. Hence, I'd really enqueue any job for a mount unit without trying to be too smart here. It's not that this is going to be a million things, it's just a small number, and it makes things really robust.

poettering · 2021-11-26T22:58:50Z

I posted a dumbed down version of this PR: #21543

mrc0mmand · 2021-11-29T18:08:13Z

Superseded by #21543.

mrc0mmand mentioned this pull request Nov 25, 2021

debug-only: TEST-37 debug #21477

Closed

poettering reviewed Nov 25, 2021

View reviewed changes

src/core/unit.c Outdated Show resolved Hide resolved

poettering added pid1 reviewed/needs-rework 🔨 PR has been reviewed and needs another round of reworks labels Nov 25, 2021

msekletar force-pushed the issue-21458-empty-run-queue branch from 14df643 to bd6eb7b Compare November 26, 2021 10:04

msekletar removed the reviewed/needs-rework 🔨 PR has been reviewed and needs another round of reworks label Nov 26, 2021

poettering reviewed Nov 26, 2021

View reviewed changes

src/core/unit.c Outdated Show resolved Hide resolved

poettering added the good-to-merge/with-minor-suggestions label Nov 26, 2021

msekletar force-pushed the issue-21458-empty-run-queue branch from bd6eb7b to 64ec2c3 Compare November 26, 2021 10:52

poettering added good-to-merge/waiting-for-ci 👍 PR is good to merge, but CI hasn't passed at time of review. Please merge if you see CI has passed and removed good-to-merge/with-minor-suggestions labels Nov 26, 2021

bluca added this to the v250 milestone Nov 26, 2021

bluca approved these changes Nov 26, 2021

View reviewed changes

bluca added ci-fails/needs-rework 🔥 Please rework this, the CI noticed an issue with the PR and removed good-to-merge/waiting-for-ci 👍 PR is good to merge, but CI hasn't passed at time of review. Please merge if you see CI has passed labels Nov 26, 2021

msekletar force-pushed the issue-21458-empty-run-queue branch from 64ec2c3 to 5ca1ebb Compare November 26, 2021 16:56

bluca removed the ci-fails/needs-rework 🔥 Please rework this, the CI noticed an issue with the PR label Nov 26, 2021

poettering reviewed Nov 26, 2021

View reviewed changes

poettering mentioned this pull request Nov 26, 2021

unit: add jobs that were skipped because of ratelimit back to run_queue #21543

Merged

mrc0mmand closed this Nov 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

unit: add jobs that should be dispatched later back to run_queue#21524

unit: add jobs that should be dispatched later back to run_queue#21524
msekletar wants to merge 1 commit intosystemd:mainfrom
msekletar:issue-21458-empty-run-queue

msekletar commented Nov 25, 2021

Uh oh!

Uh oh!

msekletar commented Nov 26, 2021

Uh oh!

Uh oh!

poettering commented Nov 26, 2021

Uh oh!

bluca left a comment

Uh oh!

bluca commented Nov 26, 2021

Uh oh!

bluca commented Nov 26, 2021

Uh oh!

mrc0mmand commented Nov 26, 2021

Uh oh!

mrc0mmand commented Nov 26, 2021

Uh oh!

poettering Nov 26, 2021

Uh oh!

poettering commented Nov 26, 2021

Uh oh!

mrc0mmand commented Nov 29, 2021

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

msekletar commented Nov 25, 2021

Uh oh!

Uh oh!

msekletar commented Nov 26, 2021

Uh oh!

Uh oh!

poettering commented Nov 26, 2021

Uh oh!

bluca left a comment

Choose a reason for hiding this comment

Uh oh!

bluca commented Nov 26, 2021

Uh oh!

bluca commented Nov 26, 2021

Uh oh!

mrc0mmand commented Nov 26, 2021

Uh oh!

mrc0mmand commented Nov 26, 2021

Uh oh!

poettering Nov 26, 2021

Choose a reason for hiding this comment

Uh oh!

poettering commented Nov 26, 2021

Uh oh!

mrc0mmand commented Nov 29, 2021

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants