Allow worker to prioritize tasks based on memory production/consumption by mrocklin · Pull Request #5251 · dask/distributed

mrocklin · 2021-08-22T17:44:44Z

This adds a worker.py::TaskPrefix class that tracks consumption and
production of all computed tasks, grouped by task prefix.

Then we use these values when determining priorities,
(de)prioritizing tasks that (produce)/consume five times more data than
they (consume)/produce.

See #5250 for background

This adds a worker.py::TaskPrefix class that tracks consumption and production of all computed tasks, grouped by task prefix. Then we use these values when determining priorities, (de)prioritizing tasks that (produce)/consume five times more data than they (consume)/produce.

mrocklin · 2021-08-22T19:16:01Z

OK, I've added pausing, although it's ugly and not very future-reader friendly. This will have to be cleaned up, but it is probably simple enough for people to take a look at if they're interested.

test_resources.py rightfully complained

jrbourbeau · 2021-08-23T23:00:32Z

Thanks for pushing this up @mrocklin. I'll plan to read through #5250 and review this tomorrow

TomNicholas · 2022-05-11T23:24:28Z

I'm coming up against workers over-eagerly consuming memory, and wondering what the status of this effort is?

Really I'm just looking for a way to get workers to deprioritise certain memory-consuming root tasks (xr.open_dataset tasks). I can open another issue if that's worthwhile.

gjoseph92 · 2022-05-11T23:46:46Z

@TomNicholas see #5223 and #5555 for discussion of the underlying problem. We're focused on core stability issues (deadlocks) right now, so changes to the scheduling algorithm to address this are not getting attention at the moment.

Opening a separate issue to discuss workarounds for root task overproduction might be useful. There may be tricks you can play using worker resources, though I haven't had much success with them personally.

mrocklin mentioned this pull request Aug 22, 2021

Memory prioritization on workers #5250

Open

Pause workers when they have only poor tasks to run

5189349

continue if we have constrained tasks

d4166ee

test_resources.py rightfully complained

TomNicholas mentioned this pull request May 17, 2022

Ease memory pressure by deprioritizing root tasks? #6360

Open

mrocklin requested a review from fjetter as a code owner January 23, 2024 10:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow worker to prioritize tasks based on memory production/consumption#5251

Allow worker to prioritize tasks based on memory production/consumption#5251
mrocklin wants to merge 3 commits intodask:mainfrom
mrocklin:worker-memory-priority

mrocklin commented Aug 22, 2021

Uh oh!

mrocklin commented Aug 22, 2021

Uh oh!

jrbourbeau commented Aug 23, 2021

Uh oh!

TomNicholas commented May 11, 2022 •

edited

Loading

Uh oh!

gjoseph92 commented May 11, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

mrocklin commented Aug 22, 2021

Uh oh!

mrocklin commented Aug 22, 2021

Uh oh!

jrbourbeau commented Aug 23, 2021

Uh oh!

TomNicholas commented May 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gjoseph92 commented May 11, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

TomNicholas commented May 11, 2022 •

edited

Loading