Revive #2740: Allow tasks with restrictions to be stolen #3069
mrocklin merged 12 commits into dask:master from
Conversation
Addresses stealing tasks with resource restrictions, as mentioned in dask#1851. If a task has hard restrictions, do not just give up on stealing. Instead, use the restrictions to determine which workers can steal it before attempting to execute a steal operation. A follow up PR will be needed to address the issue of long-running tasks not being stolen because the scheduler has no information about their runtime.
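The predicate described above might look something like the following standalone sketch. The names (`can_steal`, and the fields on the simplified `TaskState`/`WorkerState` classes) are stand-ins mirroring the scheduler's real classes, not the actual implementation in this PR:

```python
from dataclasses import dataclass, field

@dataclass
class TaskState:
    # Simplified stand-in for the scheduler's TaskState.
    loose_restrictions: bool = False
    host_restrictions: set = field(default_factory=set)
    worker_restrictions: set = field(default_factory=set)
    resource_restrictions: dict = field(default_factory=dict)

@dataclass
class WorkerState:
    # Simplified stand-in for the scheduler's WorkerState.
    address: str = ""
    host: str = ""
    resources: dict = field(default_factory=dict)

def can_steal(thief: WorkerState, ts: TaskState) -> bool:
    """Return True if `thief` satisfies the task's hard restrictions."""
    if ts.loose_restrictions:
        return True  # loose restrictions are preferences, not requirements
    if ts.host_restrictions and thief.host not in ts.host_restrictions:
        return False
    if ts.worker_restrictions and thief.address not in ts.worker_restrictions:
        return False
    # Resource check: the thief must have at least the required amount
    # of every restricted resource.
    for resource, amount in ts.resource_restrictions.items():
        if thief.resources.get(resource, 0) < amount:
            return False
    return True
```

Only workers passing this check would be considered as steal targets, rather than giving up on restricted tasks entirely.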
Co-Authored-By: Matthew Rocklin <mrocklin@gmail.com>
Ping original author @calebho, and original reviewers @mrocklin, @martindurant, @TomAugspurger.

👍 Just to say it would be wonderful if this issue could be solved; this has been awaited for several months in our HPC center!
TomAugspurger
left a comment
Overall, this looks good. I think the concern last time was slowing performance of the no-restriction case. I can look into writing a benchmark for these.
distributed/stealing.py
Outdated
    return _has_resources(thief, victim.resources)

def _has_resources(ws, required_resources):
This could likely be inlined in _can_steal
ok, I've inlined the function
Thanks for reviving this. I haven't had the bandwidth to continue pushing for this, so please feel free to take over from here. Happy to provide review input.
distributed/stealing.py
Outdated
    ``victim``.
    """
    if not _has_restrictions(ts):
        return True
This check makes sense to have semantically, but it's also unnecessary given where _can_steal is called.
| """ | ||
| return not ts.loose_restrictions and ( | ||
| ts.host_restrictions or ts.worker_restrictions or ts.resource_restrictions | ||
| ) |
I wonder if maybe we can cache this on the TaskState object itself in order to avoid recomputation.
if ts.restricted:
    ...

It seems that sometimes these are updated:

# in scheduler.py L1707
ts = self.tasks[k]
ts.loose_restrictions = True

If that's the case, I'd prefer to not cache, so that we don't have to worry about invalidating the cache (though perhaps this is the only case?).
That's in update_graph, which is when we construct most tasks. My guess is that this isn't an issue. We won't check ts.restricted before calling those lines.
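To illustrate the trade-off discussed above, here is a hypothetical sketch of what caching the flag on the task would entail. The class and field names are simplified stand-ins; the setter shows the invalidation burden the reviewers preferred to avoid:

```python
import functools

class TaskState:
    """Simplified stand-in: cache the restriction flag, invalidate on mutation."""

    def __init__(self, loose=False, hosts=None, workers=None, resources=None):
        self._loose_restrictions = loose
        self.host_restrictions = hosts or set()
        self.worker_restrictions = workers or set()
        self.resource_restrictions = resources or {}

    @functools.cached_property
    def restricted(self):
        # Computed once, then served from the instance __dict__.
        return not self._loose_restrictions and bool(
            self.host_restrictions
            or self.worker_restrictions
            or self.resource_restrictions
        )

    @property
    def loose_restrictions(self):
        return self._loose_restrictions

    @loose_restrictions.setter
    def loose_restrictions(self, value):
        self._loose_restrictions = value
        # Every mutation site must drop the cached value, or `restricted`
        # goes stale -- this is the maintenance cost of caching.
        self.__dict__.pop("restricted", None)
```

Recomputing the cheap boolean expression on each call avoids this bookkeeping entirely, which is why the uncached version won out here.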
In general this seems fine. I am concerned about performance. Anecdotally I'll say that work stealing accounts for something like 20-30% of scheduler time when under heavy load. At some point I went through and micro-optimized things a bit. It would be good to be mindful of costs here. The scheduler is definitely a bottleneck in lots of larger important user workflows (Pangeo cares about this a bunch), so we need to be performance conscious. I think that we can achieve that fairly easily here, though, with modest work.

@seibert checking in. Is this still something that you're likely to pursue?

Yes, I had to switch to doing some conference prep / speaking this week, but I am still interested in solving this issue, as it reduces throughput in the Dask-based testing system we have. I'll need some guidance on how to benchmark the impact of any changes, though.

Unfortunately we don't have good benchmarks for the Dask scheduler (or any part of Dask, really), although this would be great work for someone in the future. I've made a few concrete suggestions above that I think might be helpful. Also, if anything pops out at you it might be good to bring it up (I trust your performance intuition over pretty much anyone's).
benchmarks the improvement on dask/distributed#3069
Added a benchmark for the improvement in dask/dask-benchmarks#22. Things look good for the new one: tasks are being stolen by the new worker with the resource. I think we'll want to add a test using resources here; right now it's just testing worker restrictions. I'll do that tomorrow. Finally, I'll need to verify that we haven't regressed on performance for the common case of no restrictions. I think we have a benchmark for that, but will verify. cc @mattilyra.
TomAugspurger
left a comment
I think we'll want to add a test using resources here. Right now it's just testing worker restrictions. I'll do that tomorrow.
We apparently already have a test for that: test_steal_resource_restrictions.
verify that we haven't regressed on performance for the common case of no restrictions
Things seem OK over in the dask benchmarks repo.
# master
[ 75.00%] ··· client.ClientSuite.time_trivial_tasks 220±5ms
[100.00%] ··· client.WorkerRestrictionsSuite.time_trivial_tasks_restrictions 1.06±0s
# this PR
[ 75.00%] ··· client.ClientSuite.time_trivial_tasks 224±6ms
[100.00%] ··· client.WorkerRestrictionsSuite.time_trivial_tasks_restrictions 629±20ms
| """ | ||
| return not ts.loose_restrictions and ( | ||
| ts.host_restrictions or ts.worker_restrictions or ts.resource_restrictions | ||
| ) |
There was a problem hiding this comment.
It seems that sometimes these are updated
# in scheduler.py L1707
ts = self.tasks[k]
ts.loose_restrictions = TrueIf that's the case, I'd prefer to not cache, so that we don't have to worry about invalidating the cache (though perhaps this is the only case?).
I added a few comments there. I don't think that benchmark is likely to be sensitive to the performance impacts here. I would recommend cranking the stealing interval up super high.
To be more explicit, let's say that we're aiming for a 200us overhead per task. How much are we willing to increase that overhead to support stealing restricted tasks? 10us? 20us? This isn't a very common case, so I think that even those numbers might be fairly high. This isn't a long term solution, but I encourage people to run a workload locally on your laptop and then look through the
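To put a per-task budget like 10-20us in perspective, a quick micro-benchmark of a pure-Python restriction check can be run with `timeit`. The function below is an illustrative stand-in, not the scheduler's actual code:

```python
import timeit

def has_restrictions(loose, hosts, workers, resources):
    # Pure-Python stand-in for the per-task restriction check; the real
    # fields live on the scheduler's TaskState.
    return not loose and bool(hosts or workers or resources)

# Time the common fast path (no restrictions), the case the budget protects.
n = 100_000
per_call = min(timeit.repeat(
    lambda: has_restrictions(False, set(), set(), {}),
    number=n, repeat=3,
)) / n
print(f"~{per_call * 1e6:.2f} us per check")
```

On typical hardware a check like this costs well under a microsecond, which is why the boolean test itself is unlikely to be the bottleneck compared to iterating over workers.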
I just did this myself to verify my previous experience. I was surprised to learn that we seem to be spending almost all of our time sending and receiving things from sockets that are supposed to be non-blocking. This is probably something that we should look into in the near future.
I would really benefit from this PR being merged, unless there is a workaround...? My workload consists of short-running (5-20s) GPU tasks, followed by longer-running (2-10 minute) CPU tasks. I set a CPU or GPU resource on my workers to make sure tasks are run on the right hosts. I launch the GPU tasks (.submit), followed by the CPU tasks. If I e.g. restart (or add) a CPU worker, that worker will not steal any of the tasks from the other workers.
For now, a workaround is to disable work stealing, and add a worker plugin with the changes from this PR. But we're hoping to get this merged in the next day or so. Just need to verify a few performance things.
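For reference, work stealing can be disabled through Dask's configuration system; if I recall the key correctly, it is `distributed.scheduler.work-stealing`, set in a config file like the sketch below (or via the corresponding `DASK_DISTRIBUTED__SCHEDULER__WORK_STEALING` environment variable):

```yaml
# e.g. ~/.config/dask/distributed.yaml, read before the scheduler starts
distributed:
  scheduler:
    work-stealing: false
```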
TomAugspurger
left a comment
@leej3 did some nice work expanding our benchmarks in this area in dask/dask-benchmarks#35. From the timings there, we have
On master:
========== ========= ========= ========= =========
-- steal_interval
---------- ---------------------------------------
resource 0.01 0.1 1 100
========== ========= ========= ========= =========
1 1.08±0s 1.06±0s 1.07±0s 1.07±0s
None 436±0ms 430±0ms 436±0ms 436±0ms
========== ========= ========= ========= =========
On #3069:
========== ========= ========= ========= =========
-- steal_interval
---------- ---------------------------------------
resource 0.01 0.1 1 100
========== ========= ========= ========= =========
1 625±0ms 542±0ms 547±0ms 644±0ms
None 428±0ms 425±0ms 431±0ms 438±0ms
========== ========= ========= ========= =========
I'm reasonably confident that this benchmark is hitting the relevant code (proved by the with-resource restriction case being faster, since we have stealing). And it looks like it isn't slowing down the common case of no resource restrictions.
@mrocklin do the benchmarks above sufficiently satisfy your concerns about performance?
Not entirely. We're still iterating through every worker for every stealable task. I think that this will get more interesting when this starts being run in production. But I'm at least satisfied that it doesn't seem to be slowing down the common case, and folks seem pretty excited about trying this out. So let's merge and see what happens.
Thanks a lot everyone for getting this in, will try this release ASAP |
I recently stumbled over an issue that is fixed by #2740, but noticed that review on that PR seems to have stalled because a bunch of extraneous commits were accidentally merged, and the original author hasn't opened a new PR or cleaned up the existing one.
This PR attempts to recreate that PR by cherry-picking those commits (so the original author still gets credit), and adds a minor fix. If there are additional changes needed to finish out code review on this, I'm happy to do those.