Sign every compute task with run ID to correlate response by hendrikmakait · Pull Request #7463 · dask/distributed

hendrikmakait · 2023-01-09T19:13:11Z

Supersedes #7372

Tests added / passed
Passes pre-commit run --all-files

github-actions · 2023-01-09T19:58:27Z

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

      24 files ±  0       24 suites ±0 10h 18m 43s ⏱️ + 5m 18s
  3 315 tests +  1   3 208 ✔️ +  3   105 💤 ±0 2 ❌ - 2
39 084 runs +12 37 193 ✔️ +16 1 889 💤 - 2 2 ❌ - 2

For more details on these failures, see this check.

Results for commit 4c4663c. ± Comparison against base commit 6dd3c70.

♻️ This comment has been updated with latest results.

hendrikmakait · 2023-01-10T15:03:09Z

distributed/tests/test_cancelled_state.py

                (f3.key, "executing", "released", "cancelled", {}),
                (f3.key, "cancelled", "fetch", "resumed", {}),
                (f3.key, "resumed", "memory", "memory", {}),
+                (


The resumed task should be rejected by the scheduler because its run ID is stale, which triggers the task to be released and recomputed.

…ging stimulus_task_finished

hendrikmakait · 2023-01-10T19:02:33Z

test_deadlock_cancelled_after_inflight_before_gather_from_worker and test_scheduler_story_stimulus_success seem related.

hendrikmakait · 2023-01-11T10:47:40Z

From what I understand, failures are likely unrelated and just general CI flakiness.

hendrikmakait · 2023-01-11T10:49:04Z

distributed/scheduler.py

                    "stimulus_id": stimulus_id,
                }
            ]
+        elif ts.run_id != run_id:


The clauses in stimulus_task_finished could likely be improved, but that would also mean some breaking changes to the transition logic and should be done in a PR focussing on that.

distributed/scheduler.py

fjetter and others added 7 commits January 4, 2023 16:39

Sign every compute task with a unique counter to correlated responses

a6d9cc7

Fix attempt before dispatching to threadpool

4ef8975

Fix tests

7ecab11

Check for sentinel

68dfeeb

Minor

e4b2674

Run ID

b1af340

Fix dummy

8e0f23c

hendrikmakait added 4 commits January 10, 2023 14:22

Fix freeing of outdated keys

8552568

Merge branch 'main' into sign-tasks-with-run-id

061c3e8

Improve stimulus handling

4272a41

More verbose story

062c019

hendrikmakait commented Jan 10, 2023

View reviewed changes

hendrikmakait added 4 commits January 10, 2023 17:00

Fix

fc3b690

Retore old condition

6ea5f78

Improve conditions

f9e67b1

Return to old-style checks to avoid unintended side-effects when chan…

15e80cf

…ging stimulus_task_finished

hendrikmakait added 3 commits January 11, 2023 10:11

Naming

f3008ff

Cleanup

399733a

Fix tests

215ab59

hendrikmakait marked this pull request as ready for review January 11, 2023 10:47

hendrikmakait commented Jan 11, 2023

View reviewed changes

fjetter reviewed Jan 11, 2023

View reviewed changes

distributed/scheduler.py Outdated Show resolved Hide resolved

fjetter approved these changes Jan 11, 2023

View reviewed changes

Reduce complexity

4c4663c

fjetter mentioned this pull request Jan 12, 2023

Getting concurrent.futures._base.CancelledError from simple binary tree built from futures #4612

Open

fjetter merged commit 2e5ce9e into dask:main Jan 17, 2023

hendrikmakait mentioned this pull request Jan 19, 2023

Sign task-erred with run_id and reject outdated responses #7489

Closed

gjoseph92 mentioned this pull request Jan 28, 2023

Scheduler TaskState objects should be unique, not hashed by key #7510

Open

hendrikmakait deleted the sign-tasks-with-run-id branch June 17, 2023 06:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sign every compute task with run ID to correlate response#7463

Sign every compute task with run ID to correlate response#7463
fjetter merged 19 commits intodask:mainfrom
hendrikmakait:sign-tasks-with-run-id

hendrikmakait commented Jan 9, 2023 •

edited

Loading

Uh oh!

github-actions bot commented Jan 9, 2023 •

edited

Loading

Uh oh!

hendrikmakait Jan 10, 2023

Uh oh!

hendrikmakait commented Jan 10, 2023

Uh oh!

hendrikmakait commented Jan 11, 2023

Uh oh!

hendrikmakait Jan 11, 2023 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hendrikmakait commented Jan 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Unit Test Results

Uh oh!

hendrikmakait Jan 10, 2023

Choose a reason for hiding this comment

Uh oh!

hendrikmakait commented Jan 10, 2023

Uh oh!

hendrikmakait commented Jan 11, 2023

Uh oh!

hendrikmakait Jan 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hendrikmakait commented Jan 9, 2023 •

edited

Loading

github-actions bot commented Jan 9, 2023 •

edited

Loading

hendrikmakait Jan 11, 2023 •

edited

Loading