Refactor scheduler state mod #1913

yahoNanJing · 2022-03-03T11:24:26Z

Which issue does this PR close?

Closes #1910.

Rationale for this change

Details see #1910.

What changes are included in this PR?

Details see #1910.

Are there any user-facing changes?

It's blocked by #1908 and #1909

thinkharderdev · 2022-03-04T20:48:14Z

ballista/rust/scheduler/src/state/persistent_state.rs

+    pub(crate) codec: BallistaCodec<T, U>,
+
+    // for in-memory cache
+    executors_metadata: Arc<RwLock<HashMap<String, ExecutorMetadata>>>,


Should executor metadata be persistent? Seems like it could be useful to speed up a scheduler restart since it would have to wait for executors to re-register but it seems like in the case of push-based scheduling we need to wait for heartbeats anyway and in the poll-based scheduling we wouldn't schedule work until the executor registers.

Thanks @thinkharderdev for reviewing this PR. Actually the executor metadata is persistent to some backend storage. However, for fast reading, they are cached in memory. For all of those states in PersistentSchedulerState, firstly they will be persistent. Then they will be cached in memory.

Right, that part makes sense. I just am wondering whether it makes sense to, on a scheduler restart, to reload the executor state from persistent storage rather than let the executors re-register. In the latter case, executor metadata could be volatile state.

Suppose there's a cluster deployed on k8s in a standalone mode and we want to redeploy the scheduler. It would be better for the scheduler to reload the topology info from persistent storage. Whether the executor metadata be overdue or not, it should be managed by the executor heartbeat.

alamb · 2022-03-07T21:41:17Z

Let's try and get #1909 merged as soon as possible so you don't have to manage a large chain of PRs

The work going into Ballista these days is pretty exciting!

yahoNanJing · 2022-03-09T15:51:24Z

Hi @alamb and @thinkharderdev, could you help review this PR?

thinkharderdev · 2022-03-09T18:22:13Z

Sorry, meant to approve earlier. I'm good with this PR.

alamb · 2022-03-10T14:17:26Z

This seems like it is moving code around and thus largely unobjectionable. Merging it to keep things moving.

Thanks @yahoNanJing and @thinkharderdev

cc @liukun4515 @realno @mingmwang

github-actions bot added the ballista label Mar 3, 2022

thinkharderdev reviewed Mar 4, 2022

View reviewed changes

yahoNanJing mentioned this pull request Mar 7, 2022

Introduce Ballista query stage scheduler #1935

Merged

Refactor scheduler state mod

ac82e10

thinkharderdev approved these changes Mar 9, 2022

View reviewed changes

alamb merged commit 962c018 into apache:master Mar 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor scheduler state mod #1913

Refactor scheduler state mod #1913

Uh oh!

yahoNanJing commented Mar 3, 2022

Uh oh!

thinkharderdev Mar 4, 2022

Uh oh!

yahoNanJing Mar 4, 2022

Uh oh!

thinkharderdev Mar 5, 2022

Uh oh!

yahoNanJing Mar 7, 2022

Uh oh!

alamb commented Mar 7, 2022

Uh oh!

yahoNanJing commented Mar 9, 2022

Uh oh!

thinkharderdev commented Mar 9, 2022

Uh oh!

alamb commented Mar 10, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Refactor scheduler state mod #1913

Refactor scheduler state mod #1913

Uh oh!

Conversation

yahoNanJing commented Mar 3, 2022

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

Uh oh!

thinkharderdev Mar 4, 2022

Choose a reason for hiding this comment

Uh oh!

yahoNanJing Mar 4, 2022

Choose a reason for hiding this comment

Uh oh!

thinkharderdev Mar 5, 2022

Choose a reason for hiding this comment

Uh oh!

yahoNanJing Mar 7, 2022

Choose a reason for hiding this comment

Uh oh!

alamb commented Mar 7, 2022

Uh oh!

yahoNanJing commented Mar 9, 2022

Uh oh!

thinkharderdev commented Mar 9, 2022

Uh oh!

alamb commented Mar 10, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants