
colflow: release disk resources in hash router in all cases#81491

Merged
craig[bot] merged 1 commit into cockroachdb:master from yuzefovich:routers-leak on May 19, 2022

Conversation

@yuzefovich
Member

@yuzefovich yuzefovich commented May 19, 2022

Previously, it was possible for the disk-backed spilling queue used
by the hash router outputs to not be closed when the hash router exited.
Namely, this could occur if the router output was not fully exhausted
(i.e. it could still produce more batches, but the consumer of the
router output was satisfied and called `DrainMeta`). In that scenario,
`routerOutput.closeLocked` was never called because a zero-length batch
was never given to `addBatch`, nor was the output canceled due to an
error. The flow cleanup also didn't save us because the router outputs
are not added to the `ToClose` slice.

The bug is now fixed by closing the router output in `DrainMeta`. This
behavior is acceptable because the caller is not interested in any more
data, and closing the output can be done multiple times (it is a no-op
on all calls except the first). There is no regression test since one
is quite tricky to come up with, given that the behavior of the router
outputs is non-deterministic, and I don't think it's worth introducing
special knobs inside of `DrainMeta` / `Next` for this.
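As a rough illustration of the idempotent-close pattern described above, here is a minimal Go sketch; the type and field names are hypothetical, not CockroachDB's actual API:

```go
package main

import "fmt"

// routerOutput is a hypothetical sketch of an output that owns
// disk-backed resources; names are illustrative only.
type routerOutput struct {
	closed    bool
	diskQueue *struct{ open bool } // stands in for the disk-backed spilling queue
}

// close releases disk resources. It is a no-op on every call after
// the first, so callers like DrainMeta can invoke it unconditionally.
func (o *routerOutput) close() {
	if o.closed {
		return
	}
	o.closed = true
	if o.diskQueue != nil {
		o.diskQueue.open = false // release file descriptors / disk space
	}
}

// DrainMeta signals that the consumer wants no more data, so the
// output can release its resources eagerly, even if batches remain.
func (o *routerOutput) DrainMeta() {
	o.close()
}

func main() {
	o := &routerOutput{diskQueue: &struct{ open bool }{open: true}}
	o.DrainMeta()
	o.DrainMeta() // safe: close is idempotent
	fmt.Println(o.closed, o.diskQueue.open) // prints "true false"
}
```

Because `close` guards on the `closed` flag, wiring it into `DrainMeta` cannot double-release resources even if the output was already closed through another path.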

The impact of not closing the spilling queue is that a file descriptor
might be leaked until the node restarts. Although the temporary
directory is deleted on flow cleanup, the bug would also leak disk
space, which is likewise only "fixed" by a node restart.

Fixes: #81490.

Release note: None

@cockroach-teamcity
Member

This change is Reviewable

@yuzefovich yuzefovich force-pushed the routers-leak branch 2 times, most recently from f8f49b0 to 071298f on May 19, 2022 16:53
@yuzefovich yuzefovich requested review from a team, cucaroach and michae2 May 19, 2022 16:54
@yuzefovich yuzefovich marked this pull request as ready for review May 19, 2022 16:54
@michae2 michae2 (Collaborator) left a comment


:lgtm:

Maybe this is the wrong place to ask, but why does the hash router have a disk-backed spilling queue? Naively, it seems like a router should have some kind of backpressure on inputs rather than spilling to disk...

Is it because rows destined for different receivers are mixed together in the batch? So we're trying to prevent a single slow receiver from stopping all sending?

Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @cucaroach)

@yuzefovich
Member Author

> Is it because rows destined for different receivers are mixed together in the batch? So we're trying to prevent a single slow receiver from stopping all sending?

Yes, this is the reason. Imagine an input to the hash router that can quickly produce batches, and, say, two outputs with one of the consumers being extremely slow. If each output gets half of the rows in each batch, then eventually the buffer for the slow consumer will fill up. At that point we have to make a choice: either block the input from producing more batches altogether (which means the fast consumer is also blocked) until the slow consumer catches up with its buffer, or continue buffering more rows on disk for the slow consumer (so the fast consumer is not impacted).

We choose the latter option since it seems faster overall (imagine that the fast consumer satisfies a LIMIT clause, so we won't have to wait for the slow consumer to drain its buffer at all). Also, these buffers are relatively large (each in-memory buffer gets roughly `distsql_workmem` divided by the number of outputs), so it should be quite unlikely for them to spill to disk.
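A minimal sketch of this trade-off in Go; the names and the simple byte-counting model are made up for illustration, not the real implementation:

```go
package main

import "fmt"

// perOutputBudget splits the workmem budget evenly across router
// outputs, mirroring the "distsql_workmem / number of outputs" split
// described above. Names are illustrative only.
func perOutputBudget(workmemBytes int64, numOutputs int) int64 {
	return workmemBytes / int64(numOutputs)
}

// output models one router output: batches are buffered in memory
// while they fit in the budget and spilled to disk otherwise, so a
// slow consumer never blocks the input (or the fast consumer).
type output struct {
	budget   int64
	buffered int64 // bytes currently held in memory
	spilled  int64 // bytes pushed to the disk-backed queue
}

func (o *output) addBatch(batchBytes int64) {
	if o.buffered+batchBytes <= o.budget {
		o.buffered += batchBytes // fits in the in-memory buffer
	} else {
		o.spilled += batchBytes // overflow goes to disk instead of blocking
	}
}

func main() {
	// 64 MiB workmem shared by two outputs => 32 MiB each.
	o := &output{budget: perOutputBudget(64<<20, 2)}
	o.addBatch(16 << 20) // fits: buffered in memory
	o.addBatch(24 << 20) // would exceed 32 MiB: spilled to disk
	fmt.Println(o.buffered>>20, o.spilled>>20) // prints "16 24"
}
```

The alternative design (block the input once any output's buffer is full) would make the code simpler but couples all consumers to the slowest one, which is exactly what the spilling queue avoids.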

TFTR!

bors r+

@michae2
Collaborator

michae2 commented May 19, 2022

Makes sense, thank you!

@craig
Contributor

craig bot commented May 19, 2022

Build succeeded:



Development

Successfully merging this pull request may close these issues.

colflow: hash router output can leak disk resources when not fully exhausted

3 participants