Conversation
pkg/sql/tablewriter.go
func (td *tableDeleter) row(
	ctx context.Context, values tree.Datums, traceKV bool,
) (tree.Datums, error) {
	if td.batchSize > 10000 {
Extract the constant into the global scope and give it an explanatory comment.
If this merges into master, please also cherry-pick into 2.0.
pkg/sql/tablewriter.go, line 740 at r1: Previously, knz (kena) wrote…
Not sure the constant needs to be in the global scope given it is only accessed here, but 👍 on making a constant. Also, shouldn't this be
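The reviewers' suggestion could look something like the sketch below; the identifier `tableDeleterBatchSize` and its comment are hypothetical stand-ins, not the names actually merged:

```go
package main

import "fmt"

// tableDeleterBatchSize is a hypothetical name for the extracted
// constant. It caps how many deletions accumulate in a single KV batch
// before the batch is flushed, bounding the gateway node's memory use,
// rather than leaving the magic number 10000 inline in row().
const tableDeleterBatchSize = 10000

func main() {
	fmt.Println(tableDeleterBatchSize)
}
```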
In the absence of a fast path deletion, `DELETE` would generate one potentially giant batch and OOM the gateway node. This became obvious quickly via heap profiling. Added chunking of the deletions to `tableDeleter`. SQL folks may have stronger opinions on how to achieve this, or a better idea of a preexisting chunking mechanism that works more reliably. If nothing else, this change serves as a prototype to fix cockroachdb#17921. With this change, `roachtest run drop` works (as in, it doesn't out-of-memory right away; the run takes a long time so I can't yet confirm that it actually passes). Release note (sql change): deleting many rows at once now consumes less memory.
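The chunking idea described in the commit message can be sketched as follows; the function name, the `[]string` key type, and the `flush` callback are illustrative assumptions, not the actual `tableDeleter` API:

```go
package main

import "fmt"

// deleteInChunks sketches the chunking approach: instead of building
// one potentially giant batch for the whole DELETE, flush whenever the
// accumulated batch reaches chunkSize entries, so memory stays bounded.
func deleteInChunks(keys []string, chunkSize int, flush func([]string)) int {
	flushes := 0
	batch := make([]string, 0, chunkSize)
	for _, k := range keys {
		batch = append(batch, k)
		if len(batch) >= chunkSize {
			flush(batch)
			flushes++
			batch = batch[:0] // reuse the backing array for the next chunk
		}
	}
	// Flush any trailing partial chunk.
	if len(batch) > 0 {
		flush(batch)
		flushes++
	}
	return flushes
}

func main() {
	keys := make([]string, 25000)
	n := deleteInChunks(keys, 10000, func([]string) {})
	fmt.Println(n) // 3: two full chunks of 10000 plus one of 5000
}
```

Real code would run each flushed chunk in its own KV batch (and copy the slice if the callback retains it); this sketch only shows the control flow.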
@tschottdorf (+@knz, @andreimatei) when I ran this locally, I noticed that we seem to be scanning and buffering the entire set of rows to be deleted. It seems with this change that the total memory footprint should not soar into the GiBs. Why doesn't SQL consume rows from the scanner in a streaming fashion to feed to the table writer?
Spencer, is this what you're looking for?
#16180