colencoding: reuse scratch space when key decoding bytes and decimals by yuzefovich · Pull Request #70734 · cockroachdb/cockroach

yuzefovich · 2021-09-24T23:23:45Z

When we're key decoding bytes-like columns, we need to use the scratch
byte slice to decode into (in case of decimals, we might need the space
temporarily). Previously, we would always allocate a new byte slice only
to deep copy it later when calling coldata.Bytes.Set. This commit
teaches the cFetcher and the relevant decoding methods to reuse the same
scratch space which should reduce the memory allocations.

One notable change is that now when we're calling
DecodeBytesAscending, we have to make sure to perform a deep copy so
that it is safe to reuse the returned value as the scratch space in the
future.

Release note: None

cockroach-teamcity · 2021-09-24T23:23:52Z

This change is

yuzefovich · 2021-09-25T01:46:26Z

Okay, this doesn't just work, yet.

yuzefovich · 2021-09-27T17:20:29Z

I think it should now work. RFAL.

mgartner

Reviewed 2 of 2 files at r1, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @michae2 and @yuzefovich)

pkg/sql/colencoding/key_encoding.go, line 294 at r1 (raw file):

		}
		// Note that since we're about to perform a deep copy, it'll be ok to
		// return the scratch slice to be reused by the caller.

I don't understand this comment. Since we've already deep-copied, why does it matter that Set deep-copies or not?

yuzefovich

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @mgartner and @michae2)

pkg/sql/colencoding/key_encoding.go, line 294 at r1 (raw file):

Previously, mgartner (Marcus Gartner) wrote…

I don't understand this comment. Since we've already deep-copied, why does it matter that Set deep-copies or not?

There are two concerns at play: one is about not referencing the memory used by key (to allow for GC to run on the batch response), another one is making sure it is ok to return scratch in such a manner that if scratch is later modified, then vec.Bytes().Get(idx) will still return the correct result. The first concern is addressed by making a deep-copy when decoding, the second concern is addressed automatically by vec.Bytes().Set() which performs the deep copy. The comment is about the second concern.

Do you think it's worth extending the comment? Or maybe removing it to reduce possible confusion?

mgartner

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @michae2 and @yuzefovich)

pkg/sql/colencoding/key_encoding.go, line 294 at r1 (raw file):

Previously, yuzefovich (Yahor Yuzefovich) wrote…

There are two concerns at play: one is about not referencing the memory used by key (to allow for GC to run on the batch response), another one is making sure it is ok to return scratch in such a manner that if scratch is later modified, then vec.Bytes().Get(idx) will still return the correct result. The first concern is addressed by making a deep-copy when decoding, the second concern is addressed automatically by vec.Bytes().Set() which performs the deep copy. The comment is about the second concern.

Do you think it's worth extending the comment? Or maybe removing it to reduce possible confusion?

Got it. I think maybe just reword it to something like: "Set() performs a deep copy, so it is safe to return the scratch slice to the caller. Any modifications to the scratch slice made by the caller will not affect the value in the vector".

When we're key decoding bytes-like columns, we need to use the scratch byte slice to decode into (in case of decimals, we might need the space temporarily). Previously, we would always allocate a new byte slice only to deep copy it later when calling `coldata.Bytes.Set`. This commit teaches the cFetcher and the relevant decoding methods to reuse the same scratch space which should reduce the memory allocations. One notable change is that now when we're calling `DecodeBytesAscending`, we have to make sure to perform a deep copy so that it is safe to reuse the returned value as the scratch space in the future. Release note: None

yuzefovich

TFTR!

bors r+

Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @mgartner and @michae2)

craig · 2021-09-28T20:27:02Z

Build succeeded:

GitHub CI (Cockroach)

yuzefovich requested review from a team, mgartner and michae2 September 24, 2021 23:23

yuzefovich force-pushed the decoding-scratch branch from 416f22b to a862a52 Compare September 24, 2021 23:26

yuzefovich force-pushed the decoding-scratch branch 2 times, most recently from 7bed2fe to 5d469ae Compare September 25, 2021 17:43

yuzefovich added the do-not-merge bors won't merge a PR with this label. label Sep 25, 2021

yuzefovich force-pushed the decoding-scratch branch 2 times, most recently from 520f480 to d78f0aa Compare September 27, 2021 17:19

yuzefovich removed the do-not-merge bors won't merge a PR with this label. label Sep 27, 2021

mgartner reviewed Sep 27, 2021

View reviewed changes

yuzefovich commented Sep 27, 2021

View reviewed changes

mgartner approved these changes Sep 28, 2021

View reviewed changes

yuzefovich force-pushed the decoding-scratch branch from d78f0aa to 78698c7 Compare September 28, 2021 17:36

yuzefovich requested a review from a team as a code owner September 28, 2021 17:36

yuzefovich force-pushed the decoding-scratch branch from 78698c7 to d3c354d Compare September 28, 2021 17:37

yuzefovich commented Sep 28, 2021

View reviewed changes

yuzefovich removed the request for review from a team September 28, 2021 17:39

craig bot merged commit bdb4c1a into cockroachdb:master Sep 28, 2021

yuzefovich deleted the decoding-scratch branch September 28, 2021 21:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

colencoding: reuse scratch space when key decoding bytes and decimals#70734

colencoding: reuse scratch space when key decoding bytes and decimals#70734
craig[bot] merged 1 commit intocockroachdb:masterfrom
yuzefovich:decoding-scratch

yuzefovich commented Sep 24, 2021 •

edited

Loading

Uh oh!

cockroach-teamcity commented Sep 24, 2021

Uh oh!

yuzefovich commented Sep 25, 2021

Uh oh!

yuzefovich commented Sep 27, 2021

Uh oh!

mgartner left a comment

Uh oh!

yuzefovich left a comment

Uh oh!

mgartner left a comment

Uh oh!

yuzefovich left a comment

Uh oh!

craig bot commented Sep 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yuzefovich commented Sep 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cockroach-teamcity commented Sep 24, 2021

Uh oh!

yuzefovich commented Sep 25, 2021

Uh oh!

yuzefovich commented Sep 27, 2021

Uh oh!

mgartner left a comment

Choose a reason for hiding this comment

Uh oh!

yuzefovich left a comment

Choose a reason for hiding this comment

Uh oh!

mgartner left a comment

Choose a reason for hiding this comment

Uh oh!

yuzefovich left a comment

Choose a reason for hiding this comment

Uh oh!

craig bot commented Sep 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yuzefovich commented Sep 24, 2021 •

edited

Loading