sql: enable simple stats for all types#49801

Merged
craig[bot] merged 1 commit into cockroachdb:master from madelynnblue:inv-stats
Jun 9, 2020
Conversation

@madelynnblue
Contributor

Improve datum fingerprinting (used by rowexec/sampler.go) to work
correctly for all types. Previously, due to the hardcoded assumption that
JSON was the only non-key-encodable type, it would error when presented
with a geo/geom type.

Use MustBeValueEncoded when fingerprinting and when determining whether a
type can have a histogram created. This should be future-proof if we add
new types or teach existing types how to key encode.

Fixes #35844
Informs #48219

Release note: None

@madelynnblue madelynnblue requested review from rytaft and yuzefovich June 2, 2020 16:28
@madelynnblue madelynnblue requested a review from a team as a code owner June 2, 2020 16:28
@cockroach-teamcity
Member

This change is Reviewable

@madelynnblue
Contributor Author

@yuzefovich can you review the change to Fingerprint?

Member

@yuzefovich yuzefovich left a comment


Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @mjibson, @rytaft, and @yuzefovich)


pkg/sql/sqlbase/encoded_datum.go, line 291 at r1 (raw file):

	// case uses ed.Encode, which has a fast path if the encoded bytes are already
	// the right encoding.
	if MustBeValueEncoded(typ) {

I think @rohany made some recent changes here so that ArrayFamily would use ed.Encode method below although it "must be value encoded".

@rohany
Contributor

rohany commented Jun 2, 2020

I think @rohany made some recent changes here so that ArrayFamily would use ed.Encode method below although it "must be value encoded".

Nah, arrays have key encodings now.

Member

@yuzefovich yuzefovich left a comment


Reviewed 1 of 4 files at r1.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @mjibson and @rytaft)


pkg/sql/sqlbase/encoded_datum.go, line 291 at r1 (raw file):

Previously, yuzefovich wrote…

I think @rohany made some recent changes here so that ArrayFamily would use ed.Encode method below although it "must be value encoded".

So I guess this change looks good, right Rohan?

@rohany
Contributor

rohany commented Jun 2, 2020

yeah

Contributor Author

@madelynnblue madelynnblue left a comment


Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @rohany and @rytaft)


pkg/sql/sqlbase/encoded_datum.go, line 291 at r1 (raw file):

Previously, yuzefovich wrote…

So I guess this change looks good, right Rohan?

Arrays now only must value encode if their contents must:

func MustBeValueEncoded(semanticType *types.T) bool {

@madelynnblue
Contributor Author

madelynnblue commented Jun 2, 2020

Ok sql exec people. Here's a stumper. The Fingerprint change appears to have broken a few tests.

SELECT * FROM kv WHERE (k,v) IN (SELECT * FROM kv)

is now returning 0 rows only in the fakedist-vec-auto-disk configuration.

query error couldn't find WITH expression \"new_values\" with ID 1

no longer produces an error but is reported as success in the fakedist-vec-auto-disk and fakedist-disk configurations. Any idea why that might happen? It is possible the Fingerprint change has exposed some places with incorrect assumptions about encodings.

For example, sqlbase.MustBeValueEncoded says that tuples must be value encoded:

case types.JsonFamily, types.TupleFamily, types.GeographyFamily, types.GeometryFamily:

and yet the above query uses the tuple key encoding in https://github.com/cockroachdb/cockroach/blob/master/pkg/sql/sqlbase/column_type_encoding.go#L148 via the disk-backed hash joiner, even though tuples don't have a corresponding decode method. Their key encode method is also highly suspect. (I discovered this in a separate PR in which I was trying to make sense of MustBeValueEncoded, EncodeTableKey, and DecodeTableKey, none of which agree on what must be key encoded; see https://github.com/mjibson/cockroach/tree/enc-tests for details. It surprisingly showed up here too, so it looks like my investigation now needs to be completed instead of abandoned.)

Questions:

  1. Why is the tuple key encoding used in the disk hash joiner if MustBeValueEncoded returns true for tuples?
  2. Why does the tuple key encoding exist since it has no corresponding decode method?
  3. Why did the two above tests change for certain configurations due to the Fingerprint change?

@rohany
Contributor

rohany commented Jun 2, 2020

I think there is unfortunately some slight overloading of how MustBeValueEncoded is used. It seems to mostly answer "can this type be indexed?" rather than "can we actually call EncodeTableKey on this type?". We have added key encodings for types we would never actually index, for use in various key-encoding needs during execution (group by, order by); in those cases the key encoding is used purely as a hash, since some of this code was written before we had Fingerprint. I think this answers 1 and 2; I don't know about 3.

@madelynnblue
Contributor Author

Ok so maybe...we can use Fingerprint in more places now? I'll audit uses of EncodeTableKey.

@rohany
Contributor

rohany commented Jun 2, 2020

That sounds reasonable — encdatum.encode is probably useful to look at too. Most of the calls come from there.

@yuzefovich
Member

I think I see the issue: in DiskRowContainer.encodeRow we use Encode (on the right side of the hash join), but when we're probing in hashDiskRowBucketIterator.Reset we're calling encodeEqualityCols which under the hood is using Fingerprint.

Collaborator

@rytaft rytaft left a comment


stats changes :lgtm:

Reviewed 2 of 4 files at r1, 2 of 2 files at r2.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @mjibson and @rohany)


pkg/sql/opt/exec/execbuilder/testdata/stats, line 265 at r2 (raw file):


query T
EXPLAIN (OPT, VERBOSE) SELECT DISTINCT j FROM tj WHERE j IS NULL

I wonder if this test is going to be flaky... if auto stats are collected before this runs, we'll get different results. Maybe add this to the top of this file (since I think other tests in this file could also have a problem):

statement ok
SET CLUSTER SETTING sql.stats.automatic_collection.enabled = false

(I think auto stats are currently disabled on logic tests, but if we enable them we still want this to work)

@yuzefovich
Member

This diff fixes the issue with row containers:

diff --git a/pkg/sql/rowcontainer/hash_row_container.go b/pkg/sql/rowcontainer/hash_row_container.go
index 93fe65c1d2..b8ad6bfe95 100644
--- a/pkg/sql/rowcontainer/hash_row_container.go
+++ b/pkg/sql/rowcontainer/hash_row_container.go
@@ -121,7 +121,12 @@ func encodeColumnsOfRow(
                if row[colIdx].IsNull() && !encodeNull {
                        return nil, true, nil
                }
-               appendTo, err = row[colIdx].Fingerprint(colTypes[i], da, appendTo)
+               // Note: we cannot compare VALUE encodings because they contain column IDs
+               // which can vary.
+               // TODO(radu): we should figure out what encoding is readily available and
+               // use that (though it needs to be consistent across all rows). We could add
+               // functionality to compare VALUE encodings ignoring the column ID.
+               appendTo, err = row[colIdx].Encode(colTypes[i], da, sqlbase.DatumEncoding_ASCENDING_KEY, appendTo)
                if err != nil {
                        return appendTo, false, err
                }
@@ -422,7 +427,7 @@ func (i *hashMemRowIterator) computeKey() error {
        i.curKey = i.curKey[:0]
        for _, col := range i.storedEqCols {
                var err error
-               i.curKey, err = row[col].Fingerprint(i.types[col], &i.columnEncoder.datumAlloc, i.curKey)
+               i.curKey, err = row[col].Encode(i.types[col], &i.columnEncoder.datumAlloc, sqlbase.DatumEncoding_ASCENDING_KEY, i.curKey)
                if err != nil {
                        return err
                }

The explanation is that HashDiskRowContainer is implemented using DiskRowContainer, with the equality columns (i.e. the columns to hash) of the former being the ordering columns for the latter, and those ordering columns are used to compute the keys of the rows (in encodeRow) so that we can store the rows in sorted order. This is how we store the build (right) side of the join, but for the probe (left) side we use hashMemRowIterator to compute the key of the probing row. The key computation methods must be the same in both places; otherwise, the results of the join can be incorrect.

#45229 broke this synchronization by changing the key computation method in hashMemRowIterator.computeKey to use Fingerprint. So we have to either use Fingerprint in encodeRow or use Encode in computeKey. The first choice doesn't seem to work because Fingerprint doesn't provide the ordering we need in DiskRowContainer, so we need to use the second approach. @rohany does this sound reasonable?

@rohany
Contributor

rohany commented Jun 3, 2020

That seems correct, but we will then error out for types that we can't key encode. We can't know in advance whether we are going to spill to disk here either.

Why does the disk row container necessarily need ordering properties of the columns?

@yuzefovich
Member

yuzefovich commented Jun 3, 2020

Why does the disk row container necessarily need ordering properties of the columns?

DiskRowContainer implements "hash row container" by sorting all rows on the ordering (i.e. hash) columns and using the ordering property to provide the "hashing" behavior (i.e. we would seek to the first row that has the same hash columns and then iterate from that row one row at a time forward until the hash columns remain the same). If we don't have the ordering property, then the necessary invariant that all rows that hash to the same value are contiguous is not maintained.

Member

@yuzefovich yuzefovich left a comment


We merged the fix to the disk row container, so once you rebase, the build should be green.

Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @mjibson and @rohany)

Contributor Author

@madelynnblue madelynnblue left a comment


Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @rohany and @rytaft)


pkg/sql/opt/exec/execbuilder/testdata/stats, line 265 at r2 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

I wonder if this test is going to be flaky... if auto stats are collected before this runs, we'll get different results. Maybe add this to the top of this file (since I think other tests in this file could also have a problem):

statement ok
SET CLUSTER SETTING sql.stats.automatic_collection.enabled = false

(I think auto stats are currently disabled on logic tests, but if we enable them we still want this to work)

Done.

Copy link
Copy Markdown
Collaborator

@rytaft rytaft left a comment


:lgtm:

Reviewed 4 of 4 files at r3.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @rohany)

@madelynnblue
Contributor Author

bors r+

@craig
Contributor

craig bot commented Jun 8, 2020

Merge conflict (retrying...)

@craig
Contributor

craig bot commented Jun 8, 2020

Build failed

@madelynnblue
Contributor Author

bors r+

@craig
Contributor

craig bot commented Jun 9, 2020

Build failed (retrying...)

@craig
Contributor

craig bot commented Jun 9, 2020

Canceled

@madelynnblue
Contributor Author

bors r+

@craig
Contributor

craig bot commented Jun 9, 2020

Build failed

@madelynnblue
Contributor Author

bors r+

@craig
Contributor

craig bot commented Jun 9, 2020

Build succeeded

@craig craig bot merged commit 5fad258 into cockroachdb:master Jun 9, 2020
@madelynnblue madelynnblue deleted the inv-stats branch June 9, 2020 18:05
Successfully merging this pull request may close these issues.

opt: support creating and using statistics on JSON columns

5 participants