opt: Make additional perf improvements#28609
Conversation
rytaft
left a comment
There was a problem hiding this comment.
Reviewed 2 of 2 files at r1, 7 of 7 files at r2, 2 of 2 files at r3, 45 of 45 files at r4, 83 of 83 files at r5.
Reviewable status:complete! 0 of 0 LGTMs obtained (and 1 stale)
pkg/sql/opt/bench/out, line 0 at r5 (raw file):
I think this file was added by accident
pkg/sql/opt/exec/execbuilder/testdata/join, line 185 at r4 (raw file):
---- render · · (x, two, plus1) · │ render 0 COALESCE(x, x) · ·
I thought in this case the name would be qualified since x is ambiguous...
pkg/sql/opt/memo/expr_view_format.go, line 546 at r5 (raw file):
// as "a.x + b.x" to "x". It is better to err on the side of not shortening // than to incorrectly shorten a column name representing an expression. func isSimpleColumnName(label string) bool {
The comment for this function needs to be updated since now we're only using it to decide whether to wrap an expression in quotation marks. Alternatively, maybe we can just delete it since column names are no longer expressions. (Unnamed columns are now ?column?).
pkg/sql/opt/memo/memo_format.go, line 244 at r4 (raw file):
} // forEachDependency runs f for each child group of g.
update comment - f doesn't exist anymore
petermattis
left a comment
There was a problem hiding this comment.
r2 looks good to me. The rest is beyond me.
Reviewed 2 of 2 files at r1, 7 of 7 files at r2, 2 of 2 files at r3, 45 of 45 files at r4.
Reviewable status:complete! 1 of 0 LGTMs obtained
Inlining logicalPropsBuilder allows us to reuse the keyBuffer when computing statistics. Release note: None
Inline Metadata struct into Memo struct, rather than creating Metadat on the heap. This gives a small perf boost. Release note: None
Move ExprView formatting-related methods into new expr_view_format.go file. Release note: None
The current code builds column labels that are qualified with their owner
table name (e.g. a.x). This has resulted in several problems:
1. Sometimes we want the unqualified column name, and extracting this
has resulted in the addition of some awkward parsing code.
2. Constructing the qualified column names adds measurable time to
simple queries, even though the only time we use it is for debug
and explain display. It's better to build qualified names only when
we need them rather than up-front.
The trouble is that switching to store unqualified column labels has a
big effect on code all over the opt packages. This commit tries to minimize
the impact on tests, to make reviews easier. It contains a hack to keep
output as close as it can to the old output. In the next commit, the hack is
removed, which triggers a major update of tests across opt.
As part of the changes required for reworking column labeling, I refactored
the ExprView and Memo formatting code so that it's more consolidated and
consistent. Formatting state is now always passed via ExprFmtCtx, which has
been expanded to include a scratch buffer as well as the Memo (rather than
just Metadata).
Release note: None
This removes the hack added in the previous commit so that the new column formatting rules can work. We now only qualify column names if it was explicitly requested via a formatting flag, or else if not qualifying would result in ambiguous column names, such as in this case: select a.x, b.x from a, b where a.x=b.x Release note: None
andy-kimball
left a comment
There was a problem hiding this comment.
Reviewable status:
complete! 0 of 0 LGTMs obtained (and 1 stale)
pkg/sql/opt/exec/execbuilder/testdata/join, line 185 at r4 (raw file):
Previously, rytaft wrote…
I thought in this case the name would be qualified since x is ambiguous...
The execbuilder is not using the qualified names code. It sometimes uses ColumnLabel, if there happen to be at least one synthesized columns in the projection list. But if there are only passthrough columns, it does not even use ColumnLabel. Therefore, I thought it best just to make it consistent for now, in that it will never use qualified names. As an example of a test case that doesn't use qualified names, see this one above in the file:
EXPLAIN (VERBOSE) SELECT * FROM twocolumn AS a JOIN twocolumn AS b ON a.x = 44
I also spoke to Radu, and he's fine with changing this behavior for this edge case, since column names below the top-level are only used for debugging.
pkg/sql/opt/memo/expr_view_format.go, line 546 at r5 (raw file):
Previously, rytaft wrote…
The comment for this function needs to be updated since now we're only using it to decide whether to wrap an expression in quotation marks. Alternatively, maybe we can just delete it since column names are no longer expressions. (Unnamed columns are now
?column?).
I updated the comment, since it's still useful for column names that have spaces, etc.
pkg/sql/opt/memo/memo_format.go, line 244 at r4 (raw file):
Previously, rytaft wrote…
update comment - f doesn't exist anymore
Done.
pkg/sql/opt/bench/out, line at r5 (raw file):
Previously, rytaft wrote…
I think this file was added by accident
Done.
rytaft
left a comment
There was a problem hiding this comment.
Reviewed 1 of 114 files at r6, 1 of 7 files at r7, 29 of 44 files at r9, 83 of 83 files at r10.
Reviewable status:complete! 0 of 0 LGTMs obtained (and 1 stale)
RaduBerinde
left a comment
There was a problem hiding this comment.
Reviewable status:
complete! 0 of 0 LGTMs obtained (and 2 stale)
|
bors r+ |
23885: kv: evict leaseholder on RPC error r=solongoron a=tschottdorf This addresses a situation in which we would not evict a stale leaseholder for a long time. Consider the replicas [s1,s2,s3] and s1 is down but is the cached leaseholder, while s2 is the actual lease holder. The RPC layer will try s1, get an RPC error, try s2 and succeed. Since there is no NotLeaseHolderError involved, the cache would not get updated, and so every request pays the overhead of trying s1 first. WIP because needs testing. Touches #23601. Release note (bug fix): Improve request routing during node outages. 28609: opt: Make additional perf improvements r=andy-kimball a=andy-kimball Make several more fixes: 1. Do not qualify column names in metadata, since that requires expensive string formatting up-front (also cleanup the factoring of this code, which had gotten messy). 2. Inline Metadata into Memo. 3. Inline logicalPropsBuilder into the Memo. Together, these changes improve KV perf from: ``` Phases/kv-read/OptBuild 18.4µs ± 1% ``` to: ``` Phases/kv-read/OptBuild 17.8µs ± 1% ``` 28661: storage: don't include RHS data in merge trigger r=bdarnell,tschottdorf a=benesch Now that we require ranges to be collocated during a merge and the RHS replicas to be up-to-date before the merge commits, we no longer need to include a snapshot of the RHS in the merge trigger. We know that the copy of the data that already exists in the local store is perfectly up-to-date. So, stop sending the data in the merge trigger. Release note: None 28689: sqlbase: avoid using SERIAL in system tables r=knz a=knz Needed for #28575. We'll soon want special behavior for SERIAL. We can't afford the definition of system tables to be subject to a discussion about what SERIAL means. So this patch ensures system tables don't use SERIAL. Release note: None Co-authored-by: Tobias Schottdorf <tobias.schottdorf@gmail.com> Co-authored-by: Andrew Kimball <andyk@cockroachlabs.com> Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com> Co-authored-by: Raphael 'kena' Poss <knz@cockroachlabs.com>
Build succeeded |
Make several more fixes:
requires expensive string formatting up-front (also cleanup
the factoring of this code, which had gotten messy).
Together, these changes improve KV perf from:
to: