colfetcher: emit periodic query progress update metadata #158029

Merged
craig[bot] merged 1 commit into cockroachdb:master from yuzefovich:vec-meta-wip
Jan 10, 2026

@yuzefovich
Member

@yuzefovich yuzefovich commented Nov 19, 2025

This commit extends the query progress reporting that the row-by-row `tableReader` already does to the vectorized scan operators. Namely, after about 20k rows have been output, the operator emits the `RowsRead` metadata, which `DistSQLReceiver` then uses to update `progressAtomic`. The result shows up in the `phase` column of `SHOW QUERIES`.

Fixes: #26639.

Release note (sql change): Queries executed via the vectorized engine now display their progress in the `phase` column of `SHOW QUERIES`. Previously, this feature was only available in the row-by-row engine.

@yuzefovich yuzefovich added the do-not-merge label Nov 19, 2025
@cockroach-teamcity
Member

This change is Reviewable

@cockroachdb cockroachdb deleted a comment from blathers-crl bot Jan 10, 2026
@yuzefovich yuzefovich changed the title from "[DNM] colexecop: modify Operator.Next to return metadata" to "colfetcher: emit periodic query progress update metadata" Jan 10, 2026
@yuzefovich yuzefovich removed the do-not-merge label Jan 10, 2026
@yuzefovich yuzefovich marked this pull request as ready for review January 10, 2026 03:49
@yuzefovich yuzefovich requested a review from a team as a code owner January 10, 2026 03:49
Collaborator

@DrewKimball DrewKimball left a comment

:lgtm: Cool!

@DrewKimball reviewed 4 files and all commit messages, and made 1 comment.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @yuzefovich).

@yuzefovich
Member Author

Thanks for the quick review!

bors r+

craig bot pushed a commit that referenced this pull request Jan 10, 2026
158029: colfetcher: emit periodic query progress update metadata r=yuzefovich a=yuzefovich

Co-authored-by: Yahor Yuzefovich <yahor@cockroachlabs.com>
@craig
Contributor

craig bot commented Jan 10, 2026

Build failed:

@yuzefovich
Member Author

bors retry

craig bot pushed a commit that referenced this pull request Jan 10, 2026
158029: colfetcher: emit periodic query progress update metadata r=yuzefovich a=yuzefovich

Co-authored-by: Yahor Yuzefovich <yahor@cockroachlabs.com>
@craig
Contributor

craig bot commented Jan 10, 2026

Build failed:

@yuzefovich
Member Author

bors retry

craig bot pushed a commit that referenced this pull request Jan 10, 2026
158029: colfetcher: emit periodic query progress update metadata r=yuzefovich a=yuzefovich

160632: sql/bulkmerge: reuse SST iterator across bulk merge tasks r=spilchen a=spilchen

This change reduces overhead in the bulk merge processor by initializing a single iterator over all input SSTs at startup, rather than creating a new one per task. The iterator is reused across tasks, seeking only when needed.

Informs #159414
Epic: CRDB-48845
Release note: none

Co-authored by: `@jeffswenson`

Co-authored-by: Yahor Yuzefovich <yahor@cockroachlabs.com>
Co-authored-by: Matt Spilchen <matt.spilchen@cockroachlabs.com>
@craig
Contributor

craig bot commented Jan 10, 2026

Build failed (retrying...):

craig bot pushed a commit that referenced this pull request Jan 10, 2026
158029: colfetcher: emit periodic query progress update metadata r=yuzefovich a=yuzefovich

Co-authored-by: Yahor Yuzefovich <yahor@cockroachlabs.com>
@craig
Contributor

craig bot commented Jan 10, 2026

Build failed:

@yuzefovich
Member Author

bors r+

@craig
Contributor

craig bot commented Jan 10, 2026

@craig craig bot merged commit cbc6c60 into cockroachdb:master Jan 10, 2026
41 of 43 checks passed
@yuzefovich yuzefovich deleted the vec-meta-wip branch January 10, 2026 16:35

Development

Successfully merging this pull request may close these issues.

sql: No way to get insight into VALIDATE CONSTRAINT progress

3 participants