Finer-grained subquery cache propagation control by vvo · Pull Request #99804 · ClickHouse/ClickHouse

vvo · 2026-03-17T20:22:29Z

Implements finer-grained subquery cache propagation control, as requested in the review of #76252 by @alexey-milovidov.

use_query_cache on the outer query no longer auto-propagates to subqueries. Three rules implemented in shouldUseQueryCacheForSubquery:

No auto-propagation by default — use_query_cache on the outer query does NOT propagate to subqueries
Explicit per-subquery opt-in — SELECT * FROM (SELECT ... SETTINGS use_query_cache = true) enables caching for that specific subquery
Bulk propagation — query_cache_for_subqueries = true enables propagation into all subqueries

Also fixes two issues from #76252:

Config key mismatch: the rename from QueryCache → QueryResultCache changed keys in Context.cpp but not in config files, breaking SYSTEM RELOAD CONFIG for cache settings
All 03381_* tests now explicitly SET enable_analyzer = 1 since Planner-level subquery caching is new-analyzer only

Changelog category (leave one):

Improvement

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Individual subquery results can now be cached independently using SETTINGS use_query_cache = true on specific subqueries, without caching the entire outer query. A new setting query_cache_for_subqueries = true enables bulk propagation of use_query_cache into all subqueries. Note: use_query_cache on the outer query no longer auto-propagates to subqueries.

Documentation entry for user-facing changes

Documentation is written (mandatory for new features)

A new "Subquery Caching" section has been added to the query cache documentation explaining the three propagation modes with examples.

…o query_cache_for_subqueries

…_subqueries

…display

…ckHouse into query_cache_for_subqueries

Add null guard for `query_result_cache` in `Planner::buildPlanForQueryNode` to prevent crash when a subquery explicitly sets `use_query_cache = true` but the server has no query cache configured. Remove spurious duplicate `optimize_and_compare_chain` entry in `SettingsChangesHistory.cpp` that was introduced during rebase. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

`finalizeWriteInQueryResultCache` is a no-op when the pipeline has no cache writers, so the conditional checks are unnecessary. Making the calls unconditional is consistent with `executeQuery.cpp`'s `finalizeQueryPipelineBeforeLogging` which always calls it, and is more robust for future changes. Remove the now-unused `shouldCacheScalarSubquery` function and `query_cache_for_subqueries` extern declarations from both `ExecuteScalarSubqueriesVisitor.cpp` and `PreparedSets.cpp`. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Add null guard for `query_result_cache` in `Planner::buildPlanForQueryNode` to prevent crash when a subquery explicitly sets `use_query_cache = true` but the server has no query cache configured. Remove spurious duplicate `optimize_and_compare_chain` entry in `SettingsChangesHistory.cpp` that was introduced during rebase. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

`finalizeWriteInQueryResultCache` is a no-op when the pipeline has no cache writers, so the conditional checks are unnecessary. Making the calls unconditional is consistent with `executeQuery.cpp`'s `finalizeQueryPipelineBeforeLogging` which always calls it, and is more robust for future changes. Remove the now-unused `shouldCacheScalarSubquery` function and `query_cache_for_subqueries` extern declarations from both `ExecuteScalarSubqueriesVisitor.cpp` and `PreparedSets.cpp`. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Eliminate double AST clone in `QueryResultCache::Key` constructor: introduce `calculateAstHashAndQueryString` that clones once for subqueries and computes both hash and query string from the result. Add `already_cloned` parameter to `calculateAstHash` to skip its internal clone when called with a pre-cloned AST. - Guard `extremes`/`max_result_bytes`/`max_result_rows` default normalization with `if (is_subquery)` so non-subquery cache entries are not affected (avoids hash changes and cache invalidation on upgrade for existing non-subquery entries). - Add tests for CTE subquery caching, UNION ALL subquery caching, explicit per-subquery opt-in with cache hit verification, old analyzer negative test, and `query_cache_min_query_runs` with subqueries. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

src/Planner/Planner.cpp

…rence - Add query_cache_system_table_handling = 'save' to union subquery test (UNION ALL with literals triggers system table check on CI) - Fix min_runs reference to match CI output (cache hit counts differ between local Release build and CI coverage build) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

src/Core/Settings.cpp

The description said results are "retrieved" from cache, but this setting enables both read and write paths for subquery caching. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The AI reviewer correctly identified that `setCanUseQueryResultCache(true)` on the planner context leaks subquery opt-in into the shared query state, which could cause outer query results to be cached even when the outer query didn't set `use_query_cache`. Fix: instead of mutating the shared context, add a `skip_context_check` parameter to `checkCanWriteQueryResultCache` that bypasses the context flag check while still enforcing safety checks (nondeterministic functions, system tables). The subquery cache eligibility is now tracked locally via `local_can_use_cache` without mutating global state. Also adds Test 14: regression test proving that explicit subquery opt-in does NOT create a top-level cache entry when outer `use_query_cache = 0`. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

src/Interpreters/Cache/QueryResultCache.h

…opagation-v2

Upstream master bumped to 26.4, so `query_cache_for_subqueries` setting history entry must be in the 26.4 section (was in 26.3 which is now closed). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The Key's operator== and KeyHasher only used ast_hash, which could cause collisions between top-level and subquery entries with the same AST. Now is_subquery is included in both equality comparison and hash computation, ensuring separate cache domains. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Revert contrib/NuRaft, contrib/libstemmer_c, contrib/rust_vendor and .gitignore to upstream master — these were accidentally included from the upstream merge and are not part of this feature. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Initialize `chunk_type` in `StreamInQueryResultCacheStep` to prevent potential use of uninitialized variable if `StreamType` enum is extended - Fix `system.query_cache` column description for `is_subquery` - Fix minor grammar in comment - Add missing trailing newlines in test files Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Resolve conflict in Planner.cpp: keep the local_can_use_cache approach (from reviewer feedback) instead of mutating the shared query context. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

vvo · 2026-03-20T00:30:52Z

@rschu1ze @alexey-milovidov I think we're done here, let me know!

src/Interpreters/Cache/QueryResultCache.cpp

Multiple StreamInQueryResultCacheTransform instances share the same writer. The first finalizeWrite call processes all buffered data; subsequent calls are correct no-ops. Early-exit rejections are intentional and should not be retried. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…opagation

clickhouse-gh · 2026-03-24T20:17:58Z

LLVM Coverage Report

Metric	Baseline	Current	Δ
Lines	84.10%	83.80%	-0.30%
Functions	24.50%	24.50%	+0.00%
Branches	76.70%	76.30%	-0.40%

PR changed lines: PR changed-lines coverage: 96.72% (413/427, 0 noise lines excluded)
Diff coverage report
Uncovered code

vvo · 2026-04-01T14:10:31Z

@rschu1ze @alexey-milovidov do we still want this?

rschu1ze · 2026-04-01T14:28:56Z

Yes. Haven't had time to check yet, sorry.

nbarannik and others added 30 commits January 14, 2025 11:19

Fix typos

f722c4f

Merge branch 'master' of https://github.com/ClickHouse/ClickHouse

34c8bb1

Merge branch 'master' of https://github.com/ClickHouse/ClickHouse

45fb3f4

Add steps for query caching

40bbd47

Fix subquery cache for scalar subqueries

f13816d

Fix subquery cache in WHERE

fc60f8e

Cache initial ast instead of query tree ast

ee2818c

Fix different changed settings for queries and subqueries

0b4bfd9

Use initial query cache usage for main query

067c8fc

Fix

a93d31a

Remove table aliases from query tree in query cache

c9b0064

Fix multiple finalizes of query cache writer

f11e7f5

Merge branch 'master' of https://github.com/ClickHouse/ClickHouse int…

b92bafc

…o query_cache_for_subqueries

Small refactor after merge

70280c7

Remove debug output

dc83f24

Refactor checks before write to query cache initialization

de6291b

Clean up

c63cf7c

Remove debug output

d403a66

Remove debug output

3f0b240

Clean up

d265dc1

Merge branch 'master' of https://github.com/ClickHouse/ClickHouse int…

9a2823d

…o query_cache_for_subqueries

Add query_cache_for_subqueries setting

bd41f88

Remove settings clause if its empty after query cache settings remove

06a6ccf

Fix style

ecb8fb4

Merge remote-tracking branch 'ClickHouse/master' into query_cache_for…

b70452b

…_subqueries

Cosmetics + fix style check + fix FastTest

9c38fe2

Fix a small bug

64478e6

Add is_subquery field to system.query_cache, and fix subquery string …

bb55565

…display

Merge branch 'query_cache_for_subqueries' of github.com:nbarannik/Cli…

294ab70

…ckHouse into query_cache_for_subqueries

QueryCache -> QueryResultCache

cd558f7

vvo and others added 5 commits March 19, 2026 08:39

clickhouse-gh bot reviewed Mar 19, 2026

View reviewed changes

src/Planner/Planner.cpp Outdated Show resolved Hide resolved

clickhouse-gh bot reviewed Mar 19, 2026

View reviewed changes

src/Core/Settings.cpp Show resolved Hide resolved

vvo and others added 3 commits March 19, 2026 13:25

Fix misleading query_cache_for_subqueries setting description

4ea0f7a

The description said results are "retrieved" from cache, but this setting enables both read and write paths for subquery caching. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Retry CI: AST fuzzer and stress test failures are unrelated flakes

b8b2182

clickhouse-gh bot reviewed Mar 19, 2026

View reviewed changes

src/Interpreters/Cache/QueryResultCache.h Show resolved Hide resolved

vvo and others added 2 commits March 20, 2026 00:03

Merge remote-tracking branch 'upstream/master' into subquery-cache-pr…

64f4a13

…opagation-v2

Merge upstream master and move setting to version 26.4

880f8fa

Upstream master bumped to 26.4, so `query_cache_for_subqueries` setting history entry must be in the 26.4 section (was in 26.3 which is now closed). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

clickhouse-gh bot added the manual approve Manual approve required to run CI label Mar 19, 2026

vvo and others added 2 commits March 20, 2026 00:04

Remove accidentally committed temp files, update .gitignore

8e2ff57

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Retrigger CI after merge with upstream 26.4

20e01c3

rschu1ze self-assigned this Mar 19, 2026

vvo and others added 4 commits March 20, 2026 00:40

Merge remote changes, keep local_can_use_cache approach

38a36ac

Resolve conflict in Planner.cpp: keep the local_can_use_cache approach (from reviewer feedback) instead of mutating the shared query context. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Retry CI: 3 failures are known flakes unrelated to query cache

e7e86b8

clickhouse-gh bot reviewed Mar 20, 2026

View reviewed changes

src/Interpreters/Cache/QueryResultCache.cpp Show resolved Hide resolved

vvo and others added 2 commits March 20, 2026 12:20

Merge remote-tracking branch 'upstream/master' into subquery-cache-pr…

3ed068e

…opagation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Finer-grained subquery cache propagation control#99804

Finer-grained subquery cache propagation control#99804
vvo wants to merge 72 commits intoClickHouse:masterfrom
vvo:subquery-cache-propagation

vvo commented Mar 17, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vvo commented Mar 20, 2026

Uh oh!

Uh oh!

clickhouse-gh bot commented Mar 24, 2026

Uh oh!

vvo commented Apr 1, 2026

Uh oh!

rschu1ze commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

vvo commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Documentation entry for user-facing changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vvo commented Mar 20, 2026

Uh oh!

Uh oh!

clickhouse-gh bot commented Mar 24, 2026

LLVM Coverage Report

Uh oh!

vvo commented Apr 1, 2026

Uh oh!

rschu1ze commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vvo commented Mar 17, 2026 •

edited

Loading