Fix bad creation of sparse columns during mutation by Avogar · Pull Request #92860 · ClickHouse/ClickHouse

Avogar · 2025-12-22T22:33:25Z

Changelog category (leave one):

Bug Fix (user-visible misbehavior in an official stable release)

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Fix possible error FILE_DOESNT_EXIST after mutation of a sparse column with ratio_of_defaults_for_sparse_serialization=0.0. Closes #92633

Documentation entry for user-facing changes

Documentation is written (mandatory for new features)

clickhouse-gh · 2025-12-22T22:33:53Z

Workflow [PR], commit [a663c06]

Summary: ❌

job_name	test_name	status	info	comment
Stateless tests (amd_debug, distributed plan, s3 storage, parallel)		failure
	02346_text_index_parallel_replicas	FAIL	issue	ISSUE EXISTS
BuzzHouse (amd_debug)		failure
	Logical error: 'Inconsistent AST formatting: the query: (STID: 1941-1bfa)	FAIL	issue	ISSUE EXISTS

src/Storages/MergeTree/MutateTask.cpp

tests/queries/0_stateless/03774_wide_part_mutation_sparse_ratio_setting_bug.sql

…o_setting_bug.sql Co-authored-by: Azat Khuzhin <a3at.mail@gmail.com>

azat · 2025-12-23T12:45:05Z

src/Storages/MergeTree/MutateTask.cpp

-        settings = serialization_infos.getSettings();
+        settings = SerializationInfo::Settings
+        {
+            (*source_part->storage.getSettings())[MergeTreeSetting::ratio_of_defaults_for_sparse_serialization],


Also why serialization_infos.getSettings().ratio_of_defaults_for_sparse initialized with default value (1)? Maybe this should be fixed instead?

Also why serialization_infos.getSettings().ratio_of_defaults_for_sparse initialized with default value (1)?

I think to disable Sparse serialization if default settings are used. Maybe just for historical reasons.

Maybe this should be fixed instead?

The main thing here is that serialization_infos.getSettings().ratio_of_defaults_for_sparse will be default always, because we don't serialize this value in serializations.json and during deserialization of it from source part we always use default value. So we need to use value from storage settings always.

Thanks, make sense.
Maybe it make sense to initialize this value from storage for now?

But for bug-fix better to keep patches short, so LGTM

Maybe it make sense to initialize this value from storage for now?

What do you mean? Propagate value from storage to SerializationInfo deserialization, so we initialize it always with the correct value?

Propagate value from storage to SerializationInfo deserialization, so we initialize it always with the correct value?

Yes

Well, I am not sure about it. ratio_of_defaults_for_sparse is used only in serialization, not deserialization. And before serialization we usually set fresh values for all settings from the storage settings (as it was here before and as it's done for merge and new parts creation). So ideally we should not use the value of ratio_of_defaults_for_sparse from deserialized SerializationInfo. It was used here because of my mistake only.

Cherry pick #92860 to 25.10: Fix bad creation of sparse columns during mutation in Wide part

…utation in Wide part

Cherry pick #92860 to 25.11: Fix bad creation of sparse columns during mutation in Wide part

…utation in Wide part

Cherry pick #92860 to 25.12: Fix bad creation of sparse columns during mutation in Wide part

…utation in Wide part

Backport #92860 to 25.12: Fix bad creation of sparse columns during mutation in Wide part

azat · 2025-12-23T23:08:36Z

Hm, but it can be not only wide parts right? For instance indexes stored as a separate column in compact parts, so after delete index mutation for compact parts we can also have this problem, correct?

Avogar · 2025-12-24T13:09:23Z

Hm, but it can be not only wide parts right? For instance indexes stored as a separate column in compact parts, so after delete index mutation for compact parts we can also have this problem, correct?

Sounds like yes

Avogar · 2025-12-24T14:29:41Z

Actually, this is incorrect fix. It just hides the real bug. And #92419 didn't introduce it, it just uncovered it using the test 02319_lightweight_delete_on_merge_tree.

https://fiddle.clickhouse.com/e89aeaeb-200d-45c6-9649-e8d70ca2dc71

The problem is: if updated column in source part is Sparse, but in the mutated part is not Sparse (for example due to changed setting to 1.0 that disables sparse serialization), the checksums.txt file of mutated part will still contain files with sparse serialization for some reason (I guess because we copied checksums and didn't updated it or something like that). I will investigate it futher

Backport #92860 to 25.11: Fix bad creation of sparse columns during mutation in Wide part

Backport #92860 to 25.10: Fix bad creation of sparse columns during mutation in Wide part

Fix bad creation of sparse columns during mutation in Wide part

0b236e0

Avogar added the pr-must-backport Pull request should be backported intentionally. Use this label with great care! label Dec 22, 2025

clickhouse-gh bot added the pr-bugfix Pull request with bugfix, not backported by default label Dec 22, 2025

azat self-assigned this Dec 22, 2025

Update test reference

99744fc

amosbird reviewed Dec 23, 2025

View reviewed changes

src/Storages/MergeTree/MutateTask.cpp Outdated Show resolved Hide resolved

zlareb1 mentioned this pull request Dec 23, 2025

Stabilize 02319_lightweight_delete_on_merge_tree by draining mutations with a synchronous OPTIMIZE FINAL after DROP INDEX #92751

Closed

1 task

Avogar added 2 commits December 23, 2025 12:28

Remove bad changes

4ec90c5

Fix typo in the comment

5c9c64c

azat reviewed Dec 23, 2025

View reviewed changes

tests/queries/0_stateless/03774_wide_part_mutation_sparse_ratio_setting_bug.sql Outdated Show resolved Hide resolved

Update tests/queries/0_stateless/03774_wide_part_mutation_sparse_rati…

a663c06

…o_setting_bug.sql Co-authored-by: Azat Khuzhin <a3at.mail@gmail.com>

azat reviewed Dec 23, 2025

View reviewed changes

azat approved these changes Dec 23, 2025

View reviewed changes

Avogar added this pull request to the merge queue Dec 23, 2025

Merged via the queue into ClickHouse:master with commit f89fa69 Dec 23, 2025
128 of 131 checks passed

Avogar deleted the fix-sparse-wide-mutation branch December 23, 2025 21:03

robot-clickhouse-ci-1 added the pr-must-backport-synced The `*-must-backport` labels are synced into the cloud Sync PR label Dec 23, 2025

robot-ch-test-poll1 added a commit that referenced this pull request Dec 23, 2025

Merge pull request #92959 from ClickHouse/cherrypick/25.10/92860

c1a8dcb

Cherry pick #92860 to 25.10: Fix bad creation of sparse columns during mutation in Wide part

robot-clickhouse added a commit that referenced this pull request Dec 23, 2025

Backport #92860 to 25.10: Fix bad creation of sparse columns during m…

9843104

…utation in Wide part

This was referenced Dec 23, 2025

Backport #92860 to 25.10: Fix bad creation of sparse columns during mutation in Wide part #92960

Merged

Cherry pick #92860 to 25.11: Fix bad creation of sparse columns during mutation in Wide part #92961

Merged

robot-ch-test-poll1 added a commit that referenced this pull request Dec 23, 2025

Merge pull request #92961 from ClickHouse/cherrypick/25.11/92860

bfce23d

Cherry pick #92860 to 25.11: Fix bad creation of sparse columns during mutation in Wide part

robot-clickhouse added a commit that referenced this pull request Dec 23, 2025

Backport #92860 to 25.11: Fix bad creation of sparse columns during m…

5bd85cb

…utation in Wide part

This was referenced Dec 23, 2025

Backport #92860 to 25.11: Fix bad creation of sparse columns during mutation in Wide part #92962

Merged

Cherry pick #92860 to 25.12: Fix bad creation of sparse columns during mutation in Wide part #92963

Merged

robot-ch-test-poll1 added a commit that referenced this pull request Dec 23, 2025

Merge pull request #92963 from ClickHouse/cherrypick/25.12/92860

057e534

Cherry pick #92860 to 25.12: Fix bad creation of sparse columns during mutation in Wide part

robot-clickhouse added a commit that referenced this pull request Dec 23, 2025

Backport #92860 to 25.12: Fix bad creation of sparse columns during m…

4942c2d

…utation in Wide part

robot-ch-test-poll1 mentioned this pull request Dec 23, 2025

Backport #92860 to 25.12: Fix bad creation of sparse columns during mutation in Wide part #92964

Merged

robot-clickhouse added the pr-synced-to-cloud The PR is synced to the cloud repo label Dec 23, 2025

clickhouse-gh bot added a commit that referenced this pull request Dec 23, 2025

Merge pull request #92964 from ClickHouse/backport/25.12/92860

5eb2607

Backport #92860 to 25.12: Fix bad creation of sparse columns during mutation in Wide part

azat mentioned this pull request Dec 23, 2025

Flaky test: 02943_rmt_alter_metadata_merge_checksum_mismatch #92670

Closed

Avogar changed the title ~~Fix bad creation of sparse columns during mutation in Wide part~~ Fix bad creation of sparse columns during mutation Dec 24, 2025

robot-ch-test-poll4 added the pr-backports-created Backport PRs are successfully created, it won't be processed by CI script anymore label Dec 24, 2025

Avogar mentioned this pull request Dec 24, 2025

Fix possible error FILE_DOESNT_EXIST after sparse column mutation #93016

Merged

1 task

Avogar added a commit that referenced this pull request Dec 24, 2025

Merge pull request #92962 from ClickHouse/backport/25.11/92860

90d50e0

Backport #92860 to 25.11: Fix bad creation of sparse columns during mutation in Wide part

Avogar added a commit that referenced this pull request Dec 24, 2025

Merge pull request #92960 from ClickHouse/backport/25.10/92860

eeeef90

Backport #92860 to 25.10: Fix bad creation of sparse columns during mutation in Wide part

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bad creation of sparse columns during mutation#92860

Fix bad creation of sparse columns during mutation#92860
Avogar merged 5 commits intoClickHouse:masterfrom
Avogar:fix-sparse-wide-mutation

Avogar commented Dec 22, 2025 •

edited

Loading

Uh oh!

clickhouse-gh bot commented Dec 22, 2025 •

edited by Avogar

Loading

Uh oh!

Uh oh!

Uh oh!

azat Dec 23, 2025

Uh oh!

Avogar Dec 23, 2025 •

edited

Loading

Uh oh!

azat Dec 23, 2025

Uh oh!

azat Dec 23, 2025

Uh oh!

Avogar Dec 23, 2025

Uh oh!

azat Dec 23, 2025

Uh oh!

Avogar Dec 23, 2025

Uh oh!

Uh oh!

azat commented Dec 23, 2025

Uh oh!

Avogar commented Dec 24, 2025

Uh oh!

Avogar commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

Avogar commented Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Documentation entry for user-facing changes

Uh oh!

clickhouse-gh bot commented Dec 22, 2025 • edited by Avogar Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

azat Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Avogar Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

azat Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

azat Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Avogar Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

azat Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Avogar Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

azat commented Dec 23, 2025

Uh oh!

Avogar commented Dec 24, 2025

Uh oh!

Avogar commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Avogar commented Dec 22, 2025 •

edited

Loading

clickhouse-gh bot commented Dec 22, 2025 •

edited by Avogar

Loading

Avogar Dec 23, 2025 •

edited

Loading