Recover the Replicated Database forcefully after restoring database metadata in Keeper by tuanpach · Pull Request #85960 · ClickHouse/ClickHouse

tuanpach · 2025-08-21T04:51:54Z

Changelog category (leave one):

Bug Fix

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Recover the Replicated Database forcefully after restoring the database metadata in Keeper.

The issue in #85664 is that when restoring metadata, it sets the digest of the replica to "0" in DatabaseReplicated::createReplicaNodesInZooKeeper.
If another node restores the table metadata, it will just reinitialize the DDL Worker.
When restarting, the DB might not has any tables locally, and the local digest is 0, it matches to the keeper digest, so it won't update the restored metadata.

In this PR, after restoring database metadata in Keeper, before reinitializing the DDL Worker, set the replica digest to 42 to force the database to recover to update the restored metadata.

Closes #85664

Documentation entry for user-facing changes

Documentation is written (mandatory for new features)

…etadata in Keeper

clickhouse-gh · 2025-08-21T04:53:23Z

Workflow [PR], commit [22dba5d]

Summary: ❌

job_name	test_name	status
Stateless tests (amd_binary, old analyzer, s3 storage, DatabaseReplicated, parallel)		failure
	02443_detach_attach_partition	FAIL
	03595_alter_drop_column_comment_if_exists	FAIL
	Lost s3 keys	FAIL
	S3_ERROR No such key thrown (in clickhouse-server.log or clickhouse-server.err.log)	FAIL
Integration tests (amd_tsan, 5/6)		failure
	test_threadpool_readers/test.py::test_local_fs_threadpool_reader	FAIL

evillique · 2025-08-21T15:36:26Z

The changelog category should probably be bugfix, if I understand it correctly CI fix is for fixes in the CI infrastructure.

tuanpach · 2025-08-22T03:34:54Z

The changelog category should probably be bugfix, if I understand it correctly CI fix is for fixes in the CI infrastructure.

I thought it also fixes the CI failed tests: https://github.com/ClickHouse/ClickHouse/issues?q=state%3Aclosed%20label%3Apr-ci

tuanpach · 2025-08-22T08:02:49Z

Stateless tests (amd_binary, old analyzer, s3 storage, DatabaseReplicated, parallel)
- 02443_detach_attach_partition: 02443_detach_attach_partition is flaky #54748
- 03595_alter_drop_column_comment_if_exists: Fix broken test 03595_alter_drop_column_comment_if_exists #86001
Integration tests (amd_tsan, 5/6)
- test_threadpool_readers/test.py::test_local_fs_threadpool_reader: test_threadpool_readers/test.py::test_local_fs_threadpool_reader is flaky #85689

evillique · 2025-08-22T13:56:24Z

I thought it also fixes the CI failed tests: https://github.com/ClickHouse/ClickHouse/issues?q=state%3Aclosed%20label%3Apr-ci

Well, in this list I see the changes to CI itself and test fixes insofar as fixing the test itself, I don't see any bugfixes that change our main code. And if I understand correctly this PR fixes a real bug found in the test, but not the test itself.

tuanpach · 2025-08-25T09:25:05Z

I thought it also fixes the CI failed tests: https://github.com/ClickHouse/ClickHouse/issues?q=state%3Aclosed%20label%3Apr-ci

Well, in this list I see the changes to CI itself and test fixes insofar as fixing the test itself, I don't see any bugfixes that change our main code. And if I understand correctly this PR fixes a real bug found in the test, but not the test itself.

I updated the category.

Recover the Replicated Database forcefully after restoring database m…

22dba5d

…etadata in Keeper

tuanpach added the can be tested Allows running workflows for external contributors label Aug 21, 2025

clickhouse-gh bot added the pr-ci label Aug 21, 2025

evillique approved these changes Aug 21, 2025

View reviewed changes

evillique self-assigned this Aug 21, 2025

tuanpach added this pull request to the merge queue Aug 22, 2025

Merged via the queue into ClickHouse:master with commit e21a13b Aug 22, 2025
119 of 122 checks passed

tuanpach deleted the fix-issue-85664 branch August 22, 2025 08:17

robot-clickhouse-ci-1 added the pr-synced-to-cloud The PR is synced to the cloud repo label Aug 22, 2025

tuanpach added pr-bugfix Pull request with bugfix, not backported by default and removed pr-ci labels Aug 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recover the Replicated Database forcefully after restoring database metadata in Keeper#85960

Recover the Replicated Database forcefully after restoring database metadata in Keeper#85960
tuanpach merged 1 commit intoClickHouse:masterfrom
tuanpach:fix-issue-85664

tuanpach commented Aug 21, 2025 •

edited

Loading

Uh oh!

clickhouse-gh bot commented Aug 21, 2025 •

edited

Loading

Uh oh!

evillique commented Aug 21, 2025

Uh oh!

tuanpach commented Aug 22, 2025

Uh oh!

tuanpach commented Aug 22, 2025

Uh oh!

Uh oh!

evillique commented Aug 22, 2025

Uh oh!

tuanpach commented Aug 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tuanpach commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Documentation entry for user-facing changes

Uh oh!

clickhouse-gh bot commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

evillique commented Aug 21, 2025

Uh oh!

tuanpach commented Aug 22, 2025

Uh oh!

tuanpach commented Aug 22, 2025

Uh oh!

Uh oh!

evillique commented Aug 22, 2025

Uh oh!

tuanpach commented Aug 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tuanpach commented Aug 21, 2025 •

edited

Loading

clickhouse-gh bot commented Aug 21, 2025 •

edited

Loading