fix: Added check to prevent CP from joining with same name as worker node. by ethandcosta · Pull Request #2263 · canonical/k8s-snap

ethandcosta · 2026-01-13T15:15:12Z

Currently, if a CP node attempts to join the cluster with the same name as an existing worker node, it will fail in Kubernetes but persist in the microcluster, creating a mismatch in data. This PR addresses this issue by adding a check in the join logic to kill a CP join if there's a worker with the same name.

louiseschmidtgen

Thanks for picking this up Ethan!

Could I ask you to create an integration test as well please?
This way we can test the actual hook.

I'm thinking 1) worker, cp and 2) cp, worker (for the later I'm not sure if the worker has permissions to get all nodes so just double check this case).

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 6 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

ethandcosta · 2026-01-21T02:19:19Z

I've moved the node name duplication check into the get-join-token section instead. This won't cover the case where node A creates a join token, an identically-named node B creates another join token and joins the cluster, and then A attempts to join the cluster, as I think this will probably take some greater design reworking better suited for a follow-on task, i.e. creating an exposed endpoint for joining nodes to see who's in the cluster, and it's not a case that is likely to happen often.

ethandcosta · 2026-01-21T16:58:52Z

edit: this PR is kind of in a limbo state because of the k8sd migration. Once this PR canonical/k8sd#4 gets merged in, then we can update the version used here and this should be good to merge.

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

louiseschmidtgen

Two nits, apologies I didn't catch the one earlier:

louiseschmidtgen · 2026-02-10T19:51:18Z

+            f'a node with this name is already part of the cluster: "{shared_node_name}"'
+            in error_output
+        ), f"Join error message should indicate duplicate node name. Got: {error_output}"


I'm sorry that I didn't catch it earlier: can we capitalize the errors in k8sd to export them. Then we can use those here instead of string messages that may become outdated.

canonical/k8sd#14 👍

louiseschmidtgen · 2026-02-10T19:54:57Z

+    try:
+        util.get_join_token(
+            cluster_node, joining_worker, "--worker", name=shared_node_name
+        )
+        assert False, "get-join-token should have failed due to duplicate node name"
+    except tenacity.RetryError as e:
+        LOG.info("get-join-token failed as expected")
+        cause = e.last_attempt.exception()
+        if not isinstance(cause, subprocess.CalledProcessError):
+            raise e
+        error_output = (
+            cause.stderr if cause.stderr else cause.stdout if cause.stdout else ""
+        )
+        if isinstance(error_output, bytes):
+            error_output = error_output.decode()
+        assert (
+            f'a node with this name is already part of the cluster: "{shared_node_name}"'
+            in error_output
+        ), f"Join error message should indicate duplicate node name. Got: {error_output}"


There's something neat we can use to clean this piece up a bit: https://docs.pytest.org/en/7.1.x/how-to/assert.html#assertions-about-expected-exceptions
pytest expected exceptions.

ethandcosta requested review from Copilot and louiseschmidtgen January 13, 2026 15:15

louiseschmidtgen reviewed Jan 13, 2026

View reviewed changes

Copilot AI reviewed Jan 13, 2026

louiseschmidtgen requested a review from Copilot January 13, 2026 19:00

Copilot started reviewing on behalf of louiseschmidtgen January 13, 2026 19:00 View session

Copilot AI reviewed Jan 13, 2026

View reviewed changes

ethandcosta requested a review from louiseschmidtgen January 14, 2026 16:21

louiseschmidtgen reviewed Jan 15, 2026

View reviewed changes

Comment thread tests/integration/tests/test_clustering.py Outdated

ethandcosta requested a review from louiseschmidtgen January 21, 2026 02:19

ethandcosta force-pushed the KU-3296/no-dupe-node-names branch from 870e308 to 038719d Compare January 21, 2026 16:14

ethandcosta mentioned this pull request Jan 21, 2026

fix: add check for node name duplication canonical/k8sd#4

Merged

ethandcosta added 13 commits February 5, 2026 18:13

added integration tests

c61fede

fixed comment formatting

2366fab

removed name flag from join command

51c8786

linting

177b8fe

explicitly set name

8cfdbac

util

07deeeb

fix join token

1db4fb7

fix join token

180f4a0

moved node check logic to the join token

802367b

added name specification to wait check

6d3a1a2

rebasing to new origin

eb6edcb

updated tests to use new error text

0d01dad

rebasing

c0be9cf

ethandcosta force-pushed the KU-3296/no-dupe-node-names branch from 016f271 to c0be9cf Compare February 5, 2026 23:16

louiseschmidtgen requested a review from Copilot February 10, 2026 19:44

Copilot started reviewing on behalf of louiseschmidtgen February 10, 2026 19:45 View session

Copilot AI reviewed Feb 10, 2026

View reviewed changes

Comment thread tests/integration/tests/test_util/util.py

Comment thread tests/integration/tests/test_clustering.py

Comment thread tests/integration/tests/test_clustering.py

louiseschmidtgen approved these changes Feb 10, 2026

View reviewed changes

ethandcosta merged commit bed0586 into main Feb 11, 2026
287 of 288 checks passed

ethandcosta deleted the KU-3296/no-dupe-node-names branch February 11, 2026 14:16

Conversation

ethandcosta commented Jan 13, 2026

Uh oh!

louiseschmidtgen left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ethandcosta commented Jan 21, 2026

Uh oh!

ethandcosta commented Jan 21, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

louiseschmidtgen left a comment

Choose a reason for hiding this comment

Uh oh!

louiseschmidtgen Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

ethandcosta Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

louiseschmidtgen Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

louiseschmidtgen left a comment •

edited

Loading