
roachtest: move node-kill operation to pkill/pgrep-based kill approach #123517

Merged
craig[bot] merged 1 commit into cockroachdb:master from itsbilal:node-kill-script
May 2, 2024


itsbilal commented May 2, 2024

For some reason, StopServiceForVirtualCluster fails with this error on drt clusters:

20:23:41 node_kill.go:51: operation status: killing node 1  with signal 15
20:23:41 cluster.go:2148: stoping virtual cluster
20:23:41 operation_impl.go:128: operation failure #1: no service for virtual cluster ""

The debug message itself has a bug: the virtual cluster is actually set to "system", not "". The underlying problem seems to be that the service discovery process isn't able to determine the cockroach process based on the DNS settings in the drt project. This change makes the node-kill operation more DNS-agnostic by looking for the cockroach process with pkill/pgrep instead.

Epic: none

Release note: None
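
A minimal sketch of the pkill/pgrep idea, for illustration only; it is not the code merged in this PR. The helper names `pkillCommand` and `pgrepCommand` are hypothetical, and the sketch only builds the command strings. How roachtest actually runs them on the target node is not shown here.

```go
package main

import "fmt"

// pkillCommand (hypothetical helper) returns a command that sends the given
// signal to every process whose name is exactly processName. The -x flag
// requires an exact name match, so no service discovery or DNS is involved.
func pkillCommand(processName string, signal int) string {
	return fmt.Sprintf("pkill -%d -x %s", signal, processName)
}

// pgrepCommand (hypothetical helper) returns a command that lists the PIDs of
// processes named processName. pgrep exits non-zero when nothing matches,
// which can be used to check whether the process is gone.
func pgrepCommand(processName string) string {
	return fmt.Sprintf("pgrep -x %s", processName)
}

func main() {
	// Signal 15 (SIGTERM) matches the log excerpt above; 9 would be SIGKILL.
	fmt.Println(pkillCommand("cockroach", 15))
	fmt.Println(pgrepCommand("cockroach"))
}
```

Because the lookup is by process name on the node itself, this approach does not depend on the service having been registered.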

itsbilal requested a review from a team as a code owner May 2, 2024 20:49
itsbilal requested review from DarrylWong and herkolategan and removed request for a team May 2, 2024 20:49

renatolabs left a comment


I forgot DRT skips registration 👍


itsbilal commented May 2, 2024

TFTR!

bors r=renatolabs


herkolategan commented May 2, 2024

Also ran into this earlier with something else, and was about to see how node-kill does it.


craig bot commented May 2, 2024

craig bot merged commit 1f6e966 into cockroachdb:master May 2, 2024