-
Notifications
You must be signed in to change notification settings - Fork 4.1k
kv: partial/asymmetric partition that does not isolate node liveness does not force range lease revocation #49220
Description
ERROR: internal error while retrieving user account
Please describe the issue you observed, and any steps we can take to reproduce it:
Setup a 6 nodes cluster (3 on "datacenter" A, 3 on "datacenter" B (172.31.2.0/24)), set default replication zone to num_replicas = 6, create a table with a num_replica = 6 as well.
Create a iptables rule on a node of A such that it can not reach any node on B.
sudo iptables -I OUTPUT -d 172.31.2.0/24 -j DROP
Trying to connect to that node I get:
$ cockroach-v20.1.0.linux-amd64/cockroach sql --insecure --url postgresql://kalman@192.168.26.157:26257/testdb
#
# Welcome to the CockroachDB SQL shell.
# All statements must be terminated by a semicolon.
# To exit, type: \q.
#
Connecting to another node works, but issuing a command like:
select count(*) from test2;
it hangs forever, as soon I dropped that rule with:
sudo iptables -D OUTPUT -d 172.31.2.0/24 -j DROP
the query in hang completed and now I'm able to connect to that node without issues
Environment:
- CockroachDB version: 20.1
- Server OS: Linux Ubuntu 18.04
- Client app: cockroach sql or psql give the same error
Additional context
What was the impact? 1 Node isolated from other 3 nodes the entire cluster looks
like is not working
For the dashboard all 6 nodes are available.
gz#7690
gz#8203
gz#8949
gz#10844
Jira issue: CRDB-4243
gz#14119
gz#14290
gz#16890