stability: Poor performance of race-enabled build #10388
Closed
Labels
S-1-stability: Severe stability issues that can be fixed by upgrading, but usually don’t resolve by restarting
Description
On rho, upgrading from 5bc7bf1 to 7915024 (race-enabled builds) caused performance to plummet, from ~130 inserts per second to ~6. Raft elections are twice as frequent as before (about 10 MsgVote and 10 MsgVoteResp messages per second), and msgProp is 5 times more common (~35 vs ~7).
Perhaps the biggest problem is cmdQMu contention. Messages like this are not uncommon:
cockroach@104.196.147.189: W161102 09:06:12.118239 147887 storage/replica.go:551 [n3,s3,r5973/2:/System/tsd/cr.node.txn.resta…] cmdQMu: mutex held by github.com/cockroachdb/cockroach/pkg/storage.(*Replica).beginCmds for 982.044622ms (>500ms):
The 95th percentile mutex duration has also risen to 900µs, up from 20µs (other mutex metrics haven't changed significantly).