sql: add multitenant fairness tests #77481
Conversation
sumeerbhola
left a comment
Reviewable status:
complete! 0 of 0 LGTMs obtained (waiting on @cucaroach and @tbg)
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 63 at r1 (raw file):
} // Test that the kvserver fairly distributes CPU token on a highly concurrent 4 MT workload.
4 MT?
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 146 at r1 (raw file):
fmt.Printf("Max latency delta: %d%% %v\n", int(maxDeltaPercentage*100), meanLatencies) // TODO: Can we do anything to verify that admission control kicked in? Check some stats?
What does the admin console look like when this is run, just as a rough sanity check?
Is there enough concurrency from each tenant to saturate some resource? I thought we would need --concurrency to be high (e.g. see roachtest/tests/kv.go).
Also, if each tenant has the same concurrency, I suspect they will get similar allocation even without admission control. We could manually turn off admission control and see if there is any change in the observed throughput for each tenant. An alternative would be to have tenants with different concurrency values, say tenant1 with C concurrency, tenant2 with 2C, tenant3 with 4C, tenant4 with 8*C, such that C is enough to use significantly more than 25% of the resource, and then see if everyone gets equal throughput.
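The skewed-concurrency scheme described here (C, 2C, 4C, 8C) can be sketched as a small standalone snippet; the base value and function name are illustrative, not from the PR:

```go
package main

import "fmt"

// concurrencyForTenant returns the workload --concurrency for tenant i
// (1-indexed) under the doubling scheme described above: C, 2C, 4C, 8C.
// base is the hypothetical C; it should be high enough that one tenant
// alone can consume well over 25% of the contended resource.
func concurrencyForTenant(i, base int) int {
	return base << (i - 1) // base * 2^(i-1)
}

func main() {
	const base = 64 // placeholder value for C
	for i := 1; i <= 4; i++ {
		fmt.Printf("tenant%d: --concurrency=%d\n", i, concurrencyForTenant(i, base))
	}
}
```

With perfect fairness, all four tenants should still see roughly equal throughput despite the skewed demand.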
cucaroach
left a comment
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 63 at r1 (raw file):
Previously, sumeerbhola wrote…
4 MT?
Sorry, what terminology should I use? 4 SQL pods?
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 146 at r1 (raw file):
Previously, sumeerbhola wrote…
What does the admin console look like when this is run, just as a rough sanity check?
Is there enough concurrency from each tenant to saturate some resource? I thought we would need --concurrency to be high (e.g. see roachtest/tests/kv.go).
Also, if each tenant has the same concurrency, I suspect they will get similar allocation even without admission control. We could manually turn off admission control and see if there is any change in the observed throughput for each tenant. An alternative would be to have tenants with different concurrency values, say tenant1 with C concurrency, tenant2 with 2C, tenant3 with 4C, tenant4 with 8*C, such that C is enough to use significantly more than 25% of the resource, and then see if everyone gets equal throughput.
What part of the console? The kv server looks very unbusy, since the console is biased towards SQL activity. It only shows 25% CPU usage, but I know from top et al. that the CPU is tapped out. Each workload instance has the default concurrency of 8, so there's 32 concurrency in total. I can see from custom metrics that there are 40 kv slots, so I'll try to crank up concurrency to see if I can get any queuing to happen. Then I'll play with varying concurrency per tenant. Thanks for the suggestions!
sumeerbhola
left a comment
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 146 at r1 (raw file):
Previously, cucaroach (Tommy Reilly) wrote…
What part of the console? The kv server looks very unbusy, since the console is biased towards SQL activity. It only shows 25% CPU usage, but I know from top et al. that the CPU is tapped out. Each workload instance has the default concurrency of 8, so there's 32 concurrency in total. I can see from custom metrics that there are 40 kv slots, so I'll try to crank up concurrency to see if I can get any queuing to happen. Then I'll play with varying concurrency per tenant. Thanks for the suggestions!
the overload page should show queuing in admission control if we are overloading cpu (higher runnable goroutine count) or causing higher read amp in the store.
sumeerbhola
left a comment
The mean delay is < 150us, and the 75th percentile is ~10ms. The slots don't look anywhere close to full. There must be some tiny spikes that are not caught by the low sampling frequency of the metric, which is why admission delay is not 0. What does the runnable goroutine count on the overload dashboard look like? I think we need significantly more load.
I think the machine is so overloaded that we're adjusting the number of available slots down? Isn't the dark line in the slots graph the number of slots we think we can handle? The CPU is pegged at 100% (this is a 4-CPU machine running 4 workload processes, 4 SQL pods, and 1 kvserver). Maybe going from 32 to 400 was too big a jump in concurrency; I'll experiment some more...
sumeerbhola
left a comment
I think the machine is so overloaded that we're adjusting the number of available slots down? Isn't the dark line in the slots graph the number of slots we think we can handle?
I have no idea what the dark line is. What do the labels say when you hover over it? I doubt we would see such low admission control latency if the slots were truly scarce.
The CPU is pegged at 100% (this is a 4-CPU machine running 4 workload processes, 4 SQL pods, and 1 kvserver). Maybe going from 32 to 400 was too big a jump in concurrency; I'll experiment some more...
We shouldn't run the kv server (or storage server in the terminology doc) on a shared machine with the others. It's going to add noise due to cpu fluctuations caused by the cpu usage in the SQL pods and the workload processes.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 63 at r1 (raw file):
Previously, cucaroach (Tommy Reilly) wrote…
Sorry, what terminology should I use? 4 SQL pods?
yes, that's fine https://docs.google.com/presentation/d/1IvqkP3EOiKLPPf7lPsMpJsoUk6hcaaDm4xg5VqJ97TI/edit#slide=id.g1125de19347_0_185
tbg
left a comment
Gave an initial review with some high-level suggestions. Sorry this took so long, I was out on PTO until earlier this week and had been catching up.
Reviewed 3 of 3 files at r1, all commit messages.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 38 at r1 (raw file):
Run: func(ctx context.Context, t test.Test, c cluster.Cluster) { runMultiTenantFairness(ctx, t, c, *t.BuildVersion(), 10, // init duration
I'd introduce a multiTenantFairnessOptions struct to hold these values. It also seems like you can just have a BlockSize parameter, since all uses currently specify min=max.
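A rough sketch of what such an options struct could look like (field names here are my guesses, not the PR's final code); note the single blockSize field replacing the min/max pair, since all callers pass min=max:

```go
package main

import (
	"fmt"
	"time"
)

// multiTenantFairnessOptions gathers the per-test knobs that were
// previously passed as positional arguments. Hypothetical field names.
type multiTenantFairnessOptions struct {
	name         string
	initDuration time.Duration
	duration     time.Duration
	blockSize    int                  // all callers use min=max, so one field suffices
	concurrency  func(tenant int) int // per-tenant workload concurrency
}

func main() {
	opts := multiTenantFairnessOptions{
		name:         "concurrency-skew",
		initDuration: 10 * time.Second,
		duration:     5 * time.Minute,
		blockSize:    64,
		concurrency:  func(i int) int { return i * 8 },
	}
	fmt.Printf("%s: tenant 3 runs with --concurrency=%d\n", opts.name, opts.concurrency(3))
}
```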
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 69 at r1 (raw file):
c cluster.Cluster, v version.Version, initDuration int,
Use time.Duration here or the caller can invoke this with time.Second expecting to get a second but it's really going to be 1E9 seconds. The workload flags are also time.Duration so you shouldn't need to perform any conversions.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 100 at r1 (raw file):
// Init kv on each tenant. var wg sync.WaitGroup
Throughout: Use a c.NewMonitor, not a WaitGroup. A WaitGroup is oblivious to the cluster becoming unavailable and will cause test hangs that the test harness has a hard time recovering from. Monitor does the right thing.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 105 at r1 (raw file):
go func(pgurl string) { defer wg.Done() cmd := fmt.Sprintf("./cockroach workload run kv --min-block-bytes %d --max-block-bytes %d --splits 10 --init --batch 100 --duration=%ds %s", minBlockSize, maxBlockSize, initDuration, pgurl)
nit: I think it's deprecated to use run --init. If you're just initializing, ./cockroach workload init kv is the right thing to use.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 122 at r1 (raw file):
wg.Wait() // Pull workload performance from crdb_internal.statement_statistics.
I think this is one of the tests that would really benefit from using prometheus instead of ad-hoc metrics stuff. For a recent example, see https://github.com/cockroachdb/cockroach/pull/76147/files#diff-28a39375143eb2ad1e76868f4086035dc8099dc89f98658c3f62f52987945b1eR139-R152. There are also the multi-region DRT tests, drt.go, which do something similar (but that's where I learned from, so my code is probably more easily digested for your use case).
The main reasons I am suggesting prometheus here are that a) you can test whatever metrics you like without any limits and b) if the test fails, you can easily go through the metrics and figure out exactly what was wrong, since there'll be a prometheus snapshot in the artifacts.
Here are the results from the latest full run; some mixed signals, but overall, admission control on looks like a win. Going to look at some metrics for all these to make sure we're really hitting overload conditions.
I'm at a loss on how to turn these tests into reliable pass/fail-style tests and am leaning towards just publishing the numbers so we can track over time how we're doing; any thoughts on this would be much appreciated. I really don't want to add a flaky test that needs constant attention...
That would be my suggestion as well. I would think of these as performance tests, so that data accumulates over time and improvements/regressions can be identified. @stevendanna could you help Tommy figure out how to do that in case he needs help (which I assume he will)? I also wonder if we wrote down the basics somewhere (I'd like to read that myself, because I'm shaky on what exactly we support and where changes need to be made), for example in our knowledge base: https://cockroachlabs.atlassian.net/wiki/spaces/TE/pages/2208629091/Knowledge+Base
sumeerbhola
left a comment
Numbers are quite nice!
I am wondering about the following size-skew.
admission-control-on/kv-fairness/size-skew/run_1/test.log
27:19:03:22 multitenant_fairness.go:236: Max throughput delta: 14% [2.249276e+06 2.351748e+06 2.490747e+06 1.923401e+06]
28:19:03:22 multitenant_fairness.go:247: Max latency delta: 10% [0.0034406955788691936 0.0038719533858636 0.003989169634826118 0.004103228254822516]
is the one with the lowest throughput the one with the largest size? Could it be encountering more hits when reading keys (vs the key not being found)? Also, I am confused how the default hashGenerator for the kv workload behaves when the read percent is 100. Isn't it going to keep reading key 0 since it hasn't seen any writes?
Reviewed 1 of 3 files at r1.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 54 at r3 (raw file):
name: "concurrency-skew", warmUpDuration: func(i int) time.Duration { return 5 * time.Minute }, concurrency: func(i int) int { return i * 8 },
looks like this will be called with i in [1,5]. Is that correct? So lowest concurrency is 8 and highest is 40.
And we are using a 4 cpu KV node?
Do you know if 4 tenants with a concurrency of 8 each are sufficient to saturate cpu for this no writes (kv-fairness) case, and sufficient to saturate the store in the only writes (store-fairness) case below?
Perhaps the delta number below is because a concurrency of 8 is too low to utilize 25% of the store capacity.
admission-control-on/store-fairness/concurrency-skew/run_1/test.log
27:19:04:54 multitenant_fairness.go:236: Max throughput delta: 37% [4335 7058 7875 8621]
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 90 at r3 (raw file):
{ name: "size-skew", warmUpDuration: func(i int) time.Duration { return time.Duration(i*2) * time.Minute },
Do you expect initial size skew to matter in this case that does only writes? I am assuming the writes after setup are not limited to overwriting the existing keys, yes?
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 250 at r3 (raw file):
} // TODO: Can we do anything to verify that admission control kicked in? Check some stats?
there are no tables with admission control stats. Only metrics. We may be able to do something with prometheus wrt doing an aggregation over time. I am fine with leaving a TODO if you've eyeballed these runs and seen admission control kick in.
cucaroach
left a comment
Yes, although there's no clear relationship. I wonder if I should use --max-ops to get an exact size distribution instead of using time, which is probably noisy. I will dig into how the reads behave; I have wondered about that. I also wonder if I should simplify this and do just one kv run invocation, concurrently resetting the stats after the warmup period expires. That should eliminate a lot of pitfalls. Then maybe instead of doing a duration-based size skew I can skew the size based on min/max block sizes.
a discussion (no related file):
Thanks for the review feedback! Here's version 2, I've moved the pods/workloads to their own nodes and expanded things a bit to see how AC handles heterogenous concurrency and database size. Still a work in progress and need to analyze some runs and how well this is exercising things. Any and all feedback welcome!
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 122 at r1 (raw file):
Previously, tbg (Tobias Grieger) wrote…
I think this is one of the tests that would really benefit from using prometheus instead of ad-hoc metrics stuff. For a recent example, see https://github.com/cockroachdb/cockroach/pull/76147/files#diff-28a39375143eb2ad1e76868f4086035dc8099dc89f98658c3f62f52987945b1eR139-R152. There are also the multi-region DRT tests, drt.go, which do something similar (but that's where I learned from, so my code is probably more easily digested for your use case). The main reasons I am suggesting prometheus here are that a) you can test whatever metrics you like without any limits and b) if the test fails, you can easily go through the metrics and figure out exactly what was wrong, since there'll be a prometheus snapshot in the artifacts.
Sounds nice. What can I do with a prometheus snapshot? Can that encompass any metrics from the db, or is it just for metrics I produce from my test?
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 54 at r3 (raw file):
Previously, sumeerbhola wrote…
looks like this will be called with i in [1,5]. Is that correct? So lowest concurrency is 8 and highest is 40.
And we are using a 4 cpu KV node?
Do you know if 4 tenants with a concurrency of 8 each are sufficient to saturate cpu for this no writes (kv-fairness) case, and sufficient to saturate the store in the only writes (store-fairness) case below?
Perhaps the delta number below is because a concurrency of 8 is too low to utilize 25% of the store capacity.
admission-control-on/store-fairness/concurrency-skew/run_1/test.log 27:19:04:54 multitenant_fairness.go:236: Max throughput delta: 37% [4335 7058 7875 8621]
Here's what CPU % looks like for admission-control-on/kv-fairness/same:
Maybe not fully saturated but close...
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 90 at r3 (raw file):
Previously, sumeerbhola wrote…
Do you expect initial size skew to matter in this case that does only writes? I am assuming the writes after setup are not limited to overwriting the existing keys, yes?
Hmm, since I'm not passing write-seq, I guess it's possible we're overwriting existing keys if the seed is stable. One more reason to get rid of the multi-pass approach and just have one workload run.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 250 at r3 (raw file):
Previously, sumeerbhola wrote…
there are no tables with admission control stats. Only metrics. We may be able to do something with prometheus wrt doing an aggregation over time. I am fine with leaving a TODO if you've eyeballed these runs and seen admission control kick in.
Seems like it's kicking in a little. I'll try to crank it up a bit.
sumeerbhola
left a comment
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 54 at r3 (raw file):
Previously, cucaroach (Tommy Reilly) wrote…
Here's what CPU % looks like for admission-control-on/kv-fairness/same:
Maybe not fully saturated but close...
it is not close enough for admission control to kick in for cpu. The default for admission.kv_slot_adjuster.overload_threshold (for runnable goroutines) is 32.
pkg/cmd/roachtest/tests/multitenant_fairness.go, line 250 at r3 (raw file):
Previously, cucaroach (Tommy Reilly) wrote…
Seems like its kicking in a little. I'll try to crank it up a bit.
We do want the admission delay rate to consistently stay positive.
And the delay rate divided by the work rate (i.e., the mean admission latency) should be at least a few ms. Even the 120k spike in the admission delay rate above, when divided by the 20K work rate, is 6 micros, which is too low.
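The arithmetic in that comment, written out explicitly (units as I read them: the delay rate is µs of admission queueing accumulated per second, the work rate is requests admitted per second):

```go
package main

import "fmt"

func main() {
	delayRate := 120000.0 // µs of admission delay accumulated per second (the observed spike)
	workRate := 20000.0   // requests admitted per second
	meanLatencyMicros := delayRate / workRate
	fmt.Printf("mean admission latency: %.0f µs\n", meanLatencyMicros)
	// ~6 µs per request is effectively no queueing; a test that truly
	// exercises admission control should see a few ms here instead.
}
```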
Here are the latest results, which I think look pretty good.
@cucaroach these are older than the ones you shared on slack on April 29th, yes?
After bumping up kv/same concurrency to 250 we get more impressive throughput improvements, but I think the tput_delta is just too consistently low for no-admission. We can hang our hats on the concurrency-skew tput_delta looking good, though.
Rebased and ready to go, I think.
Looks good! "max_tput_delta": 0.129229 is certainly better than "max_tput_delta": 0.457215.
sumeerbhola
left a comment
Reviewed 3 of 7 files at r11, 1 of 6 files at r12, 1 of 2 files at r14, all commit messages.
pkg/cmd/roachtest/tests/multitenant_fairness.go line 54 at r14 (raw file):
{ name: "concurrency-skew", concurrency: func(i int) int { return i * 150 },
this should be i * 250.
We've verified that a concurrency of 250 is enough to consume 25% of the cpu when running "same" with no-admission, since it causes enough queueing in the goroutine scheduler. In the skewed case we want the smallest tenant to continue to be as demanding: with perfect fairness it would get 25%.
pkg/cmd/roachtest/tests/multitenant_fairness.go line 83 at r14 (raw file):
{ name: "concurrency-skew", concurrency: func(i int) int { return i * 20 },
same question. Why not i * 50.
Here are the latest results with beta.5. I don't know what happened, but for store at least, no-admission looks better. Any thoughts for digging into that? I think this happened when we jacked up the operation size to get more LSM L0 files; I'll revert those changes and see what happens.
Actually, reverting the value size changes didn't change much. I keep forgetting that store admission control can and should lower throughput to prevent the LSM from getting unhappy, although it makes me queasy to take performance off the table. I wonder what would happen if we ran this test for days; would the LSM implode at some point with too many L0 files? What would that look like?
I think we'd need to periodically log the slots and tokens allocated to each tenant to debug this.
The node will probably fail heartbeats, or other badness would happen. Tolerating a very poorly shaped LSM in order to get higher write throughput does not work out in a production setting.
Just pointing out
sumeerbhola
left a comment
Reviewable status:
complete! 1 of 0 LGTMs obtained (waiting on @cucaroach, @sumeerbhola, and @tbg)
Informs: cockroachdb#65954

Roachtests intended to validate that the kv and store admission control
queues distribute database resources fairly. There are 8 tests: 2 for
"kv", i.e. CPU stressing, and 2 for "store", which are intended to
stress the LSM. One test is "same", where each of N SQL pods hits the
kvserver with equal concurrency, and another is "concurrency-skew",
which varies the concurrency from each pod.

We measure the variation in throughput across the pods
("max_tput_delta") and the max/min throughput and latency across the
pods. Sample results:

```
multitenant/fairness/kv/concurrency-skew/admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.340925, "max_tput": 483.200000, "min_tput": 256.056667, "max_latency": 3.282347, "min_latency": 0.771148}
multitenant/fairness/kv/concurrency-skew/no-admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.330760, "max_tput": 205.740000, "min_tput": 108.903333, "max_latency": 7.151178, "min_latency": 1.618236}
multitenant/fairness/kv/same/admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.245294, "max_tput": 293.990000, "min_tput": 197.026667, "max_latency": 0.831686, "min_latency": 0.762475}
multitenant/fairness/kv/same/no-admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.031199, "max_tput": 132.443333, "min_tput": 124.676667, "max_latency": 1.915801, "min_latency": 1.776664}
multitenant/fairness/store/same/admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.018095, "max_tput": 139.950000, "min_tput": 136.336667, "max_latency": 0.346295, "min_latency": 0.341212}
multitenant/fairness/store/same/no-admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.001886, "max_tput": 149.296667, "min_tput": 148.878333, "max_latency": 0.306853, "min_latency": 0.303392}
multitenant/fairness/store/concurrency-skew/admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.024872, "max_tput": 143.875000, "min_tput": 138.346667, "max_latency": 1.094262, "min_latency": 0.346674}
multitenant/fairness/store/concurrency-skew/no-admission/run_1/1.perf/stats.json
{ "max_tput_delta": 0.005439, "max_tput": 148.848333, "min_tput": 147.500000, "max_latency": 1.007741, "min_latency": 0.313597}
```

Release note: None
bors r+ |
Build succeeded: |



