multitenant: implement a fallback rate #70163
Conversation
andy-kimball
left a comment
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner)
ajwerner
left a comment
If I had a nit, it's that I don't like the word "backup" as much as I like "fallback". How come you chose "backup", which is very much a term in CockroachDB, and probably something users can initiate?
Reviewed 1 of 5 files at r1, 9 of 14 files at r2, all commit messages.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @RaduBerinde)
pkg/ccl/multitenantccl/tenantcostclient/tenant_side.go, line 366 at r2 (raw file):
    ctx context.Context, resp *roachpb.TokenBucketResponse,
    ) {
        if log.V(1) {
drive-by: this should be log.ExpensiveLogEnabled(ctx, 1)
Yeah, I agree. I thought of "fallback" after I wrote most of the code. I will rename.
e135a8b to 52e88c8
I fell back to the backup word
ajwerner
left a comment
Reviewed 1 of 5 files at r1, 12 of 12 files at r3, all commit messages.
Reviewable status: complete! 1 of 0 LGTMs obtained (and 1 stale) (waiting on @RaduBerinde)
52e88c8 to 56f94dd
bors r+

Build succeeded:
Encountered an error creating backports. Some common things that can go wrong:
- You might need to create your backport manually using the backport tool.
- error creating merge commit from 56f94dd to blathers/backport-release-21.2-70163: POST https://api.github.com/repos/cockroachdb/cockroach/merges: 409 Merge conflict []; you may need to manually resolve merge conflicts with the backport tool.

Backport to branch 21.2.x failed. See errors above.

🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is otan.
tenantcostclient: maintain a "buffer" of RUs
This change adjusts the tenant cost controller logic to try to
maintain a "buffer" of 5000 RUs. This is useful to prevent waiting for
more RUs if an otherwise lightly loaded pod suddenly gets a spike of
traffic.
Release note: None
multitenant: implement a "backup" rate
This change implements a "backup" (fallback) throttling rate that a
SQL pod can use if it stops being able to complete token bucket
requests.
The goal is to keep tenants without burst RUs throttled and tenants with
lots of RUs unthrottled (or throttled at a high rate). To achieve
this, we calculate a rate at which the tenant would burn through all
their available RUs within 1 hour. The premise here is that if we have
some kind of infrastructure problem, 1 hour is a reasonable time frame
to address it. Beyond 1 hour, the tenant will continue at the same
rate, consuming more RUs than they had available.
Informs #68479.
Release note: None
Release justification: Necessary fix for the distributed rate limiting
functionality, which is vital for the upcoming Serverless MVP release.
It allows CRDB to throttle clusters that have run out of free or paid
request units (which measure CPU and I/O usage). This functionality is
only enabled in multi-tenant scenarios and should have no impact on
our dedicated customers.