streamingccl: create new tenant automatically for replication stream by samiskin · Pull Request #83646 · cockroachdb/cockroach

samiskin · 2022-06-30T13:42:40Z

Resolves #76952

Originally tenants had to be created manually prior to replication
stream creation. This change instead creates a new tenant upon calling
the RESTORE INTO command using the tenant ID that was provided. The
tenant is considered inactive until the cutover point, at which the
tenant is activated.

Release note: None

cockroach-teamcity · 2022-06-30T13:42:47Z

This change is

samiskin · 2022-06-30T13:46:31Z

I purposefully did not include the necessary OnFailOrCancel behaviour of tearing down the tenant just yet since #83310 is still in-flight as of this comment. I'll add tenant teardown after it merges.

The code to validate that the tenant is inactive is commented out until #83650 is resolved. Right now the tenant is marked inactive but its still possible to open a SQL connection to it and query its tables.

samiskin · 2022-06-30T14:49:48Z

pkg/ccl/streamingccl/streamingest/stream_ingestion_planning.go

+		if _, err := sql.GetTenantRecord(ctx, p.ExecCfg(), nil, newTenantID.ToUint64()); err == nil {
+			return errors.Newf("tenant with id %s already exists", newTenantID)
+		}
+		if err := p.ExtendedEvalContext().Tenant.CreateInactiveTenant(ctx, newTenantID.ToUint64()); err != nil {


I elected to use the full CreateTenant code rather than just creating a record like Restore does because it seemed like just creating a record did eventually initialize the rest of the tenant such that I run queries against it, but prior to that point a sql connection would error and ActivateTenant didn't look like it would necessarily force the initialization so I was concerned about a stream that was created and then completed very quickly, activating the tenant prior to it being actually ready. I didn't end up looking into the mechanism for this eventual initialization.

Coming back to this. I think in the long run we want to do something a little different.

The problem with initialising the tenant is that it means that the keyspace we are writing into isn't completely empty. Namely it will have entries schema entries for the default database, the cluster version setting, the id sequence populated. If this destination tenant is created after the source tenant, then the timestamp on those KVs will likely be above whatever gets replicated to us.

because it seemed like just creating a record did eventually initialize the rest of the tenant such that I run queries against it

Did you see this outside of the streaming replication tests? If you only saw it in the streaming replication tests, my guess for what you were seeing is that eventually enough of the source cluster's tenant data is replicated to it for it to be viable.

I think your theory on "the tenant works when enough has been streamed in" is true, since attempting to connect right after stream creation doesn't work, connecting after waiting 2 seconds but without cutover still works, and connecting after cutover always works.

stevendanna

Thanks for jumping into this. This looks reasonable to me. I've left some comments but nothing blocking.

pkg/ccl/streamingccl/streamingest/stream_ingestion_job.go

pkg/ccl/streamingccl/streamingest/stream_ingestion_job_test.go

pkg/ccl/streamingccl/streamingest/stream_ingestion_job.go

Originally tenants had to be created manually prior to replication stream creation. This change instead creates a new tenant upon calling the RESTORE INTO command using the tenant ID that was provided. The tenant is considered inactive until the cutover point, at which the tenant is activated. Release note: None

samiskin · 2022-07-11T23:59:10Z

bors r+

craig · 2022-07-12T02:09:51Z

Build succeeded:

GitHub CI (Cockroach)

samiskin mentioned this pull request Jun 30, 2022

sql: connecting to an inactive tenant should return an error #83650

Closed

samiskin force-pushed the replication-stream-create-tenant branch 5 times, most recently from 1632472 to 7bdf88d Compare June 30, 2022 14:41

samiskin marked this pull request as ready for review June 30, 2022 14:42

samiskin requested a review from a team June 30, 2022 14:42

samiskin requested a review from a team as a code owner June 30, 2022 14:42

samiskin requested review from a team, HonoreDB, gh-casper and msbutler and removed request for a team June 30, 2022 14:42

samiskin commented Jun 30, 2022

View reviewed changes

samiskin force-pushed the replication-stream-create-tenant branch from 7bdf88d to bf84b8a Compare June 30, 2022 14:58

msbutler requested review from stevendanna and removed request for msbutler June 30, 2022 18:17

samiskin force-pushed the replication-stream-create-tenant branch from bf84b8a to ac4af26 Compare June 30, 2022 18:35

stevendanna approved these changes Jun 30, 2022

View reviewed changes

pkg/ccl/streamingccl/streamingest/stream_ingestion_job.go Outdated Show resolved Hide resolved

pkg/ccl/streamingccl/streamingest/stream_ingestion_job_test.go Outdated Show resolved Hide resolved

pkg/ccl/streamingccl/streamingest/stream_ingestion_job.go Outdated Show resolved Hide resolved

samiskin force-pushed the replication-stream-create-tenant branch 4 times, most recently from 5e692c0 to 267427e Compare July 4, 2022 01:12

knz mentioned this pull request Jul 8, 2022

sql: BACKUP TENANT / RESTORE ... AS TENANT does not accept placeholders #84075

Closed

samiskin force-pushed the replication-stream-create-tenant branch 2 times, most recently from 0617def to 53c05df Compare July 11, 2022 19:19

samiskin force-pushed the replication-stream-create-tenant branch from 53c05df to e9fe630 Compare July 11, 2022 20:58

craig bot merged commit 0aa3c44 into cockroachdb:master Jul 12, 2022

shermanCRL added the A-tenant-streaming Including cluster streaming label Jul 29, 2022

shermanCRL added this to the 22.2 milestone Jul 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

streamingccl: create new tenant automatically for replication stream#83646

streamingccl: create new tenant automatically for replication stream#83646
craig[bot] merged 1 commit intocockroachdb:masterfrom
samiskin:replication-stream-create-tenant

samiskin commented Jun 30, 2022 •

edited

Loading

Uh oh!

cockroach-teamcity commented Jun 30, 2022

Uh oh!

samiskin commented Jun 30, 2022 •

edited

Loading

Uh oh!

samiskin Jun 30, 2022

Uh oh!

stevendanna Jul 6, 2022

Uh oh!

samiskin Jul 11, 2022

Uh oh!

stevendanna left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

samiskin commented Jul 11, 2022

Uh oh!

craig bot commented Jul 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

samiskin commented Jun 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cockroach-teamcity commented Jun 30, 2022

Uh oh!

samiskin commented Jun 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

samiskin Jun 30, 2022

Choose a reason for hiding this comment

Uh oh!

stevendanna Jul 6, 2022

Choose a reason for hiding this comment

Uh oh!

samiskin Jul 11, 2022

Choose a reason for hiding this comment

Uh oh!

stevendanna left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

samiskin commented Jul 11, 2022

Uh oh!

craig bot commented Jul 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

samiskin commented Jun 30, 2022 •

edited

Loading

samiskin commented Jun 30, 2022 •

edited

Loading