
Re-build remote cluster connections on credential changes#103460

Merged
elasticsearchmachine merged 76 commits into elastic:main from n1v0lg:rcs2-rebuild-connections
Jan 11, 2024

Conversation

@n1v0lg
Contributor

@n1v0lg n1v0lg commented Dec 14, 2023

This PR builds on #102798 by adding automatic remote connection rebuilding on cluster credentials changes. In particular, we rebuild a remote cluster connection if a credential for the associated cluster is newly added (i.e., we are moving from RCS 1.0 -> RCS 2.0) or removed (moving from RCS 2.0 -> RCS 1.0). A connection rebuild allows us to associate the correct profile (_remote_server in case of RCS 2.0, or "regular" transport profile for RCS 1.0) without requiring end-users to manually update remote cluster settings via a settings update call.
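The rebuild trigger described above — diffing the old and new credential sets and rebuilding any connection whose credential was added or removed — can be sketched roughly as follows. This is an illustrative standalone sketch, not the actual Elasticsearch code; the class and method names are hypothetical.

```java
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: a connection must be rebuilt when a credential for its
// alias is newly added (RCS 1.0 -> 2.0) or removed (RCS 2.0 -> 1.0).
public class CredentialDiff {
    public static Set<String> aliasesToRebuild(Map<String, String> oldCreds, Map<String, String> newCreds) {
        Set<String> changed = new HashSet<>();
        for (String alias : oldCreds.keySet()) {
            if (!newCreds.containsKey(alias)) {
                changed.add(alias); // credential removed: fall back to the regular RCS 1.0 transport profile
            }
        }
        for (String alias : newCreds.keySet()) {
            if (!oldCreds.containsKey(alias)) {
                changed.add(alias); // credential added: switch to the _remote_server profile
            }
        }
        return changed;
    }

    public static void main(String[] args) {
        Set<String> result = aliasesToRebuild(
            Map.of("a", "key1", "b", "key2"),
            Map.of("b", "key2", "c", "key3")
        );
        System.out.println(result); // "a" was removed, "c" was added, "b" is unchanged
    }
}
```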

More context on connection rebuilding also in this comment.

Relates: ES-6764

n1v0lg and others added 30 commits December 7, 2023 15:43
…elastic/elasticsearch into revert-103211-revert-102798-rcs2-reload
Contributor

@jakelandis jakelandis left a comment


I haven't reviewed the tests yet, but looking good ... a couple comments.

// We avoid stashing and marking context as system to keep the action as minimal as possible (i.e., avoid copying context)
remoteClusterService.updateRemoteClusterCredentials(request.getSettings());
listener.onResponse(ActionResponse.Empty.INSTANCE);
assert Transports.assertNotTransportThread("Remote connection re-building is too much for a transport thread");
Contributor


nice ! TIL

);
assert future.isDone() : "expecting local-only action call to return immediately on invocation";
future.actionGet(0, TimeUnit.NANOSECONDS);
future.actionGet(10, TimeUnit.SECONDS);
Contributor


I'm not sure we want a timeout here. If the timeout is hit, it will not cancel the underlying work, and other than returning sooner I'm not sure what it buys us. It only encourages calling the API again, which can compound the already unhealthy scenario (and we will never be able to guess a correct upper bound unless it is huge). If anything, the per-connection call could have a timeout/retry, but if it does not already, I don't see a need to add one now. I.e., use the explicit listener variant instead of the implicit listener used in the future to make the call to the action.
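The suggested shape — pass the caller's listener through instead of blocking on a future with a guessed timeout — looks roughly like this in plain java.util.concurrent terms. CompletableFuture stands in for PlainActionFuture here; the names are illustrative, not the Elasticsearch API.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

public class ListenerVsFuture {
    // Blocking variant: the caller must pick an arbitrary upper bound, and
    // hitting it does not cancel the underlying rebuild work.
    static String blockingCall(CompletableFuture<String> work) throws Exception {
        return work.get(10, TimeUnit.SECONDS); // guessed timeout
    }

    // Listener variant: completion (or failure) propagates whenever the
    // rebuild actually finishes; no guessed timeout is needed.
    static void listenerCall(CompletableFuture<String> work,
                             Consumer<String> onResponse,
                             Consumer<Throwable> onFailure) {
        work.whenComplete((result, err) -> {
            if (err != null) onFailure.accept(err); else onResponse.accept(result);
        });
    }

    public static void main(String[] args) {
        CompletableFuture<String> work = new CompletableFuture<>();
        listenerCall(work, r -> System.out.println("done: " + r), e -> System.out.println("failed"));
        work.complete("rebuilt"); // prints "done: rebuilt" once the work completes
    }
}
```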

return;
}

final PlainActionFuture<ActionResponse.Empty> future = new PlainActionFuture<>();
Contributor


Can we short circuit this entirely if the credentials did not change ?

Contributor Author


It's tricky: the mere absence of credentials in the settings doesn't mean a noop (it could be that we removed credentials) -- anything beyond that would require access to the remote cluster service, which is not available here. I originally wanted to add a short-circuit here as well but didn't come up with a good way to do it.

GroupedActionListener<Void> groupedListener
) {
if (remoteClusters.containsKey(clusterAlias)) {
updateRemoteCluster(clusterAlias, settings, true, ActionListener.wrap(status -> {
Contributor


does this handle tearing down a connection after removing a credential ?

Contributor Author


If a connection exists and we remove a credential, updateRemoteCluster handles re-building it. If the connection does not exist, that means the cluster settings for it were removed, and updateRemoteCluster will have torn it down on the regular settings update call that removed those settings. So if a connection no longer exists but a credential for it does, and that credential is then removed, there is nothing left to tear down. Let me know if I misinterpreted the question!

Contributor

@jfreden jfreden left a comment


Very solid work on this PR! It looks great. Easy to follow, good tests, and a lot of thought has been put into covering corner cases. I only have some optional comments.

One thing I couldn't really wrap my head around is if the reload of credentials is graceful or not? What happens to ongoing CCR and CCS requests when the credentials are reloaded? Are they just going to fail until the new connection is up? Maybe that's fine.

final GroupedActionListener<Boolean> groupedListener = new GroupedActionListener<>(
totalConnectionsToRebuild,
listener.map(successFlags -> {
logger.info("rebuild complete for [{}] connections after credentials update", successFlags.size());
Contributor


Cool! I like the GroupedActionListener.
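For readers unfamiliar with it, GroupedActionListener fans in N per-connection results and fires the wrapped listener exactly once, after the last result arrives. A stripped-down sketch of the pattern (not the real Elasticsearch class):

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Consumer;

// Minimal fan-in listener: collects `expected` results, then invokes the
// delegate exactly once with all of them.
public class MiniGroupedListener<T> {
    private final Consumer<List<T>> delegate;
    private final List<T> results = new CopyOnWriteArrayList<>();
    private final AtomicInteger remaining;

    public MiniGroupedListener(int expected, Consumer<List<T>> delegate) {
        this.delegate = delegate;
        this.remaining = new AtomicInteger(expected);
    }

    public void onResponse(T result) {
        results.add(result);
        if (remaining.decrementAndGet() == 0) {
            delegate.accept(results); // all connection rebuilds have reported in
        }
    }

    public static void main(String[] args) {
        MiniGroupedListener<Boolean> grouped = new MiniGroupedListener<>(3,
            flags -> System.out.println("rebuild complete for [" + flags.size() + "] connections"));
        grouped.onResponse(true);
        grouped.onResponse(true);
        grouped.onResponse(false); // prints: rebuild complete for [3] connections
    }
}
```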

Strings.collectionToCommaDelimitedString(clusterCredentials.keySet())
)
);
public final synchronized UpdateRemoteClusterCredentialsResult updateClusterCredentials(Settings settings) {
Contributor


This is only called from a single-threaded context. Might consider keeping it synchronized for future proofing, but it adds some overhead to acquire an extra lock.

Contributor Author


Yes great point. I meant to take a closer look at being less heavy handed with synchronization but didn't get to it. Thanks for prompting me to think about this again. I agree that with the current structure, we can remove synchronization here. I'll think on it and tweak some things 👍

Contributor Author


Discussed this on Slack also:

  • I prefer to keep this one synchronized for future proofing
  • I removed sorting the cluster aliases, as it adds unnecessary overhead without utility -- originally, I sorted with the notion of preventing deadlocks in RemoteClusterService::updateRemoteClusterCredentials (since we iterate over aliases and call a synchronized method for each, there is a potential deadlock). However, updateRemoteClusterCredentials is itself synchronized at the top level, so we're safe.

Thanks for raising and discussing this 👍

remoteClusters.put(clusterAlias, remote);
remote.ensureConnected(listener.map(ignored -> RemoteClusterConnectionStatus.CONNECTED));
} else if (remote.shouldRebuildConnection(newSettings)) {
} else if (forceRebuild || remote.shouldRebuildConnection(newSettings)) {
Contributor


nit: Since we know that this will always be true when we reload credentials, it might make sense to break this in to its own method to avoid the boolean in the method signature.

Contributor Author


Agreed that the flag is not ideal, but having stared at the method for a bit, I don't see a good way to refactor it without basically duplicating it or ending up with some sort of boolean flag anyway -- did you have a specific tweak in mind? Partly, I wanted to avoid touching too much of the connection rebuilding code since it's fairly complex and outside of the security team's scope.

Contributor


Makes sense! It's fairly complex so I think it's out of scope to refactor any of that. 👍

@@ -20,18 +22,80 @@ public void testResolveRemoteClusterCredentials() {
final String clusterAlias = randomAlphaOfLength(9);
final String otherClusterAlias = randomAlphaOfLength(10);

Contributor


If the CredentialsManager is expected to be thread safe, a concurrency test could be useful in this class.

Contributor Author


I didn't get around to this yet -- I think a concurrency test for RemoteClusterService::updateRemoteClusterCredentials would give us the most useful coverage. I think it's a really nice-to-add test but don't want to defer another round of review for it. My plan is to add it still.
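A concurrency test along the lines discussed might take a shape like the following. This is a generic sketch under assumed names (a plain ConcurrentHashMap stands in for the credential state; the real test would exercise RemoteClusterService::updateRemoteClusterCredentials), using a CyclicBarrier so all updater threads overlap as much as possible.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CyclicBarrier;

public class ConcurrentUpdateSketch {
    // Fire `threads` credential updates at (almost) the same instant and
    // return the resulting credential map.
    static Map<String, String> runConcurrentUpdates(int threads) throws InterruptedException {
        Map<String, String> credentials = new ConcurrentHashMap<>();
        CyclicBarrier barrier = new CyclicBarrier(threads); // maximize overlap between updaters
        List<Thread> workers = new ArrayList<>();
        for (int i = 0; i < threads; i++) {
            final int id = i;
            Thread t = new Thread(() -> {
                try {
                    barrier.await(); // all threads proceed together
                    credentials.put("cluster_" + id, "secret_" + id);
                } catch (Exception e) {
                    throw new RuntimeException(e);
                }
            });
            workers.add(t);
            t.start();
        }
        for (Thread t : workers) t.join();
        return credentials;
    }

    public static void main(String[] args) throws InterruptedException {
        // Every update must survive the concurrent run.
        System.out.println(runConcurrentUpdates(8).size() == 8);
    }
}
```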

Contributor Author


I'll log a follow up item to address this one in Jira to not block getting this out the door.


import static org.hamcrest.Matchers.containsInAnyOrder;

// account for slow stored secure settings updates (involves removing and re-creating the keystore)
Contributor


Would be interesting with a test case that blasts the API with concurrent reload requests combined with config updates.

Contributor Author


Agreed. It wouldn't work as a REST-level test because of the keystore file rewrite, but I'll see if there is something lower-level we can do.

@n1v0lg
Contributor Author

n1v0lg commented Dec 22, 2023

@jfreden thanks for the review!

One thing I couldn't really wrap my head around is if the reload of credentials is graceful or not? What happens to ongoing CCR and CCS requests when the credentials are reloaded? Are they just going to fail until the new connection is up?

Not very graceful, indeed, and it may cause CCS/CCR to fail. This is existing behavior, in that you can also trigger a connection rebuild by updating remote cluster settings. To avoid these sorts of failures, our migration guide recommends stopping long-running cross-cluster operations: https://www.elastic.co/guide/en/elasticsearch/reference/current/remote-clusters-migrate.html#remote-clusters-migration-stop

try {
final PlainActionFuture<RemoteClusterConnectionStatus> future = new PlainActionFuture<>();
updateRemoteCluster(clusterAlias, settings, true, future);
final RemoteClusterConnectionStatus status = future.actionGet(10, TimeUnit.SECONDS);
Member


I do not like stacking the timeouts one after the other like this. If we have, say, 150 remotes, and they all hit the 10s timeout, then with this logic we'll be waiting for 1500s = 25 minutes. If we need to time out at all then we should have one top-level timeout, and TBH I'd rather it was in the caller seeing as how we don't do anything (except logging) when we hit the timeout anyway.

I know we have this problem when applying cluster states too, and IMO that's a bug. It's tricky to solve in that case because of the mechanics of how settings are applied, but I would rather avoid making the same mistake here too.

Contributor Author


Fair point. I don't feel strongly about the per-connection timeout (nor the top-level timeout for that matter).

If we have 150 remotes and each connection takes >10s to rebuild, the overall request will still wait for 25m. A top-level timeout could prevent this, but it's difficult to pick a reasonable default, since I expect rebuild times to vary greatly based on the network situation for a given connection (@jakelandis points this out here, which prompted me to consider a per-connection timeout) -- e.g., a healthy connection will rebuild in under a second, but a slow/unstable network could result in long waits.

It could be nice to add a timeout parameter to the initial, generic reload-secure-settings request but we have no way of passing this down to the individual plugin reload call, since the reload() call only takes settings as input and I would prefer not to change the ReloadablePlugin interface here since that's a bigger change that requires separate input and deliberation.

The only other alternative that comes to mind is adding a dedicated setting where end-users can set a connection rebuild timeout but that feels hacky and like an overkill since, as you point out, we don't do anything (except logging) when we hit the timeout anyway.

Given that it's hard to come up with a generic top-level timeout, and that individual connection timeouts can give us very high (and pretty arbitrary) wait times, I prefer to remove the timeout altogether for now and possibly revisit this in the future. @DaveCTurner @jakelandis let me know if you have objections.

Member


I would be happy with no timeout here I think, especially if we trigger all the connection attempts in parallel since there's other timeouts in play lower down the stack, preventing this from really waiting forever.

I do think it's worth reconsidering the synchronous nature of the reload() API, which forces us to call each plugin in turn (or do something worse like spawning a thread per plugin). Stuff like this should be async all the way down the stack IMO and then these concerns wouldn't really arise.

@n1v0lg n1v0lg requested a review from jakelandis January 8, 2024 17:11
Contributor

@jakelandis jakelandis left a comment


LGTM, nice work !


public void testUpgradeFromRcs1() throws Exception {
// Setup RCS 1.0 and check that it works
configureRemoteCluster("my_remote_cluster", fulfillingCluster, true, randomBoolean(), randomBoolean());
Contributor


nit: would it be possible to assert that no API key is configured for my_remote_cluster? Just a bit paranoid about ensuring this is really RCS 1.0. If it is anything but trivial, feel free to ignore this.

Contributor Author


configureRemoteCluster calls checkRemoteConnection under the hood which ensures that no credential is configured when basicSecurity is true here

Checking the keystore would be a bit tricky, off the top of my head. Hope this is sufficient!

assumeFalse(
"Cannot run in FIPS mode since the keystore will be password protected and sending a password in the reload"
+ "settings api call, requires TLS to be configured for the transport layer",
inFipsJvm()
Contributor


requires TLS to be configured for the transport layer

where is this enforced? Also, isn't TLS enabled for the transport layer in these tests?

Contributor


Also, could you log an issue for follow up. Ideally we wouldn't skip these in FIPS mode.

Contributor Author

@n1v0lg n1v0lg Jan 10, 2024


Ah good catch! The message is a copy-paste hoopla. FIPS is still a problem though, around insufficient password length for the keystore password. I'll tweak the message and log a follow up to address this.

@jakelandis jakelandis added the :Security/FIPS Running ES in FIPS 140-2 mode label Jan 10, 2024
@n1v0lg
Contributor Author

n1v0lg commented Jan 11, 2024

@elasticmachine update branch

@n1v0lg n1v0lg added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jan 11, 2024
@n1v0lg
Contributor Author

n1v0lg commented Jan 11, 2024

@elasticmachine update branch

@elasticsearchmachine elasticsearchmachine merged commit f84bda7 into elastic:main Jan 11, 2024
@n1v0lg n1v0lg deleted the rcs2-rebuild-connections branch January 11, 2024 12:39

Labels

auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) >non-issue :Security/FIPS Running ES in FIPS 140-2 mode :Security/Security Security issues without another label Team:Security Meta label for security team test-update-serverless v8.13.0
