Federation: parallel shutdown; disconnect links before stopping#15271

Merged
michaelklishin merged 10 commits into main from mk-federation-parallel-disconnect on Jan 16, 2026
Conversation


@michaelklishin michaelklishin commented Jan 15, 2026

Technical design pair: @ansd.

Proposed Changes

This PR makes the most expensive part of federation link shutdown — closing AMQP 0-9-1 connections to the upstream — parallel, by notifying links in the prep_stop shutdown callback.

This yields very significant efficiency gains with hundreds or thousands of links, all without changing the supervision tree structure.

Why Not Use simple_one_for_one?

The simple_one_for_one OTP supervisor restart strategy would indeed shut down all child processes concurrently for us. However, it would require changing the child identity (key)
to an Erlang PID, which in turn would require intrusive and hard-to-test changes (such as
an ETS table that maps PIDs to the current identities and back).

Throttling to Avoid Overwhelming the Upstream

To avoid overwhelming the upstream schema data store (which could be a 7-9 node cluster on 3.x with Mnesia), we limit the degree of parallelism and add batching with configurable throttling delays into the process.

The entire link shutdown process is now capped at 180 seconds (by default), and should not meaningfully exceed that time period even on nodes with many thousands of links.

By default we close up to 128 links per batch, with a 50 ms delay, and a 180 second hard cap (timeout) for the entire link termination operation.
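The batching policy above can be sketched as follows. This is a hedged illustration in Python, not the actual Erlang implementation; `shut_down_links` and `close_link` are hypothetical names, and only the defaults (128 links per batch, 50 ms delay, 180 s hard cap) come from this PR.

```python
import concurrent.futures
import time

BATCH_SIZE = 128      # links closed concurrently per batch (default)
BATCH_DELAY = 0.050   # pause between batches, in seconds (default: 50 ms)
TOTAL_CAP = 180.0     # hard cap for the entire termination, in seconds

def shut_down_links(links, close_link):
    """Close links in throttled parallel batches, honoring the hard cap.

    `close_link` closes one upstream connection; links not closed
    before the cap expires are abandoned (their processes will be
    terminated by the supervisor regardless).
    """
    deadline = time.monotonic() + TOTAL_CAP
    closed = 0
    with concurrent.futures.ThreadPoolExecutor(max_workers=BATCH_SIZE) as pool:
        for start in range(0, len(links), BATCH_SIZE):
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break  # hard cap reached: stop waiting on the rest
            batch = links[start:start + BATCH_SIZE]
            futures = [pool.submit(close_link, link) for link in batch]
            done, _ = concurrent.futures.wait(futures, timeout=remaining)
            closed += len(done)
            time.sleep(BATCH_DELAY)  # throttle to spare the upstream
    return closed
```

The key property is that the per-batch wait uses the remaining overall budget, so a slow upstream cannot stretch the shutdown past the cap.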

Data Safety Considerations

Federation uses publisher confirms by default, and most users never change that setting, so
aggressive connection closure is safe and acceptable.

In addition, the user can set resource-cleanup-mode to never to make sure that the
upstream resources (e.g. internal queues used by exchange federation) are never deleted
by the links running in the downstream cluster.

Show Me The Benchmark Data

Microbenchmarks (Supervisor Child Process Termination)

Below are microbenchmarks that measure everything except the actual
AMQP 0-9-1 connection termination, run on an 8-core aarch64 CPU from 2022:

┌───────┬──────────┬────────────┬─────────┐
│ Links │ Parallel │ Sequential │ Speedup │
├───────┼──────────┼────────────┼─────────┤
│ 1,000 │ 16ms     │ 6,401ms    │ ~400x   │
├───────┼──────────┼────────────┼─────────┤
│ 5,000 │ 30ms     │ 32,689ms   │ ~1,000x │
└───────┴──────────┴────────────┴─────────┘

Worst Case Scenario Calculations

If we consider the worst case scenario where every link connection close hits its timeout,
shutting down 1K links would take about 83 minutes with the sequential (status quo) version and about 5.6 seconds (see below) with these changes.
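As a back-of-the-envelope check, the 83-minute figure works out if we assume a per-connection close timeout of roughly 5 seconds (an assumption for illustration; the 5.6 s parallel figure is the measured result, not derived here):

```python
LINKS = 1_000
CLOSE_TIMEOUT_S = 5.0  # assumed per-connection close timeout

# Sequential (status quo): closes run back to back, so timeouts add up.
sequential_s = LINKS * CLOSE_TIMEOUT_S   # 5,000 s
sequential_min = sequential_s / 60
print(round(sequential_min))             # ≈ 83 minutes
```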

Real World Federation Links with Outgoing Connections

With a throttling delay of 0, the time it takes to shut down N links to a remote upstream
cluster looks like this:

┌───────┬─────────────────┐
│ Links │      Time       │
├───────┼─────────────────┤
│ 10    │ 100ms           │
├───────┼─────────────────┤
│ 50    │ 580ms           │
├───────┼─────────────────┤
│ 100   │ 1,067ms         │
├───────┼─────────────────┤
│ 1,000 │ 5,579ms         │
└───────┴─────────────────┘

Maintenance Mode Integration

Maintenance mode integration of these changes needs to be done with care: since maintenance mode stops all client connection listeners, we run the risk of stopping the listeners
before this part of the federation shutdown has a chance to do its job as designed.

For that reason, we have to special case the federation plugins in the core and first trigger their termination, then stop the listeners.
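The ordering constraint can be modeled as a minimal sketch (all names here are hypothetical, not the actual RabbitMQ maintenance mode API): federation termination is triggered first, and only then are the listeners stopped.

```python
class Node:
    """Toy stand-in for a node being drained; records the call order."""
    def __init__(self):
        self.calls = []

    def stop_federation_links(self):
        self.calls.append("federation")

    def suspend_listeners(self):
        self.calls.append("listeners")

    def close_client_connections(self):
        self.calls.append("clients")

def drain(node):
    # Federation is special-cased: links must close their upstream
    # connections while networking is still fully functional.
    node.stop_federation_links()
    # Only then stop accepting and serving client connections.
    node.suspend_listeners()
    node.close_client_connections()
```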

When the node is revived (the maintenance mode is rolled back), all links are restarted.

@michaelklishin michaelklishin added this to the 4.3.0 milestone Jan 15, 2026
@michaelklishin michaelklishin requested a review from ansd January 15, 2026 18:10
without it, the new keys (or rather, their defaults) will spill into the `config_schema_SUITE`s of other plugins.
michaelklishin added a commit that referenced this pull request Jan 15, 2026
We implement the `revive/0` part for symmetry. As with the revive command in general, it serves as a last resort available for rollback.

Usually nodes put into maintenance mode are shortly stopped for upgrading or reconfiguration.
michaelklishin and others added 4 commits January 15, 2026 13:24
Previously, the following three supervisors used the wrong `shutdown`
and wrong `type`:
* rabbit_exchange_federation_sup
* rabbit_federation_sup
* rabbit_queue_federation_sup

For `shutdown` Erlang/OTP recommends:
"If the child process is another supervisor, the shutdown time must be
set to infinity to give the subtree ample time to shut down. Setting the
shutdown time to anything other than infinity for a child of type supervisor
can cause a race condition where the child in question unlinks its own children,
but fails to terminate them before it is killed."

For `type` Erlang/OTP recommends:
"type specifies if the child process is a supervisor or a worker.
The type key is optional. If it is not specified, it defaults to
worker."

This commit fixes the wrong child spec by using a timeout of `infinity`
and type `supervisor`.

(cherry picked from commit cfcf6cf)
 ## What?

Federation links started in the federation plugins are put
under the `rabbit` app supervision tree (unfortunately).

This commit ensures that the entire federation supervision hierarchies
(including all federation links) are stopped **before** stopping app
`rabbit` when stopping RabbitMQ.

 ## Why?

Previously, we have seen cases where hundreds of federation links were
stopped during the shutdown procedure of app `rabbit`, leading to
federation link restarts happening in parallel with vhosts being stopped.
In one case, the shutdown of app `rabbit` even got stuck (although there
is no evidence that federation was the cause).

Either way, the cleaner approach is to gracefully stop all federation
links, i.e. the entire supervision hierarchy under
`rabbit_exchange_federation_sup` and `rabbit_queue_federation_sup`,
when stopping the federation apps, i.e. **before** proceeding to stop
app `rabbit`.

 ## How?

The boot step cleanup steps for the federation plugins are skipped when
stopping RabbitMQ.

Hence, this commit ensures that the supervisors are stopped in the
stop/1 application callback.

This commit does something similar to #14054
but uses a simpler approach.

(cherry picked from commit 8bffa58)
when the core now interacts with a part of the
supervision tree owned by this plugin for
more efficient shutdown.

@ansd ansd left a comment


make run-broker PLUGINS="rabbitmq_exchange_federation"
./sbin/rabbitmqctl set_parameter federation-upstream origin '{"uri":"amqp://localhost:5672"}'
./sbin/rabbitmqctl set_policy exchange-federation "^amq.direct" '{"federation-upstream-set":"all"}' --priority 10 --apply-to exchanges
./sbin/rabbitmq-upgrade drain

will print the following warning every 5 seconds:

2026-01-16 11:05:04.797918+01:00 [warning] <0.1080.0> Federation exchange 'amq.direct' in vhost '/' did not connect to exchange 'amq.direct' in vhost '/' on amqp://localhost:5672. Reason: {error,
2026-01-16 11:05:04.797918+01:00 [warning] <0.1080.0>                                                                                                                                        econnrefused}

With 5k federation links, this results in 1k warnings logged per second when RabbitMQ is put into maintenance mode.
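The warning rate follows directly from the retry interval: each link logs one warning per 5-second retry, so 5k links produce 1k warnings per second.

```python
links = 5_000
retry_interval_s = 5  # each link logs one warning per retry interval
warnings_per_second = links / retry_interval_s
print(warnings_per_second)  # 1000.0
```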

@michaelklishin michaelklishin merged commit cdede67 into main Jan 16, 2026
291 checks passed
@michaelklishin michaelklishin deleted the mk-federation-parallel-disconnect branch January 16, 2026 17:26
mergify bot pushed a commit that referenced this pull request Jan 16, 2026
mergify bot pushed a commit that referenced this pull request Jan 16, 2026
We implement the `revive/0` part for symmetry. As with the revive command in general, it serves as a last resort available for rollback.

Usually nodes put into maintenance mode are shortly stopped for upgrading or reconfiguration.

(cherry picked from commit 283aa0e)
mergify bot pushed a commit that referenced this pull request Jan 16, 2026
mergify bot pushed a commit that referenced this pull request Jan 16, 2026
(cherry picked from commit 19bb842)
mergify bot pushed a commit that referenced this pull request Jan 16, 2026
(cherry picked from commit 59e9f7a)
michaelklishin added a commit that referenced this pull request Jan 16, 2026
Federation: disconnect links before stopping, in parallel (backport #15271)
michaelklishin added a commit that referenced this pull request Jan 16, 2026
This is a backport of #15271 to v4.1.x with its
single federation plugin repository structure.
@michaelklishin

Some numbers on the effectiveness of these changes:

(benchmark chart attached as an image)

@michaelklishin michaelklishin changed the title Federation: disconnect links before stopping, in parallel Federation: parallel shutdown; disconnect links before stopping Jan 24, 2026
michaelklishin added a commit that referenced this pull request Feb 24, 2026
michaelklishin added a commit that referenced this pull request Feb 24, 2026
We implement the `revive/0` part for symmetry. As with the revive command in general, it serves as a last resort available for rollback.

Usually nodes put into maintenance mode are shortly stopped for upgrading or reconfiguration.
michaelklishin added a commit that referenced this pull request Feb 24, 2026
michaelklishin added a commit that referenced this pull request Feb 24, 2026
michaelklishin added a commit that referenced this pull request Feb 24, 2026