Skip to content

Fix federation supervisor crash during upgrade to 4.2.x on multi-node cluster (backport #15252)#15254

Merged
michaelklishin merged 2 commits intov4.2.xfrom
mergify/bp/v4.2.x/pr-15252
Jan 13, 2026
Merged

Fix federation supervisor crash during upgrade to 4.2.x on multi-node cluster (backport #15252)#15254
michaelklishin merged 2 commits intov4.2.xfrom
mergify/bp/v4.2.x/pr-15252

Conversation

@mergify
Copy link
Copy Markdown

@mergify mergify bot commented Jan 13, 2026

Proposed Changes

In a multi-node cluster after a rolling upgrade from below 4.2 to 4.2
supervisor rabbit_federation_exchange_link_sup_sup crashed because
rabbit_federation_link_sup:start_link had arity 1 until 4.1.x. PR
mirrored supervisor preserves the child definitions which still
include a call with arity 1 (without the link module).

To keep old child specs valid, add back a start_link/1 function in rabbit_federation_link_sup.

Fixes #15239

Run the test with

SECONDARY_DIST=$(PWD)/secondary/rabbitmq_server-4.1.7 make -C  deps/rabbitmq_exchange_federation ct-exchange t=rolling_upgrade:child_id_format

Without the patch the test case rolling_upgrade:child_id_format fails with:

=== Location: [{erpc,call,1366},
              {exchange_SUITE,'-child_id_format/1-fun-5-',675},
              {lists,foreach_1,2310},
              {exchange_SUITE,child_id_format,670},
              {test_server,ts_tc,1794},
              {test_server,run_test_case_eval1,1303},
              {test_server,run_test_case_eval,1235}]
=== === Reason: {exception,
                     {noproc,
                         {gen_server,call,
                             [rabbit_federation_exchange_link_sup_sup,
                              which_children,infinity]}}}

Types of Changes

What types of changes does your code introduce to this project?
Put an x in the boxes that apply

  • Bug fix (non-breaking change which fixes issue #NNNN)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause an observable behavior change in existing systems)
  • Documentation improvements (corrections, new content, etc)
  • Cosmetic change (whitespace, formatting, etc)
  • Build system and/or CI

Checklist

Put an x in the boxes that apply.
You can also fill these out after creating the PR.
This is simply a reminder of what we are going to look for before merging your code.

  • Mandatory: I (or my employer/client) have have signed the CA (see https://github.com/rabbitmq/cla)
  • I have read the CONTRIBUTING.md document
  • I have added tests that prove my fix is effective or that my feature works
  • All tests pass locally with my changes
  • If relevant, I have added necessary documentation to https://github.com/rabbitmq/rabbitmq-website
  • If relevant, I have added this change to the first version(s) in release-notes that I expect to introduce it

Further Comments

If this is a relatively large or complex change, kick off the discussion by explaining why you chose the solution
you did and what alternatives you considered, etc.


This is an automatic backport of pull request #15252 done by [Mergify](https://mergify.com).

In a multi-node cluster after a rolling upgrade from below 4.2 to 4.2
supervisor `rabbit_federation_exchange_link_sup_sup` crashed because
`rabbit_federation_link_sup:start_link` had arity 1 until 4.1.x. PR
mirrored supervisor preserves the child definitions which still
include a call with arity 1 (without the link module).

To keep old child specs valid, add back a start_link/1 function in `rabbit_federation_link_sup`.

Fixes https://github.com/rabbitmq/rabbitmq-server/discussions/15239

(cherry picked from commit 0c71c1a)
Without the patch the test case rolling_upgrade:child_id_format fails with:
```
=== Location: [{erpc,call,1366},
              {exchange_SUITE,'-child_id_format/1-fun-5-',675},
              {lists,foreach_1,2310},
              {exchange_SUITE,child_id_format,670},
              {test_server,ts_tc,1794},
              {test_server,run_test_case_eval1,1303},
              {test_server,run_test_case_eval,1235}]
=== === Reason: {exception,
                     {noproc,
                         {gen_server,call,
                             [rabbit_federation_exchange_link_sup_sup,
                              which_children,infinity]}}}
```

(cherry picked from commit 4eca000)
@michaelklishin michaelklishin added this to the 4.2.3 milestone Jan 13, 2026
@michaelklishin michaelklishin merged commit 1809320 into v4.2.x Jan 13, 2026
573 of 575 checks passed
@michaelklishin michaelklishin deleted the mergify/bp/v4.2.x/pr-15252 branch January 13, 2026 15:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants