rabbit_quorum_queue: Shrink batches of QQs in parallel by the-mikedavis · Pull Request #15081 · rabbitmq/rabbitmq-server

the-mikedavis · 2025-12-05T22:29:29Z

Shrinking a member node off of a QQ can be parallelized. The operation involves

removing the node from the QQ's cluster membership (appending a command to the log and committing it) with ra:remove_member/3
updating the metadata store to remove the member from the QQ type state with rabbit_amqqueue:update/2
deleting the queue data from the node with ra:force_delete_server/2 if the node can be reached

All of these operations are I/O bound. Updating the cluster membership and metadata store involves appending commands to those logs and replicating them. Writing commands to Ra synchronously in serial is fairly slow - sending many commands in parallel is much more efficient. By parallelizing these steps we can write larger chunks of commands to WAL(s).

ra:force_delete_server/2 benefits from parallelizing if the node being shrunk off is no longer reachable, for example in some hardware failures. The underlying rpc:call/4 will attempt to auto-connect to the node and this can take some time to time out. By parallelizing this, each rpc:call/4 reuses the same underlying distribution entry and all calls fail together once the connection fails to establish.

Discussed in #15057

the-mikedavis · 2025-12-05T22:31:39Z

With this change and the default 64 set here (just a sensible-seeming constant) I see my test in #15057 of shrinking from 1000 QQs go from taking ~2hrs to taking 1min52sec.

deps/rabbit/src/rabbit_quorum_queue.erl

kjnilsson · 2025-12-08T07:55:37Z

This looks fine to me, at least for now.

It would be quite possible to get much higher throughput on this and use command pipelining instead of spawning a bunch of processes just to exercise the WAL more. We'd need to add that as an option to the Ra API however.

the-mikedavis · 2025-12-08T15:34:47Z

Ah yeah, with pipelining we could use the WAL much more efficiently. That shouldn't be too bad to add to Ra - just a new function in ra that would use ra_server_proc:cast_command/3, right? Once mnesia is gone we could use Khepri async commands for the metadata store updates so both of those parts could be done with pipelining.

I'm actually more worried about the ra:force_delete_server/2 part since that step can take a while (7 seconds) if the connection to the node times out. An easy way around that would be adding a function in rabbit_quorum_queue to call ra:force_delete_server/2 on all queues after the membership and metadata store parts are done. Then it would just be one RPC call which could time out.

In the meantime making this parallel seems like an easy improvement since we can continue using the delete_member/2 helper. But in the long run we should definitely use pipelining instead 👍

michaelklishin · 2025-12-25T01:45:52Z

@kjnilsson do you have any more feedback on the updated version?

kjnilsson · 2026-01-13T07:32:40Z

deps/rabbit/src/rabbit_quorum_queue.erl

+                                       amqqueue:get_type(Q) == ?MODULE,
+                                       lists:member(Node, get_nodes(Q))]),
+    Parent = self(),
+    lists:flatten([begin


could you use ra_lib:partition_parallel/2|3 here?

Not directly: shrink/2 returns the current or updated size of the cluster and that's used in the output of the rabbitmq-queues shrink command. With ra_lib:partition_parallel/2 we need to return a boolean so we can't add the size info. Implementation-wise though, this looks nearly the same.

Shrinking a member node off of a QQ can be parallelized. The operation involves * removing the node from the QQ's cluster membership (appending a command to the log and committing it) with `ra:remove_member/3` * updating the metadata store to remove the member from the QQ type state with `rabbit_amqqueue:update/2` * deleting the queue data from the node with `ra:force_delete_server/2` if the node can be reached All of these operations are I/O bound. Updating the cluster membership and metadata store involves appending commands to those logs and replicating them. Writing commands to Ra synchronously in serial is fairly slow - sending many commands in parallel is much more efficient. By parallelizing these steps we can write larger chunks of commands to WAL(s). `ra:force_delete_server/2` benefits from parallelizing if the node being shrunk off is no longer reachable, for example in some hardware failures. The underlying `rpc:call/4` will attempt to auto-connect to the node and this can take some time to time out. By parallelizing this, each `rpc:call/4` reuses the same underlying distribution entry and all calls fail together once the connection fails to establish.

michaelklishin reviewed Dec 5, 2025

View reviewed changes

deps/rabbit/src/rabbit_quorum_queue.erl Outdated Show resolved Hide resolved

the-mikedavis force-pushed the md/parallel-shrink branch 2 times, most recently from f14957d to a14595d Compare December 8, 2025 15:46

the-mikedavis mentioned this pull request Dec 8, 2025

Add functions to pipeline membership changes rabbitmq/ra#566

Open

the-mikedavis marked this pull request as ready for review December 8, 2025 18:35

the-mikedavis force-pushed the md/parallel-shrink branch from a14595d to ea57c83 Compare December 27, 2025 21:23

kjnilsson reviewed Jan 13, 2026

View reviewed changes

the-mikedavis force-pushed the md/parallel-shrink branch from ea57c83 to b74999d Compare February 24, 2026 15:32

lukebakken force-pushed the md/parallel-shrink branch from b74999d to 511692a Compare February 24, 2026 22:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rabbit_quorum_queue: Shrink batches of QQs in parallel#15081

rabbit_quorum_queue: Shrink batches of QQs in parallel#15081
the-mikedavis wants to merge 1 commit intomainfrom
md/parallel-shrink

the-mikedavis commented Dec 5, 2025 •

edited

Loading

Uh oh!

the-mikedavis commented Dec 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

kjnilsson commented Dec 8, 2025

Uh oh!

the-mikedavis commented Dec 8, 2025

Uh oh!

michaelklishin commented Dec 25, 2025

Uh oh!

kjnilsson Jan 13, 2026

Uh oh!

the-mikedavis Jan 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

the-mikedavis commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

the-mikedavis commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

kjnilsson commented Dec 8, 2025

Uh oh!

the-mikedavis commented Dec 8, 2025

Uh oh!

michaelklishin commented Dec 25, 2025

Uh oh!

kjnilsson Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

the-mikedavis Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

the-mikedavis commented Dec 5, 2025 •

edited

Loading

the-mikedavis commented Dec 5, 2025 •

edited

Loading

the-mikedavis Jan 13, 2026 •

edited

Loading