API delete endpoint intermittently runs into an exception in rabbit_exchange:serialise_events/1
#11952
-
Describe the bugI'm maintaining an automation tool that manages rabbitMQ resources. The tool sometimes needs to create or delete tens to multiple hundreds of resources at once. If the request count is bigger than 10 the API often returns an 400 error:
There's mostly two or more failing requests at once. Above logs show one case with two of them(note the timestamp). I suspect there's a race-condition deleting bindings or something, but I'm speculating. Either way, a more useful error than the one above would be really helpful. Reproduction stepsI'll provide a script for repro needed. Expected behaviorHuman readable error message. Additional contextServer version: 3.13.6 |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 4 replies
-
@rauno56 we cannot suggest anything with the amount of information. The stack trace at scenario that involves a binding deletion, so my best guess is that it is what we was up to a couple of years ago. I won't guess any further, we do not guess in this community. In 4.x with Khepri, binding deletion will be single operation in the data store instead of a transaction over a potentially large number of rows, which is the case with Mnesia. |
Beta Was this translation helpful? Give feedback.
-
Looks like an exchanged that has just been deleted concurrently is passed on to a function that notifies exchange decorators. It could simply be more defensive as I don't know what the error message would look like and how that would help anyone. |
Beta Was this translation helpful? Give feedback.
-
What kind of information could I provide more that would help to get to the bottom of this?
If that's the case(I cannot read Erlang's stack traces tbh), I speculate that it has to do with deleting two exchanges or a exchange and queue that are bound together in the same binding.
Any further context so I can confirm?
If the above speculations are right it should not even error. If they are not right, it should error with something that explains how to avoid it - because it is in fact possible to avoid it by adding a delay between delete requests. |
Beta Was this translation helpful? Give feedback.
-
A potentially highly relevant discussion where I explain what may be the root cause and what the possible options are (in |
Beta Was this translation helpful? Give feedback.
A potentially highly relevant discussion where I explain what may be the root cause and what the possible options are (in
4.0.x
) #13030.