HttpEventHandler.body should avoid being resized #5444

franz1981 · 2025-01-13T19:45:37Z

Handling chunks requires appending Buffer(s) in the body stored at HttpEventHandler.
See

vert.x/vertx-core/src/main/java/io/vertx/core/http/impl/HttpEventHandler.java

Line 54 in d5f613c

body.appendBuffer(chunk);

This can both create excessive heap footprint and performing useless copies.
It would be ideal if we could retain the chunks accumulating them in some list or queue and eventually allocating a single buffer to copy or transferring them to a composite buffer (without copying anything).
I have used a similar strategy in https://github.com/quarkusio/quarkus/blob/b870c90569a4c88467313587565205818034f4df/independent-projects/vertx-utils/src/main/java/io/quarkus/vertx/utils/AppendBuffer.java#L19

geoand · 2025-01-13T19:51:15Z

This would be very useful in Quarkus when a large body is returned as part of the REST Client call

franz1981 · 2025-01-13T19:56:55Z

In addition, without this we could probably suffer from GC nepotism (see jbossas/jboss-threads#74) since the body, while being resized, can fall into young gen, while the http event handler, since it collect all chunks, can become old gen, still referencing it, and making it to become old as well, even if not required.
Nulling out the body field once not needed, should help.

gsmet · 2025-01-14T10:33:28Z

To give an idea of the problem: to get a 365 MB file in chunks with the REST Client in Quarkus, we end up allocating 16 GB of byte[] through Buffer operations.

vietj · 2025-01-14T11:04:22Z

this method should only be used for small payloads, that does not mean that we cannot optimize the current one though, yet I am not advocating to use this for large payload, the javadoc says Don't use this if your request body is large - you could potentially run out of RAM.

geoand · 2025-01-14T11:08:44Z

this method should only be used for small payloads

This is of course true, but:

A large payload like the one @gsmet mentioned while clearly is a pathological case, it should however not result in an almost 2 orders of magnitude increase in the necessary memory size. If I am trying to grab a 300MB payload and I have say 600MB of heap, shouldn't I be able to get it?
In the REST Client (from where the issue was found by @gsmet) , it's often impossible to know how large the retrieved payload will be. Users should use one of the other ways we provide, but we can't know how large the payload is actually going to be - this even differ during the lifetime of the application.

gsmet · 2025-01-14T11:18:44Z

Especially when dealing with chunks as you have no idea of the size of the response.

vietj · 2025-01-14T12:35:00Z

I did say "that does not mean that we cannot optimize the current one though" so of course we can improve the impl and we will :-), I was just giving my opinion.

when dealing with unknown size, I believe it should be conservative.

geoand · 2025-01-14T12:35:51Z

🙏🏽

gsmet · 2025-01-14T12:41:01Z

Don't use this if your request body is large - you could potentially run out of RAM

To clarify, @vietj, I'm perfectly aware that loading 365 MB in RAM is a bad idea (and it was even worse as we were copying the buffer so it was more like 750 MB of RAM).
But my main concern here is the memory allocation storm we end up having.

I will do some tests with various sizes to give you an idea of how it goes with 1 MB/2 MB/10 MB/100 MB so that we have a baseline. I have a Byteman script that can handle this.

gsmet · 2025-01-14T15:33:29Z

Here is what I came up with.
The amplification factor is allocations / size of payload.

It's relatively stable until 20 MB and then gets a lot worse. Still we allocate 4 to 5 times the size of the payload. Note that this might not only be due to the Vert.x side.
After a certain size, it evolves linearly from what I can see and not for the better: for a 400 MB file, we allocate more than 20 GB of memory.

franz1981 · 2025-01-14T15:51:39Z

Yep the enlargement of the buffer happens based on what Netty decide - which is a linear for a bit than move to power of 2 increases

EmadAlblueshi · 2025-01-15T07:01:05Z

Hi @franz1981 out of curiosity is this considered similar to your issue? https://github.com/eclipse-vertx/vert.x/blob/master/vertx-core/src/main/java/io/vertx/core/parsetools/impl/RecordParserImpl.java#L292

franz1981 · 2025-01-18T20:48:25Z

Yep @EmadAlblueshi it is similar; appending can be dangerous unless it happens very few times.

franz1981 added the enhancement label Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HttpEventHandler.body should avoid being resized #5444

HttpEventHandler.body should avoid being resized #5444

franz1981 commented Jan 13, 2025 •

edited

Loading

geoand commented Jan 13, 2025

franz1981 commented Jan 13, 2025 •

edited

Loading

gsmet commented Jan 14, 2025

vietj commented Jan 14, 2025

geoand commented Jan 14, 2025 •

edited

Loading

gsmet commented Jan 14, 2025

vietj commented Jan 14, 2025

geoand commented Jan 14, 2025

gsmet commented Jan 14, 2025

gsmet commented Jan 14, 2025 •

edited

Loading

franz1981 commented Jan 14, 2025

EmadAlblueshi commented Jan 15, 2025

franz1981 commented Jan 18, 2025

HttpEventHandler.body should avoid being resized #5444

HttpEventHandler.body should avoid being resized #5444

Comments

franz1981 commented Jan 13, 2025 • edited Loading

geoand commented Jan 13, 2025

franz1981 commented Jan 13, 2025 • edited Loading

gsmet commented Jan 14, 2025

vietj commented Jan 14, 2025

geoand commented Jan 14, 2025 • edited Loading

gsmet commented Jan 14, 2025

vietj commented Jan 14, 2025

geoand commented Jan 14, 2025

gsmet commented Jan 14, 2025

gsmet commented Jan 14, 2025 • edited Loading

franz1981 commented Jan 14, 2025

EmadAlblueshi commented Jan 15, 2025

franz1981 commented Jan 18, 2025

franz1981 commented Jan 13, 2025 •

edited

Loading

franz1981 commented Jan 13, 2025 •

edited

Loading

geoand commented Jan 14, 2025 •

edited

Loading

gsmet commented Jan 14, 2025 •

edited

Loading