Conversation

@teh-cmc
Contributor

@teh-cmc teh-cmc commented Oct 14, 2025

This fixes an issue on 64-bit platforms where encoding large payloads succeeds, but decoding them subsequently fails.
The root cause is that write_integer expects host-sized integers (i.e. usize), while read_integer always assumes 32-bit integers.

This PR:

  • Adds a test demonstrating the issue (revert the fix to see it fail).
  • Fixes the issue by expecting usize everywhere. We could go all the way and use u64 everywhere instead, but I think keeping things host-sized by default is fine (yes, that means a 32-bit host might still fail when decoding large enough data from a 64-bit client).
  • Removes the unsafe implementation of write_integer. It is incompatible with values larger than 32 bits, and I don't think it is worth its weight in complexity in any case, given how optimizer-friendly the safe version already is. Happy to revert that part if needed.
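To illustrate the mismatch, here is a simplified, self-contained sketch (not the actual lz4_flex source) of LZ4's LSIC length encoding, which appends 255-bytes until the remainder fits in one byte. The function names mirror the write_integer/read_integer pair discussed above, but the bodies are illustrative only:

```rust
/// LSIC-style length encoding: emit 255 until the remainder fits in a byte.
fn write_integer(out: &mut Vec<u8>, mut n: usize) {
    while n >= 255 {
        out.push(255);
        n -= 255;
    }
    out.push(n as u8);
}

/// A decoder that accumulates into u32, like the pre-fix read_integer.
/// (wrapping_add models the silent wraparound a release build would do.)
fn read_integer_u32(input: &[u8]) -> (u32, usize) {
    let mut n: u32 = 0;
    let mut i = 0;
    loop {
        let b = input[i];
        i += 1;
        n = n.wrapping_add(b as u32);
        if b != 255 {
            return (n, i);
        }
    }
}

/// The fixed decoder: host-sized accumulation. On a 32-bit host this has
/// the same limitation, which matches the caveat in the PR description.
fn read_integer_usize(input: &[u8]) -> (usize, usize) {
    let mut n: usize = 0;
    let mut i = 0;
    loop {
        let b = input[i];
        i += 1;
        n += b as usize;
        if b != 255 {
            return (n, i);
        }
    }
}

fn main() {
    // A length only representable on 64-bit hosts.
    let big = (u32::MAX as usize) + 1000;
    let mut buf = Vec::new();
    write_integer(&mut buf, big);

    // Host-sized decoding round-trips; 32-bit accumulation wraps.
    assert_eq!(read_integer_usize(&buf).0, big);
    assert_ne!(read_integer_u32(&buf).0 as usize, big);
}
```

With a value just past u32::MAX, the usize decoder round-trips the length while the u32 decoder wraps to a small bogus value, which is the encode-succeeds/decode-fails behavior described above.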

@PSeitz
Owner

PSeitz commented Oct 26, 2025

What's your use case? The block format is not a good match for such large payloads, since it requires everything to be in memory (compressed and decompressed data). The frame format is a better choice for large payloads.

@teh-cmc
Contributor Author

teh-cmc commented Oct 27, 2025

> What's your use case? The block format is not a good match for such large payloads, since it requires everything to be in memory (compressed and decompressed data). The frame format is a better choice for large payloads.

We use LZ4 blocks as a low-level primitive within a larger protocol that already manages its own framing. Under normal operation, these blocks are around ~1 MiB in size, but in certain rare edge cases outside our control, they can grow significantly larger. We’re fine with the extra memory cost in those situations, as long as the rest of the system remains stable.

@PSeitz PSeitz merged commit c1483c4 into PSeitz:main Oct 27, 2025
3 checks passed
@PSeitz
Owner

PSeitz commented Oct 27, 2025

Thanks for the PR!

About the unsafe implementation of write_integer: the main difference currently seems to be for incompressible data, where the gap is large (~30%). But compressing incompressible data should already be fast enough, since it has its own fast path.

@emilk

emilk commented Oct 28, 2025

Thanks for merging this 🙏
Is there a patch release planned?

@PSeitz
Owner

PSeitz commented Nov 11, 2025

Released in 0.12.0.

@teh-cmc
Contributor Author

teh-cmc commented Nov 12, 2025

Thank you!

teh-cmc added a commit to rerun-io/rerun that referenced this pull request Nov 12, 2025
This is a follow up to #11525. It
fixes the remaining half of
#11516.

We don't want to merge it as is yet. Let's see if we can get the fix
merged upstream first, so we don't have to depend on yet another fork:
* PSeitz/lz4_flex#192

---

* Fixes #11516

---

Checks:
* [x] `rerun` with `--nb_scans 3000` now works
* [x] `rerun rrd stats` with `--nb_scans 3000` now works
IsseW pushed a commit to rerun-io/rerun that referenced this pull request Nov 12, 2025