Fix #563: filesizeformat with proper accuracy by seijikun · Pull Request #568 · askama-rs/askama

seijikun · 2025-08-12T13:31:24Z

I refactored the current humansize / filesize implementation from f32 to u128 and added more unit-tests.
Additionally, the new refactoring now allows changing the requested precision (not yet plumbed on the code generation side).

u128 is required to handle the SI Prefixes:

    ((AsciiChar::new(b'Z'), 1e-19), 1e24),
    ((AsciiChar::new(b'Y'), 1e-22), 1e27),
    ((AsciiChar::new(b'R'), 1e-25), 1e30),
    ((AsciiChar::new(b'Q'), 1e-28), 1e33),

Getting something as performant as the original version is quite challenging. Here are my benchmark results for the current solution:

Benchmarks (time for 250000000 format & display invocations):

Original (f32): 1.9542559s
This MR, but with u64: 2.638588188s
This MR (u128): 3.29419516s

Kijewski · 2025-08-12T13:47:19Z

Working on u128 would probably be faster than working on f32. I guess formatting directly into the output stream is what makes the new code slower.

GuillaumeGomez · 2025-08-12T13:50:58Z

Working on u128 would probably be faster than working on f32. I guess formatting directly into the output stream is what makes the new code slower.

Nope, if floats and integers are of the same size, the floats are always slower, however floats are always faster than bigger integers (so u64 > f32 > u32).

Also, what kind of numbers would we need above 18 exabytes (u64::MAX)? O.o (I think u64 is more than enough)

seijikun · 2025-08-12T13:56:21Z

I knew binary search was probably slower than linear search in this case, but I didn't think it was noticeable...
But holy cow...

Now I'm at 1.832858975s with u128 - faster than the current implementation :)

seijikun · 2025-08-12T13:58:37Z

Also, what kind of numbers would we need above 18 exabytes (u64::MAX)? O.o (I think u64 is more than enough)

I tried to port it and couldn't represent the previous functionality, because the set of SI_PREFIXES in the current implementation overflowed the u64, so it was either ... delete them or u128 🤷

seijikun · 2025-08-12T14:47:53Z

Just some final formatting improvements.
Could one of you plumb the precision with an optional parameter, please?
I wasn't able to get that to work.

GuillaumeGomez · 2025-08-12T14:50:59Z

Might be better if you did so you can learn: what did you try?

There are a few things needed to make it work: the filesizeformat function needs to take all args and the optional ones need to use Option. Then in askama_derive, depending on the arguments, you call the function with Some or None for the optional args.

seijikun · 2025-08-12T15:36:56Z

Might be better if you did so you can learn: what did you try?

There are a few things needed to make it work: the filesizeformat function needs to take all args and the optional ones need to use Option. Then in askama_derive, depending on the arguments, you call the function with Some or None for the optional args.

I had trouble understanding how the new argument parsing stuff works (how to fill the default_value field). But I think I got it to work now

seijikun · 2025-08-12T20:47:32Z

One more improvement. Instead of just stupidly limiting the precision to the completely arbitrary number of 15 due to numerical problems, I sat down and did some thinking:

Allowing a larger precision than the chosen si-prefix's exponent doesn't make any sense, because except for the super-edgecase of more than 1000 quetta bytes, the additional digits will all just be 0s, which are stripped anyway.
This way, I can guarantee that this is always true: scale <= unit.1.
Furthermore, both numbers are 10-based, so this is also always true: (unit.1 % scale) == 0.
This allowed me to change the fractional calculation from:

let mut fractional = remainder.saturating_mul(scale) / unit.1;

to:

let mut fractional = remainder / (unit.1 / scale);

which effectively avoids overflow due to the multiplication, while allowing much higher precision.
(The affect on runtime performance is negligible - to my own surprise)

GuillaumeGomez · 2025-08-12T21:12:22Z

+            ' '.write_into(dest, values)?;
+            unit.0.write_into(dest, values)?;
+            'B'.write_into(dest, values)?;


Suggested change

' '.write_into(dest, values)?;

unit.0.write_into(dest, values)?;

'B'.write_into(dest, values)?;

write!(dest, " {}B", unit..0)?;

I can do this change, but I would strongly advise against it.
This will outright destroy performance.
(I came from there 😅)

Wait really? I'm surprised, generally the I/O calls are slow so we try to do as little of them as possible.

The std calls are SUPER slow. That's why I use askama's FastWritable::write_into() everywhere.
I can do 100s of calls of FastWritable::write_into() and will still be faster than one write!().
Inlining sure can work wonders :)

GuillaumeGomez · 2025-08-13T10:00:31Z

Looks all good to me, great work!

Kijewski

Thank you!

seijikun · 2025-08-13T12:41:26Z

Thank you two!

seijikun force-pushed the mr-filter-humansize branch 2 times, most recently from f989c7c to 92d705c Compare August 12, 2025 13:37

Kijewski reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

Kijewski reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

Kijewski reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

seijikun force-pushed the mr-filter-humansize branch from 92d705c to 17ef961 Compare August 12, 2025 13:54

GuillaumeGomez reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

seijikun force-pushed the mr-filter-humansize branch 2 times, most recently from 3a563a9 to 7b1e266 Compare August 12, 2025 14:46

seijikun force-pushed the mr-filter-humansize branch from 7b1e266 to 39278c9 Compare August 12, 2025 14:48

seijikun force-pushed the mr-filter-humansize branch 2 times, most recently from dcedaf9 to 74474ff Compare August 12, 2025 15:35

github-advanced-security AI found potential problems Aug 12, 2025

View reviewed changes

Comment thread fuzzing/fuzz/src/filters.rs Fixed

seijikun force-pushed the mr-filter-humansize branch from 74474ff to c8740f1 Compare August 12, 2025 20:04

Kijewski reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

seijikun force-pushed the mr-filter-humansize branch 2 times, most recently from 3aed30a to 1706904 Compare August 12, 2025 20:38

GuillaumeGomez reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

GuillaumeGomez reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

GuillaumeGomez reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

Kijewski reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

GuillaumeGomez reviewed Aug 12, 2025

View reviewed changes

Comment thread askama/src/filters/humansize.rs Outdated

seijikun force-pushed the mr-filter-humansize branch from 1706904 to 968a1d4 Compare August 12, 2025 21:47

Fix askama-rs#563: filesizeformat with proper accuracy

767ae9e

seijikun force-pushed the mr-filter-humansize branch from 968a1d4 to 767ae9e Compare August 13, 2025 09:57

GuillaumeGomez approved these changes Aug 13, 2025

View reviewed changes

Kijewski approved these changes Aug 13, 2025

View reviewed changes

Kijewski merged commit 59a9820 into askama-rs:master Aug 13, 2025
42 checks passed

Kijewski mentioned this pull request Aug 13, 2025

v0.15 release planning #477

Closed

Conversation

seijikun commented Aug 12, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Kijewski commented Aug 12, 2025

Uh oh!

GuillaumeGomez commented Aug 12, 2025

Uh oh!

seijikun commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seijikun commented Aug 12, 2025

Uh oh!

Uh oh!

seijikun commented Aug 12, 2025

Uh oh!

GuillaumeGomez commented Aug 12, 2025

Uh oh!

seijikun commented Aug 12, 2025

Uh oh!

Uh oh!

Uh oh!

seijikun commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

GuillaumeGomez Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

seijikun Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

GuillaumeGomez Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

seijikun Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

GuillaumeGomez commented Aug 13, 2025

Uh oh!

Kijewski left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

seijikun commented Aug 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

seijikun commented Aug 12, 2025 •

edited

Loading

seijikun commented Aug 12, 2025 •

edited

Loading

seijikun Aug 12, 2025 •

edited

Loading