Ban use of Math.fma across the entire codebase by rmuir · Pull Request #12014 · apache/lucene

rmuir · 2022-12-13T01:17:03Z

When FMA is not supported by the hardware, these methods fall back to BigDecimal usage [1] which causes them to be 2500x slower [2].

While most hardware from the last 10 years likely has the support, out of the box both VirtualBox and QEMU don't pass through FMA support (for the latter, at least, you can tweak it with e.g. -cpu host or similar to fix this).

This creates a terrible undocumented performance trap, see [3] for an example of a 30x slowdown of an entire application. In my experience, developers are often too far detached from the production reality, and that reality is: we're not deploying to MacBook Pros in production. Almost all of us are using virtualization, so we can't afford such performance traps.

Practically it would be an issue too: e.g. the Policeman Jenkins instance that runs our tests currently uses VirtualBox. It would be bad for vector-search tests to suddenly get 30x slower.

We can't safely use this method anywhere, as we don't have access to check CPUID or anything to see if it will be insanely slow or not. Let's ban it completely: I'm concerned it will sneak into our codebase otherwise... it almost happened before: #10718

[1] Math.java source code
[2] Comment on JIRA issue for x86 intrinsic mentioning 2500x speedup
[3] VirtualBox bug for lack of FMA support

When FMA is not supported by the hardware, these methods fall back to BigDecimal usage which causes them to be 2500x slower. While most hardware from the last 10 years likely has the support, out of the box both VirtualBox and QEMU don't pass through FMA support (for the latter, at least, you can tweak it with e.g. -cpu host or similar to fix this). This creates a terrible undocumented performance trap. Prevent it from sneaking into our codebase.

gsmiller · 2022-12-13T01:28:51Z

+1, seems reasonable to me. We can always remove this ban in the future if there's a good reason, but seems reasonable to put this in place to prevent it sneaking in for now.

rmuir · 2022-12-13T01:32:19Z

Yeah, I think if the fallback java code was 2x, 4x, or 8x slower (like you would expect from these intrinsics), we wouldn't be having this conversation :)

benwtrent · 2022-12-14T14:54:16Z

Holy crap, creating BigDecimal and then multiplying & adding is crazy. This is a completely unacceptable fallback calculation for this method.

+1 on banning its use in the code base.
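For readers who have not looked at the fallback, here is a minimal sketch of what a BigDecimal-based fused multiply-add looks like next to the plain `a * b + c` expression. It is a simplified illustration, not the JDK source: the class and method names (`FmaFallbackSketch`, `bigDecimalFma`, `plainMultiplyAdd`) are made up for this example, and the special-case handling is reduced to the bare minimum.

```java
import java.math.BigDecimal;

// Hypothetical illustration only; not copied from java.lang.Math.
final class FmaFallbackSketch {
    private FmaFallbackSketch() {}

    // Software fused multiply-add: exact product, exact sum, one final rounding.
    static double bigDecimalFma(double a, double b, double c) {
        // Simplification: NaN, infinities and zero factors go through plain
        // arithmetic, where the product is exact or the result is non-finite.
        if (!Double.isFinite(a) || !Double.isFinite(b) || !Double.isFinite(c)
                || a == 0.0 || b == 0.0) {
            return a * b + c;
        }
        // Every finite double converts to BigDecimal exactly, so these steps
        // allocate several objects but lose no precision. Caveat: if the exact
        // result is zero, the sign of zero is not tracked here.
        BigDecimal exactProduct = new BigDecimal(a).multiply(new BigDecimal(b));
        BigDecimal exactSum = exactProduct.add(new BigDecimal(c));
        return exactSum.doubleValue(); // the only rounding step
    }

    // Plain expression: the product is rounded to double before the addition.
    static double plainMultiplyAdd(double a, double b, double c) {
        return a * b + c;
    }
}
```

With `a = 1 + 2^-30`, `b = 1 - 2^-30` and `c = -1`, the exact product is `1 - 2^-60`. The plain version rounds that product to `1.0` and returns `0.0`, while the fused version returns `-2^-60`. That extra accuracy is all the fallback buys, at the cost of arbitrary-precision arithmetic on every call.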

dweiss · 2022-12-14T15:42:10Z

I honestly don't know who can use this method without any provided CPUID check... We actually use fma in our code but do so by detecting the performance difference between a naive implementation on primitive types and Math.fma (during bootstrap). It's ugly like hell but the difference is so vast that it works. I'm not sure who'd ever gain from using the BigDecimal-based implementation...
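A rough sketch of the kind of bootstrap probe described in the comment above might look like the following. This is an assumption about the general approach, not dweiss's actual code: the class name `FmaProbe`, its methods, the iteration count and the slowdown threshold are all hypothetical, and a timing-based check like this is inherently noisy.

```java
// Hypothetical startup probe: decide once whether Math.fma is usable.
final class FmaProbe {
    private FmaProbe() {}

    // Times a chain of multiply-adds. Each step depends on the previous one,
    // so the JIT cannot discard the loop.
    static long timeNanos(boolean useFma, int iterations) {
        double acc = 0.0;
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            double x = 1.0 + i * 1e-9;
            acc = useFma ? Math.fma(x, 0.5, acc) : x * 0.5 + acc;
        }
        long elapsed = System.nanoTime() - start;
        if (Double.isNaN(acc)) { // consume acc so the work stays observable
            throw new IllegalStateException("unexpected NaN");
        }
        return elapsed;
    }

    // Returns true if Math.fma is at most maxSlowdown times slower than a*b+c.
    static boolean fmaLooksFast(int iterations, double maxSlowdown) {
        // Warm-up passes so both code paths get compiled before measuring.
        timeNanos(true, iterations);
        timeNanos(false, iterations);
        long fused = timeNanos(true, iterations);
        long plain = timeNanos(false, iterations);
        return fused <= Math.max(plain, 1L) * maxSlowdown;
    }

    // Dot product that only uses Math.fma when the probe allowed it.
    static float dot(float[] a, float[] b, boolean useFma) {
        float sum = 0f;
        for (int i = 0; i < a.length; i++) {
            sum = useFma ? Math.fma(a[i], b[i], sum) : a[i] * b[i] + sum;
        }
        return sum;
    }
}
```

A caller would run `FmaProbe.fmaLooksFast(...)` once during startup and cache the result. Because the slow path is reported to be orders of magnitude slower, even a generous threshold should separate the two cases, which is presumably why such a crude measurement works at all.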

rmuir · 2022-12-14T15:55:18Z

I looked at what e.g. glibc does here as a fallback out of curiosity. For floats it is very simple (using Dekker's algorithm), but it requires changing the FP rounding mode, which you can't do in Java. For doubles it is more complicated, but still no BigDecimal.
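To illustrate why the float case is the easy one, here is a small sketch of a widened multiply-add in Java. It is not glibc's algorithm and not a correct fma replacement: the product of two floats is exact in a double (two 24-bit significands fit in 53 bits), but the sum is then rounded to double and again to float, and that double rounding can be off by one ulp in rare cases. Avoiding it is what needs control over the rounding mode. The class and method names are hypothetical.

```java
// Hypothetical illustration; approximates a float fma by widening to double.
final class WidenedFloatFma {
    private WidenedFloatFma() {}

    // Exact product in double, then two roundings (to double, then to float).
    // Usually matches Math.fma, but is NOT guaranteed to be correctly rounded.
    static float widenedMultiplyAdd(float a, float b, float c) {
        double exactProduct = (double) a * (double) b; // exact: 48 bits fit in 53
        return (float) (exactProduct + c);
    }

    // Plain float expression: the product is rounded to float before the add.
    static float plainMultiplyAdd(float a, float b, float c) {
        return a * b + c;
    }
}
```

For `a = 1 + 2^-13`, `b = 1 - 2^-13` and `c = -1`, the plain float expression returns `0.0f` because the product `1 - 2^-26` rounds to `1.0f`, while the widened version keeps the product exact and returns `-2^-26`, the same value as `Math.fma`.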

uschindler approved these changes Dec 14, 2022

rmuir merged commit 3ac71ad into apache:main Dec 17, 2022

rmuir mentioned this pull request May 18, 2023

Integrate the Incubating Panama Vector API #12311

Merged
