After investigating via #99023, we identified the vectorized implementations below, whose precision deviates from the tolerances specified for their scalar equivalents:

| Routine | Double $\epsilon$ | Single $\epsilon$ |
| --- | --- | --- |
| Scalar Baseline | 8.8817841970012523e-16 | 4.76837158e-07f |
| Cbrt | 1e-13 | N/A |
| CosPi | N/A | 1e-5 |
| Dot | 1e-14 | 1e-5 |
| Exp10 | 1e-13 | 1e-5 |
| Exp2 | 1e-14 | 1e-5 |
| Exp2M1 | 1e-14 | 1e-5 |
| Exp10M1 | 1e-13 | 1e-5 |
| Pow (scalar/span) | N/A | 1e-5 |
| Pow (span/span) | 1e-13 | 1e-5 |
| Pow (span/scalar) | 1e-13 | 1e-5 |
| RootN | 1e-13 | N/A |
| SinPi | 1e-13 | 1e-4 |
| Sum | 1e-14 | 1e-5 |
| SumOfMagnitudes | 1e-12 | 1e-6 |
| SumOfSquares | 1e-12 | 1e-6 |
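
To put the numbers above in context, here is a minimal Python sketch of how a loosened tolerance compares with the scalar baseline. It rests on two assumptions that are not stated in this issue: that the baseline values are 4 × machine epsilon (they match that arithmetic exactly for both types), and that results are compared by absolute difference. `within_tolerance` and `loosening_factor` are hypothetical helpers for illustration, not the test suite's actual API.

```python
import math
import sys

# Machine epsilon: the gap between 1.0 and the next representable value.
DOUBLE_EPS = sys.float_info.epsilon  # 2**-52
SINGLE_EPS = 2.0 ** -23

# Assumption: the "Scalar Baseline" row equals 4 * machine epsilon per type.
BASELINE_DOUBLE = 4 * DOUBLE_EPS  # 8.8817841970012523e-16
BASELINE_SINGLE = 4 * SINGLE_EPS  # 4.76837158e-07


def within_tolerance(expected, actual, tolerance):
    """Hypothetical helper: absolute-difference comparison (an assumption)."""
    # NaN only matches NaN.
    if math.isnan(expected) or math.isnan(actual):
        return math.isnan(expected) and math.isnan(actual)
    # Infinities must match exactly, including sign.
    if math.isinf(expected) or math.isinf(actual):
        return expected == actual
    return abs(expected - actual) <= tolerance


def loosening_factor(tolerance, baseline):
    """How many times wider a routine's tolerance is than the baseline."""
    return tolerance / baseline


if __name__ == "__main__":
    # A result that is off by about 5e-14 passes at 1e-13 but not at the baseline.
    actual = 1.0 + 5e-14
    print(within_tolerance(1.0, actual, 1e-13))
    print(within_tolerance(1.0, actual, BASELINE_DOUBLE))
    print(loosening_factor(1e-13, BASELINE_DOUBLE))
```

Under these assumptions, a double tolerance of 1e-13 is about 113 times wider than the scalar baseline, which gives a rough sense of how far the affected routines currently drift.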