ARROW-12074: [C++][Compute] Add scalar arithmetic kernels for decimal #10364

cyb70289 · 2021-05-20T09:30:40Z

Add basic binary arithmetic (+,-,*,/) kernels for decimal types.

github-actions · 2021-05-20T09:31:00Z

https://issues.apache.org/jira/browse/ARROW-12074

bkietz

Thanks for working on this!

Please extract the decimal upscaling from the addition kernel into an implicit cast. This will simplify the addition kernel to stateless addition (IIUC) and give callers control over how to handle upscaling. See CommonNumeric() for existing promotion behavior; this should give you an idea of how to specify the upscaling as an implicit cast.

cpp/src/arrow/compute/kernels/test_util.h

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

cyb70289 · 2021-06-01T05:03:12Z

Please extract the decimal upscaling from the addition kernel into an implicit cast. This will simplify the addition kernel to stateless addition (IIUC) and give callers control over how to handle upscaling.

Thanks @bkietz , casting args (rescaling decimal inputs) implicitly before exec looks much better than my current implementation. Will try the approach.

cyb70289 · 2021-06-02T04:22:36Z

@bkietz , met with one problem, would like to hear your comments. Thanks.

Decimal upscaling is operation dependent. E.g., +,- will upscale arg with smaller scale to align digit, * needn't scaling, / is more complicated.

Implicit args casting happens before kernel is created. DispatchBest only knows arg types, no operation type (kernel dependent) is available. So we cannot figure out the "to be casted" arg type (new precision, scale).
https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/function.cc#L175

Maybe add another callback kernel->explicit_cast() and call it after or inside DispatchBest? Or create different ScalarFunction struct (and DispatchBest) for each decimal operation?

bkietz · 2021-06-02T19:35:11Z

DispatchBest is aware of the operation; it has access to the function's name. You could write a CommonDecimal() function which returns differing scales/precisions for "add"/"subtract", "divide", and "multiply":

if (auto type = CommonNumeric(*values)) {
  ReplaceTypes(type, values);
} else if (auto type = CommonDecimal(name_, *values)) {
  ReplaceTypes(type, values);
}

cyb70289 · 2021-06-04T05:15:32Z

Thanks @bkietz , it's almost done except one last catch.

As the output type (precision, scale) is dependent on the inputs, I have a resolver object to calculate output type. The resolver is called with the casted input type, not the original type. It causes problem to division, as the output precision and scale should be calculated from original inputs. No trouble for add/subtract as the output precision/scale is the same for original and casted inputs (digit aligned). Multiply doesn't need cast.

Does it make sense to pass both original input types and the casted types to the resolver [1][2]? We will have to update all existing custom resolver codes.
Maybe add a kernel flag to select passing original or casted input types to the resolver?
Or there are better ways to handle this?

[1] https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/function.cc#L196
[2] https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/exec.cc#L495

EDIT: Pushed latest code to ease reviewing. Unit test fails due to this issue.

docs/source/cpp/compute.rst

cpp/src/arrow/compute/kernels/codegen_internal.h

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc

bkietz · 2021-06-04T13:33:31Z

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

Please open a follow up JIRA to make this public so it can be reused (for example in the comparison functions)

bkietz · 2021-06-04T13:36:50Z

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

I think this could be the default case; it's what would be used for any comparison kernel, for example

Additionally, I think this could be defined for the varargs case (so it could be used in elementwise_min/elementwise_max/...)

Not done yet. Will follow up.

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc

cyb70289 · 2021-06-15T13:42:28Z

Rebased

bkietz

This is looking great, thanks again! Just a few more items

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc

bkietz · 2021-06-15T15:20:22Z

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

Please ensure we're not adding decimal kernels for functions which don't support decimals yet:

Suggested change

AddDecimalBinaryKernels<Op>(name, &func);

// TODO($FOLLOW_UP_JIRA) remove when decimal is supported for all arithmetic kernels

if (name == "add" || name == "add_checked" || ...) {

AddDecimalBinaryKernels<Op>(name, &func);

}

Please open a follow up JIRA for supporting decimals in the other arithmetic functions as well

I moved AddDecimalBinaryKernels calls out of MakeArithmeticFunction and MakeArithmeticFunctionNotNull, and put it under each kernel supporting decimal operations.
https://github.com/apache/arrow/pull/10364/files#diff-3eafd7246f6a8c699f10d46e3276852fe44b6853b5517ef10396e561730c09f4R840

I'm afraid some arithmetic kernels (e.g., power) may not support decimals?

Thanks, that will work

I'm afraid some arithmetic kernels (e.g., power) may not support decimals?

This is one topic which the follow up JIRA would give us a better place to discuss

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc

bkietz

Will merge on green CI.

Thanks for doing this!

Add basic binary arithmetic (+,-,*,/) kernels for decimal types. Closes apache#10364 from cyb70289/decimal-arith Authored-by: Yibo Cai <yibo.cai@arm.com> Signed-off-by: Benjamin Kietzman <bengilgit@gmail.com>

github-actions bot added the Component: C++ label May 20, 2021

cyb70289 marked this pull request as draft May 20, 2021 09:31

cyb70289 marked this pull request as ready for review May 27, 2021 10:29

bkietz self-requested a review May 27, 2021 10:51

bkietz requested changes May 28, 2021

View reviewed changes

cpp/src/arrow/compute/kernels/test_util.h Outdated Show resolved Hide resolved

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc Outdated Show resolved Hide resolved

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc Outdated Show resolved Hide resolved

bkietz requested changes Jun 4, 2021

View reviewed changes

cyb70289 commented Jun 10, 2021

View reviewed changes

cpp/src/arrow/compute/kernels/scalar_arithmetic.cc Outdated Show resolved Hide resolved

cyb70289 commented Jun 10, 2021

View reviewed changes

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc Outdated Show resolved Hide resolved

cyb70289 commented Jun 10, 2021

View reviewed changes

cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc Outdated Show resolved Hide resolved

cyb70289 requested a review from bkietz June 11, 2021 01:46

bkietz requested changes Jun 15, 2021

View reviewed changes

cyb70289 added 4 commits June 18, 2021 05:54

ARROW-12074: [C++][Compute] Add scalar arithmetic kernels for decimal

76cfec0

address review comments

ae3ef78

address review comments #2

9fd0cca

add only supported decimal kernels

dd38397

bkietz approved these changes Jun 18, 2021

View reviewed changes

bkietz closed this in 4743e18 Jun 18, 2021

cyb70289 deleted the decimal-arith branch June 19, 2021 00:51

asfimport mentioned this pull request Aug 4, 2021

[C++][Compute] Add scalar arithmetic kernels for decimal inputs #27899

Closed

-  AddDecimalBinaryKernels<Op>(name, &func);
+  // TODO($FOLLOW_UP_JIRA) remove when decimal is supported for all arithmetic kernels
+  if (name == "add" || name == "add_checked" || ...) {
+    AddDecimalBinaryKernels<Op>(name, &func);
+  }

ARROW-12074: [C++][Compute] Add scalar arithmetic kernels for decimal #10364

ARROW-12074: [C++][Compute] Add scalar arithmetic kernels for decimal #10364

Uh oh!

Conversation

cyb70289 commented May 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented May 20, 2021

Uh oh!

bkietz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cyb70289 commented Jun 1, 2021

Uh oh!

cyb70289 commented Jun 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bkietz commented Jun 2, 2021

Uh oh!

cyb70289 commented Jun 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cyb70289 commented Jun 15, 2021

Uh oh!

bkietz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

bkietz left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cyb70289 commented May 20, 2021 •

edited

Loading

cyb70289 commented Jun 2, 2021 •

edited

Loading

cyb70289 commented Jun 4, 2021 •

edited

Loading