Implement `mapreduce_impl` for `IndexCartesian`. by N5N3 · Pull Request #45651 · JuliaLang/julia

N5N3 · 2022-06-12T03:05:42Z

On master, `mapreduce` calls `mapfoldl` for inputs with `IndexCartesian` style, which hurts performance when the first dimension is long enough for vectorization.

This PR implements a pairwise `mapreduce_impl` for better performance by cutting the highest splittable dimension in half.
(We can't reorder the inputs, and `CartesianPartition` has much higher overhead.)
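
To illustrate the idea, here is a minimal sketch of a pairwise reduction that halves the highest dimension with more than one index. It is not the code in this PR: `pairwise_cartesian`, `_pairwise`, and the `blksize` keyword are hypothetical names chosen for illustration, and the base case simply uses `mapfoldl` over a `CartesianIndices` block.

```julia
# Illustrative sketch only; names and the blksize default are assumptions,
# not the implementation proposed in this PR.
function pairwise_cartesian(f, op, A::AbstractArray; blksize::Int=1024)
    blksize >= 1 || throw(ArgumentError("blksize must be at least 1"))
    isempty(A) && throw(ArgumentError("this sketch does not handle empty inputs"))
    # Describe the whole array as one block: a tuple of index ranges, one per dimension.
    ranges = map(r -> first(r):last(r), axes(A))
    return _pairwise(f, op, A, ranges, blksize)
end

function _pairwise(f, op, A, ranges::NTuple{N,UnitRange{Int}}, blksize::Int) where {N}
    inds = CartesianIndices(ranges)
    if length(inds) <= blksize
        # Small block: left-to-right fold in column-major order, so the
        # first dimension stays in the innermost loop.
        return mapfoldl(I -> f(A[I]), op, inds)
    end
    # Find the highest dimension that can still be split.
    # One must exist because length(inds) > blksize >= 1.
    d = N
    while length(ranges[d]) == 1
        d -= 1
    end
    r = ranges[d]
    mid = (first(r) + last(r)) >> 1
    lo = ntuple(i -> i == d ? (first(r):mid) : ranges[i], Val(N))
    hi = ntuple(i -> i == d ? ((mid + 1):last(r)) : ranges[i], Val(N))
    # Combine the two halves in their original order; elements are never reordered.
    return op(_pairwise(f, op, A, lo, blksize), _pairwise(f, op, A, hi, blksize))
end
```

Splitting the last dimension first keeps each half contiguous along the leading dimensions, which is where the inner loop benefits from vectorization. A call such as `pairwise_cartesian(identity, +, view(a, axes(a)...))` would exercise the sketch on the kind of input benchmarked here.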

Test added.

A simple benchmark

julia> using BenchmarkTools

julia> a = randn(100, 100);

julia> slow(a) = view(a, axes(a)...)
slow (generic function with 1 method)

julia> @btime reduce(+, slow($a))
  1.870 μs (0 allocations: 0 bytes)
88.67608098492965

julia> @btime foldl(+, slow($a))
  9.700 μs (0 allocations: 0 bytes)
88.67608098492977

N5N3 added 2 commits June 12, 2022 10:51

Implement mapreduce_impl for IndexCartesian

11e1c52

invalid test fix.

7003234

N5N3 added the performance Must go faster label Jun 12, 2022

adienes added the fold sum, maximum, reduce, foldl, etc. label May 5, 2025

mbauman mentioned this pull request May 14, 2025

WIP: The great pairwise reduction refactor #58418

Draft

11 tasks

N5N3 wants to merge 2 commits into JuliaLang:master from N5N3:CartReduce
3 participants