fix map! undefined value exposure by adienes · Pull Request #56673 · JuliaLang/julia

adienes · 2024-11-24T16:38:10Z

hopefully more targeted / less disruptive than #50647 , would (at least partially) fix #36235

may need extra documentation explaining that with this implementation, @inbounds map!(dest, As) will always iterate through the indices of As[1] regardless of the other sizes

tbh, the existing implementation is pretty fundamentally incorrect and does not match the behavior explicitly described in the docstring, so I think that this PR (or #50647 , or something else similar) should merge regardless of benchmarks impact. however, I would appreciate if someone could trigger a benchmark run just so we can see the impact :)

adienes · 2024-11-24T17:15:00Z

pretty sure build failures are unrelated

LilithHafner · 2024-11-27T00:12:59Z

This looks like a simple typo:

julia/base/abstractarray.jl

Line 3466 in a17db2b

    
           @boundscheck LinearIndices(dest) == idxs1 && all(x -> LinearIndices(x) == idxs1, As)

That's not how @boundscheck works. That won't throw even if the indices don't line up. Would it work to simply fix that typo or would that be too strict because the typo has been present for six years?

adienes · 2024-11-27T04:06:36Z

typo aside, it's checking the wrong thing. that's checking that literally all the indices are equal.

map is supposed to stop when any of the constituent arguments are exhausted --- this should be honored even in the presence of @inbounds ! I only made it skip this check in the presence of @inbounds in the PR because I figured there would be complaints if the added indices intersection impacted performance

but for a correct implementation to match the docstring, I think intersect(map(eachindex ∘ LinearIndices, As)...) --- or something else to that effect must be computed, @inbounds or no

this is one of those cases where Julia skipped directly to "make it fast" before "make it right" ☹️

LilithHafner

Thanks for addressing this significant issue. Much appreciated. A related issue that you might want to be aware of, but don't need to fix in this PR is #56696.

I agree that it is acceptable to take some performance regressions in exchange for fixing this bug. I have some refactoring suggestions to make it a bit simpler.

I'm planning to run benchmarks once we agree on an implementation, but if you want me to run them right away instead I'd be happy to do that.

LilithHafner · 2024-11-27T13:31:10Z

base/abstractarray.jl

+    idxs1 = eachindex(LinearIndices(As[1]))
+    @boundscheck begin
+        idxs1 = intersect(map(eachindex ∘ LinearIndices, As)...)
+        checkbounds(dest, idxs1)
+    end
    for i = idxs1


Suggested change

idxs1 = eachindex(LinearIndices(As[1]))

@boundscheck begin

idxs1 = intersect(map(eachindex ∘ LinearIndices, As)...)

checkbounds(dest, idxs1)

end

for i = idxs1

idxs = mapreduce(LinearIndices, intersect, As)

checkbounds(dest, inds)

for i = idxs

I don't see why we need eachindex is here

We should only include bounds-check elision if benchmarks reveal that it makes a difference. In this case, we would also need to add @propagate_inbounds to map! to make it take effect.

I think that mapreduce(LinearIndices, intersect, As) is more elegant than intersect(map(LinearIndices, As)...), unless there is some performance reason to use the latter.

If always compute the intersection of indices then inds is a better name than inds1

thank you for the comments!

the eachindex I added for the following reason

julia> mapreduce(LinearIndices, intersect, [A, B]) 3-element Vector{Int64}: 1 2 3 julia> mapreduce(eachindex ∘ LinearIndices, intersect, [A, B]) Base.OneTo(3)

so I figured that if we can hit fast-path intersection for ranges rather than collecting the indices we should. Another solution might be to add a fast intersect(::LinearIndices, ::LinearIndices) ?

the mapreduce formulation indeed looks nicer to me as well.

If always compute the intersection of indices then inds is a better name than inds1

Agreed! but I suppose we should decide if in the first place @inbounds should allow skipping the check and iterate through the indices of the first argument? I think this is a decision on semantics rather than implementation that I am not empowered to make. the docs should probably be updated if this is intended

the eachindex I added for the following reason

Good reason.

I suppose we should decide if in the first place @inbounds should allow skipping the check and iterate through the indices of the first argument?

@inbounds should never expose documented semantics that are otherwise not present. This is because you can't count on bounds checks being removed. For exmaple, Julia could be run with -check-bound=yes or the boundschecking might not be inlined into the callsite where @inbounds is present. In either case, the bounds checks will remain. Consequently I think the answer to that question is no: @inbounds should not change the semantics other than to skip bounds checks.

LilithHafner · 2024-11-27T13:36:01Z

test/abstractarray.jl

 @test_throws ArgumentError map!(-, [1])

+# Issue #30624
+@test map!(+, [0,0,0], [1,2], [10,20,30], [100]) == [111,0,0]


Would you also add a test for #36235 (comment), please?

…eted_inplace_map_idxs

adienes · 2024-11-28T03:44:55Z

added a bunch more tests (many failing or segfaulting on main)
removed all @inbounds until it's clear that these are needed for performance, we can re-add
I would like to abdicate responsibility for fixing OffsetArrays interactions :)

LilithHafner

Previously, map!(+, ones(1), ones(2, 2)) and map!(+, ones(1), ones(2, 2), ones(2, 2)) worked and this makes them throw (as the docstring indicates they should).

Performance is, at a glance, sometimes slightly better and sometimes slightly worse than it was on master. Correctness first, performance optimization second so I'm not too concerned.

I like that the implementation for map_n! is a bit simpler now and that it clearly aligns with the implementations of 1-arg and 2-arg map.

test/abstractarray.jl

LilithHafner · 2024-12-02T14:44:50Z

I would like to abdicate responsibility for fixing OffsetArrays interactions :)

Unfortunately we have to deal with them before merging because otherwise existing, valid, code that uses OffsetArrays will break. For example,

julia> R = fill(0, -4:-1);

julia> B = OffsetArray(reshape(1:12, 4, 3), -5, 6);

julia> minimum!(R, B)
4-element OffsetArray(::Vector{Int64}, -4:-1) with eltype Int64 with indices -4:-1:
 1
 2
 3
 4

This is a test failure from CI which fails because the implementation of minimum! (which lives in Base) for arrays with unconventional indexing relies on a bug that this PR fixes:

julia> x = fill(0, 1:3);

julia> y = fill(1, 2:4);

julia> map!(identity, x, y)
3-element OffsetArray(::Vector{Int64}, 1:3) with eltype Int64 with indices 1:3:
 1
 1
 1

@nanosoldier runtests()

adienes · 2024-12-02T15:42:06Z

I guess we'll have to iterate through destination idxs separately to fix the OffsetArrays issue? then dest will be treated just as a fully generic iterable / assignable collection.

function map!(f::F, dest::AbstractArray, A::AbstractArray, B::AbstractArray) where F
    inds = intersect(eachindex(LinearIndices(A)), eachindex(LinearIndices(B)))
    @boundscheck length(dest) >= length(inds)
    dest_idx = firstindex(dest)
    for i = inds
        dest[dest_idx] = f(A[i], B[i])
        dest_idx = nextind(dest, dest_idx)
    end
    return dest
end

I could be wrong but this seems more likely to hurt performance.

Would there be a meaningful difference here between iterating through firstindex, nextindex of dest directly vs through LinearIndices(dest) ?

nanosoldier · 2024-12-03T00:58:50Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

LilithHafner · 2024-12-03T14:12:38Z

Oh, wow! Nanosoldier looks clean. Rerunning failures to be sure:

@nanosoldier runtests(["ApproxLogFunction", "TestSetExtensions", "FunctionOperators", "RemoteFiles", "ChebyshevFiltering", "EinExprs", "Andes", "OpenTelemetryExporterPrometheus", "RegressionTables", "GLPK", "CDDLib", "OrdinaryDiffEqSSPRK", "MPIMeasurements", "ManifoldDiffEq", "OceanRobots", "DynamicMovementPrimitives", "ActiveInference", "Yunir", "ImmersedLayers", "FrequencySweep", "LowRankIntegrators", "NetworkJumpProcesses", "BoxCox"])

LilithHafner · 2024-12-03T14:19:04Z

I guess we'll have to iterate through destination idxs separately to fix the OffsetArrays issue?

Given how nice PkgEval is looking, another option would be to fix the minimum! implementation to not rely on the map! bug.

Would there be a meaningful difference here between iterating through firstindex, nextindex of dest directly vs through LinearIndices(dest) ?

I doubt it, but hopefully that won't be necessary. If we can implement it in a way that isn't practically breaking, I'd love to make map! take offset indices into account (like map) rather than iterating separately.

adienes · 2024-12-04T04:31:06Z

I'm finding all the indirection in reducedim.jl pretty inscrutable 😓

but I guess the "real" issue is in mapfirst! ?

function mapfirst!(f::F, R::AbstractArray, A::AbstractArray{<:Any,N}) where {N, F}
    lsiz = check_reducedims(R, A)
    t = _firstreducedslice(axes(R), axes(A))
    map!(f, R, view(A, t...))
end

in your example, we have

julia> t
(IdOffsetRange(values=-4:-1, indices=-4:-1), 7:7)

so then

julia> LinearIndices(view(B, t...)) |> collect
4×1 Matrix{Int64}:
 1
 2
 3
 4

on main, that final map! call will happily map indices [1 2 3 4] into [-4, -3, -2, -1] whereas this PR makes it (correctly?) throw a BoundsError

nanosoldier · 2024-12-04T10:42:15Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

adienes · 2024-12-04T13:30:04Z

actually, @mbauman 's tour de force #55318 with this PR rebased on top seems to solve the offset arrays case. so maybe the solution is rebase & wait to get more help on that one?

adienes · 2024-12-04T15:21:34Z

at the very least, this PR would swap "bad" bugs (segfaults, undefined behavior) for "regular" bugs (something throwing a BoundsError when it should not)

adienes · 2025-01-26T17:24:12Z

updates #30389. also breadcrumb to #46352

adienes · 2025-04-16T14:11:03Z

#old
map!(f, R, view(A, t...))

#new
map!(f, view(R, firstindex(R):lastindex(R)), view(A, t...))

in mapfirst! is the main update. it looks (and feels) a bit odd to assign into a view like this, but it seems to pass all the test cases now, existing, newly added, and offset-arrays alike.

since a more comprehensive refactor to dimensional reductions appears to be still a ways away, I'd suggest leniency for "ugly" bugfixes like this.

adienes · 2025-04-16T15:46:16Z

@nanosoldier runtests()

nanosoldier · 2025-04-17T13:17:29Z

The package evaluation job you requested has completed - possible new issues were detected.
The full report is available.

Report summary

❗ Packages that crashed

6 packages crashed on the previous version too.

✖ Packages that failed

23 packages failed only on the current version.

Illegal method overwrites during precompilation: 1 packages
Package has test failures: 3 packages
Package tests unexpectedly errored: 1 packages
Tests became inactive: 1 packages
Test duration exceeded the time limit: 17 packages

1536 packages failed on the previous version too.

✔ Packages that passed tests

27 packages passed tests only on the current version.

Other: 27 packages

5121 packages passed tests on the previous version too.

~ Packages that at least loaded

8 packages successfully loaded only on the current version.

Other: 8 packages

2855 packages successfully loaded on the previous version too.

➖ Packages that were skipped altogether

905 packages were skipped on the previous version too.

adienes · 2025-04-18T20:19:51Z

ok, I narrowed the scope. only map_n! is changed now with the implementation you suggested.

as you note, this would leave open the question of input validation when the docstring is violated, but at least it fixes the segfaults and uninitialized value exposure (which was my main goal)

…!` (#56673) (cherry picked from commit 0947114)

…!` (JuliaLang#56673)

…!` (#56673) (cherry picked from commit 0947114)

fix map! undefined value exposure

7717106

LilithHafner added the bugfix This change fixes an existing bug label Nov 27, 2024

LilithHafner reviewed Nov 27, 2024

View reviewed changes

adienes added 2 commits November 27, 2024 11:49

Merge branch 'master' of https://github.com/JuliaLang/julia into targ…

e6acfa9

…eted_inplace_map_idxs

apply code review

592d233

LilithHafner added minor change Marginal behavior change acceptable for a minor release needs pkgeval Tests for all registered packages should be run with this change labels Dec 2, 2024

LilithHafner reviewed Dec 2, 2024

View reviewed changes

test/abstractarray.jl Outdated Show resolved Hide resolved

adienes mentioned this pull request Dec 29, 2024

Inconsistency of matrix cov and cor in the presence of missing JuliaStats/Statistics.jl#183

Open

adienes added 4 commits April 13, 2025 16:56

Merge branch 'master' into targeted_inplace_map_idxs

980ed0b

Merge branch 'master' into targeted_inplace_map_idxs

b90d3e0

update for offsetarrays

c623093

Merge branch 'master' into targeted_inplace_map_idxs

4ddaf5d

mbauman removed the status: waiting for PR reviewer label Apr 18, 2025

apply code review

f9d5fa7

mbauman approved these changes Apr 18, 2025

View reviewed changes

mbauman removed the minor change Marginal behavior change acceptable for a minor release label Apr 18, 2025

LilithHafner merged commit 0947114 into JuliaLang:master Apr 19, 2025
5 of 8 checks passed

nsajko added the arrays [a, r, r, a, y, s] label Apr 19, 2025

mbauman added backport 1.10 Change should be backported to the 1.10 release backport 1.11 Change should be backported to release-1.11 backport 1.12 Change should be backported to release-1.12 labels Apr 19, 2025

KristofferC pushed a commit that referenced this pull request Apr 22, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

66a3cc9

…!` (#56673) (cherry picked from commit 0947114)

KristofferC mentioned this pull request Apr 22, 2025

Backports for 1.12.0-beta2 #58009

Merged

51 tasks

KristofferC removed the backport 1.12 Change should be backported to release-1.12 label Apr 25, 2025

KristofferC pushed a commit that referenced this pull request Apr 25, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

16832c1

…!` (#56673) (cherry picked from commit 0947114)

KristofferC mentioned this pull request Apr 25, 2025

Backports for julia 1.11.6 #58224

Merged

71 tasks

LebedevRI pushed a commit to LebedevRI/julia that referenced this pull request May 2, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

7c923f9

…!` (JuliaLang#56673)

akdienes mentioned this pull request May 12, 2025

WIP: a more principled take on dimensional reduction inits #55318

Closed

KristofferC pushed a commit that referenced this pull request Jun 4, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

d3c44fe

…!` (#56673) (cherry picked from commit 0947114)

KristofferC mentioned this pull request Jun 4, 2025

Backports for 1.10.10 #57715

Merged

75 tasks

KristofferC pushed a commit that referenced this pull request Jun 5, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

368516b

…!` (#56673) (cherry picked from commit 0947114)

KristofferC pushed a commit that referenced this pull request Jul 3, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

ccd4cfc

…!` (#56673) (cherry picked from commit 0947114)

KristofferC pushed a commit that referenced this pull request Aug 19, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

9aeee1b

…!` (#56673) (cherry picked from commit 0947114)

KristofferC pushed a commit that referenced this pull request Aug 19, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

5361f7c

…!` (#56673) (cherry picked from commit 0947114)

KristofferC removed the backport 1.11 Change should be backported to release-1.11 label Aug 19, 2025

KristofferC pushed a commit that referenced this pull request Aug 19, 2025

Switch from segfault to zip behavior for mismatched indices in `map…

7ac9a80

…!` (#56673) (cherry picked from commit 0947114)

DilumAluthge mentioned this pull request Sep 13, 2025

Backports for Julia 1.10.11 #58889

Merged

71 tasks

KristofferC mentioned this pull request Nov 20, 2025

set VERSION to 1.12.2 #60182

Merged

KristofferC mentioned this pull request Dec 12, 2025

release-1.12: Revert "build: More msys2 fixes (#59028)" #60374

Merged

KristofferC mentioned this pull request Jan 3, 2026

set VERSION to 1.12.4 #60539

Merged

github-actions bot removed the backport 1.10 Change should be backported to the 1.10 release label Jan 18, 2026

Uh oh!

Conversation

adienes commented Nov 24, 2024

Uh oh!

adienes commented Nov 24, 2024

Uh oh!

LilithHafner commented Nov 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adienes commented Nov 27, 2024

Uh oh!

LilithHafner left a comment

Choose a reason for hiding this comment

Uh oh!

LilithHafner Nov 27, 2024

Choose a reason for hiding this comment

Uh oh!

adienes Nov 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LilithHafner Nov 27, 2024

Choose a reason for hiding this comment

Uh oh!

LilithHafner Nov 27, 2024

Choose a reason for hiding this comment

Uh oh!

adienes commented Nov 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LilithHafner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

LilithHafner commented Dec 2, 2024

Uh oh!

adienes commented Dec 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nanosoldier commented Dec 3, 2024

Uh oh!

LilithHafner commented Dec 3, 2024

Uh oh!

LilithHafner commented Dec 3, 2024

Uh oh!

adienes commented Dec 4, 2024

Uh oh!

nanosoldier commented Dec 4, 2024

Uh oh!

adienes commented Dec 4, 2024

Uh oh!

adienes commented Dec 4, 2024

Uh oh!

adienes commented Jan 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adienes commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adienes commented Apr 16, 2025

Uh oh!

nanosoldier commented Apr 17, 2025

❗ Packages that crashed

✖ Packages that failed

✔ Packages that passed tests

~ Packages that at least loaded

➖ Packages that were skipped altogether

Uh oh!

adienes commented Apr 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

LilithHafner commented Nov 27, 2024 •

edited

Loading

adienes Nov 27, 2024 •

edited

Loading

adienes commented Nov 28, 2024 •

edited

Loading

adienes commented Dec 2, 2024 •

edited

Loading

adienes commented Jan 26, 2025 •

edited

Loading

adienes commented Apr 16, 2025 •

edited

Loading