Indexing #655

mcabbott · 2022-08-02T14:43:38Z

This wants to add a rule for A[i,j,k] any AbstractArray.

The earlier rule was only for Array. I think the argument for that is that, ultimately, indexing of any linear algebra wrapper resolves to indexing the underlying array. But it resolves to scalar indexing, which I think will be quite inefficient for something like gradient(x -> sum(x[:,1]), transpose(rand(3,3)))[1]. And in practice that fails (with just the Array rule) as it creates & mutates an array to hold the parts.

The internal function _zerolike_writeat which was previously used for some other rules is re-named ∇getindex and simplified: ~~I am not sure why it needed dims~~ . It has rules to allow higher derivatives.

It also always makes a full dense array; we could consider adding something like Zygote.OneElement to be more efficient at scalar indexing. But once you add two of those you get an Array; perhaps InplaceableThunk is eventually going to be better?

src/rulesets/Base/indexing.jl

mcabbott · 2022-08-02T14:48:37Z

src/rulesets/Base/indexing.jl

+function rrule(::typeof(∇getindex), x, dy, inds...)
+    z = ∇getindex(x, dy, inds...)
+    function ∇getindex_pullback(dz)
+        d2y = getindex(unthunk(dz), inds...)


This second derivative function doesn't seem to infer well, can it be improved?

src/rulesets/Base/indexing.jl

oxinabox

Cool cool once these comments are addressed as you feel best
merge when happy

src/rulesets/Base/array.jl

src/rulesets/Base/indexing.jl

oxinabox · 2022-08-11T17:41:37Z

src/rulesets/Base/indexing.jl

+    ∇getindex!(dx, x, dy, plain_inds...)
+    return ProjectTo(x)(dx)  # since we have x, may as well do this inside, not in rules
+end
+


Suggested change

"""

_setindex_zero(x::AbstractArray, dy, inds...)

Basically this function is like `zero(x)` except that it ensure that

it is possible to set the value at index `inds` to `dy`.

It does this while preserving at least the outermost the structure of `x`.

Like `zero(x)`, it promises that `x == x + _setindex_zero(x, dy, inds...)` for all inputs; i.e. it always returns an additive identity.

"""

function _setindex_zero(x::AbstractArray, dy, inds...) end

I wonder if we shouldn't call it _setable_zero or something?

Maybe, I don't like any names. I guess it takes indices in a way that's like set/getindex functions.

src/rulesets/Base/indexing.jl

oxinabox · 2022-08-11T18:15:03Z

src/rulesets/Base/indexing.jl

+    return Base.unsafe_getindex(x, i), getindex(ẋ, i)
+end
+
+function rrule(cfg::RuleConfig{>:HasReverseMode}, ::typeof(Base.unsafe_getindex), x::AbstractVector, i::Integer)


This doesn't need any mode does it?

Suggested change

function rrule(cfg::RuleConfig{>:HasReverseMode}, ::typeof(Base.unsafe_getindex), x::AbstractVector, i::Integer)

function rrule(cfg::RuleConfig, ::typeof(Base.unsafe_getindex), x::AbstractVector, i::Integer)

Not sure why we aren't just calling the rrule for get infact?
Or putting a Union{typeof(getindex), typeof(Base.getindex)) in the function arg on them.
(might even be able to stick view in that union too?)

I guess I thought the first Zygote use of this might leave its own rule intact for scalar indexing, to make OneElement for that, and use CR for all others. In which case rrule_via_ad here will call that.

More generally it may want to do other more efficient things for indexing ranges. Or we may want to do that here, and remove this entirely.

test/rulesets/Base/indexing.jl

oxinabox · 2022-08-11T18:28:14Z

test/rulesets/Base/indexing.jl

+        @test unthunk(bk2(jl(ones(2,2)))[2]) == jl([0 1 1; 0 1 1])
+
+        y3, bk3 = rrule(getindex, x_23_gpu, 1, [1,1,2])  # slow path, copy to CPU
+        @test_skip Array(y3) == Array(x_gpu)[1, [1,1,2]]  # error in Pkg.test, no idea why


Can we reproduce and open an issue on Julia itself and link back here?

After fiddling a bit, here's a better version. These steps run in global scope, but fail inside the let block. FiniteDifferences is involved:

julia> let x_23_gpu = jl(rand(2, 3)) # using JLArrays, loaded for @gpu in test_helpers.jl # Scalar indexing, copied from: @macroexpand @allowscalar A[i] y1, bk1 = rrule(CFG, Base.task_local_storage, () -> x_23_gpu[1], :ScalarIndexing, ScalarAllowed) @test y1 == @allowscalar x_23_gpu[1] bk1(1.0) end ERROR: StackOverflowError: Stacktrace: [1] to_vec(x::JLArray{Float64, 2}) @ FiniteDifferences ~/.julia/packages/FiniteDifferences/VpgIT/src/to_vec.jl:73 [2] to_vec(x::Base.ReshapedArray{Float64, 1, JLArray{Float64, 2}, Tuple{}}) @ FiniteDifferences ~/.julia/packages/FiniteDifferences/VpgIT/src/to_vec.jl:84 --- the last 2 lines are repeated 39990 more times --- [79983] to_vec(x::JLArray{Float64, 2}) @ FiniteDifferences ~/.julia/packages/FiniteDifferences/VpgIT/src/to_vec.jl:73

At best FiniteDifferences can give zero here. But the parameter which needs tracking is embedded in the function () -> x_23_gpu[1] and I don't think it can unpack that.

mcabbott marked this pull request as draft August 2, 2022 14:44

mcabbott commented Aug 2, 2022

View reviewed changes

src/rulesets/Base/indexing.jl Outdated Show resolved Hide resolved

oxinabox reviewed Aug 2, 2022

View reviewed changes

src/rulesets/Base/indexing.jl Outdated Show resolved Hide resolved

mcabbott marked this pull request as ready for review August 3, 2022 03:01

mcabbott mentioned this pull request Aug 3, 2022

getindex for CuArray to fix repeated indices error FluxML/Zygote.jl#1131

Closed

mcabbott force-pushed the getindex branch from 98e87c3 to d8f0d15 Compare August 9, 2022 00:30

mcabbott added 7 commits August 10, 2022 09:39

move and rename _zerolike_writeat, NFC

09f98ca

simplify, use it for getindex, tests

e006af3

add unsafe_getindex too

94e90fb

tidy, make weird types work via _setindex_zero

0e1c2c5

fix view & its zero-arrays

c1ebf3f

test unsafe_getindex

9e1aa8c

handle indexing of GPU arrays

f49b118

mcabbott force-pushed the getindex branch from d8f0d15 to f49b118 Compare August 10, 2022 16:39

oxinabox approved these changes Aug 11, 2022

View reviewed changes

mcabbott mentioned this pull request Aug 11, 2022

Error with Diagonal JuliaDiff/ChainRulesTestUtils.jl#260

Open

mcabbott added 4 commits August 11, 2022 15:26

suggested changes

79fcbd9

restore some gpu tests

f11ce50

avoid the error

7d183f1

in fact, mystery errors persist

adab9b0

mcabbott merged commit 77ef0eb into JuliaDiff:main Aug 12, 2022

mcabbott deleted the getindex branch August 12, 2022 12:45

This was referenced Aug 17, 2022

Support getindex on non-numeric arrays JuliaDiff/Diffractor.jl#82

Closed

Fix ambiguity in _setindex_zero #669

Merged

ToucheSir mentioned this pull request Dec 21, 2022

Incorrect derivative of getindex() with repeating indices on CuArrays FluxML/Zygote.jl#821

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Indexing #655

Indexing #655

mcabbott commented Aug 2, 2022 •

edited

Loading

mcabbott Aug 2, 2022

oxinabox left a comment

oxinabox Aug 11, 2022

mcabbott Aug 11, 2022

oxinabox Aug 11, 2022

mcabbott Aug 11, 2022

oxinabox Aug 11, 2022

mcabbott Aug 12, 2022 •

edited

Loading

+"""
+    _setindex_zero(x::AbstractArray, dy, inds...)
+Basically this function is like `zero(x)` except that it ensure that
+it is possible to set the value at index `inds` to `dy`.
+It does this while preserving at least the outermost the structure of `x`.
+Like `zero(x)`, it promises that `x == x + _setindex_zero(x, dy, inds...)` for all inputs; i.e. it always returns an additive identity.
+"""
+function _setindex_zero(x::AbstractArray, dy, inds...) end

	function rrule(cfg::RuleConfig{>:HasReverseMode}, ::typeof(Base.unsafe_getindex), x::AbstractVector, i::Integer)
	function rrule(cfg::RuleConfig, ::typeof(Base.unsafe_getindex), x::AbstractVector, i::Integer)

Indexing #655

Indexing #655

Conversation

mcabbott commented Aug 2, 2022 • edited Loading

mcabbott Aug 2, 2022

Choose a reason for hiding this comment

oxinabox left a comment

Choose a reason for hiding this comment

oxinabox Aug 11, 2022

Choose a reason for hiding this comment

mcabbott Aug 11, 2022

Choose a reason for hiding this comment

oxinabox Aug 11, 2022

Choose a reason for hiding this comment

mcabbott Aug 11, 2022

Choose a reason for hiding this comment

oxinabox Aug 11, 2022

Choose a reason for hiding this comment

mcabbott Aug 12, 2022 • edited Loading

Choose a reason for hiding this comment

mcabbott commented Aug 2, 2022 •

edited

Loading

mcabbott Aug 12, 2022 •

edited

Loading