Adjoints for Linear Solve #449

avik-pal · 2023-12-20T20:11:07Z

Fixes #198, Fixes #322

TODOs:

Preconditioning. Preconditioners need to be added to the adjoint sensitivity struct. Can the left preconditioner of the forward problem be reused, transposed, as the right preconditioner of the adjoint problem? Someone with a better grasp of linear algebra should chime in here.
Support the linsolve !== nothing case. This is useful when we know that $A^T$ has structure that a different solver can exploit.
Tests -- Take from Enzyme and ChainRules
Fix literal_getproperty for LinearSolution SciMLBase.jl#566
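
On the preconditioning question, a short derivation may help frame the discussion. This is a sketch under stated assumptions rather than a description of what this PR implements. The symbols are introduced here and do not appear in the PR: $P_l$ and $P_r$ are the left and right preconditioners of the forward problem, $\bar{x}$ is the incoming cotangent of the solution, and $\lambda$ is the adjoint variable.

```latex
\begin{aligned}
&\text{Forward: } A x = b, \qquad \text{preconditioned as } (P_l A P_r)\, y = P_l b, \quad x = P_r y. \\
&\text{Adjoint: } A^T \lambda = \bar{x}, \qquad \bar{b} = \lambda, \qquad \bar{A} = -\lambda x^T. \\
&\text{Since } (P_l A P_r)^T = P_r^T A^T P_l^T: \quad
 (P_r^T A^T P_l^T)\, \mu = P_r^T \bar{x}, \qquad \lambda = P_l^T \mu.
\end{aligned}
```

Under these assumptions, the transposed left preconditioner $P_l^T$ takes the right-preconditioner role in the adjoint system and $P_r^T$ takes the left role. Whether reusing them is effective for a given Krylov method is a separate question that this algebra does not settle.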

Example:

using LinearSolve, Zygote, BenchmarkTools  # BenchmarkTools provides @btime

A = rand(4, 4)
b = rand(4)

test_func_1(A, b) = sum(abs2, A \ b)

test_func_1(A, b)

∂A_1, ∂b_1 = @btime Zygote.gradient(test_func_1, copy(A), copy(b))
display(∂A_1)
display(∂b_1)

function test_func_2(A, b)
    prob = LinearProblem(A, b)
    sol = solve(prob)
    return sum(abs2, sol.u)
end

test_func_2(A, b)

∂A_2, ∂b_2 = @btime Zygote.gradient(test_func_2, copy(A), copy(b))
display(∂A_2)
display(∂b_2)
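
For reference, the gradients that both variants should agree on can be written by hand and checked against finite differences. The block below is a minimal sketch that uses only the `LinearAlgebra` standard library. The names `loss`, `fd_gradient`, `∂A_manual`, and `∂b_manual` are illustrative and are not part of LinearSolve.jl or this PR.

```julia
using LinearAlgebra

# Small, well-conditioned test system (values chosen for illustration only).
A = [4.0 1.0 0.0; 1.0 3.0 1.0; 0.0 1.0 2.0]
b = [1.0, 2.0, 3.0]

loss(A, b) = sum(abs2, A \ b)

# Hand-written adjoint of x = A \ b for the loss sum(abs2, x).
x = A \ b
x̄ = 2 .* x           # cotangent of sum(abs2, x) with respect to x
λ = A' \ x̄           # adjoint solve: A' * λ = x̄
∂b_manual = λ
∂A_manual = -λ * x'

# Central finite differences as an independent check (hypothetical helper).
function fd_gradient(f, v; ϵ = 1e-6)
    g = similar(v)
    for i in eachindex(v)
        vp = copy(v); vp[i] += ϵ
        vm = copy(v); vm[i] -= ϵ
        g[i] = (f(vp) - f(vm)) / (2ϵ)
    end
    return g
end

∂b_fd = fd_gradient(v -> loss(A, v), b)
∂A_fd = fd_gradient(M -> loss(M, b), A)
```

The manual and finite-difference gradients should match to within the finite-difference error, which gives a baseline to compare the `Zygote.gradient` results against.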

In the following case the cache stores the correct gradients but they are not propagated to A and b. @ChrisRackauckas any idea how to fix this?

cache = init(LinearProblem(copy(A), copy(b)), nothing);
function test_func_3(cache, A, b)
    cache.A = A
    cache.b = b
    sol = solve!(cache)
    return sum(abs2, sol.u)
end

test_func_3(cache, copy(A), copy(b))

∂cache, ∂A_3, ∂b_3 = @btime Zygote.gradient(test_func_3, cache, copy(A), copy(b))
∂cache.A
∂cache.b
display(∂A_3)  # nothing
display(∂b_3)  # nothing

codecov · 2023-12-20T20:18:04Z

Codecov Report

Attention: Patch coverage is 6.66667%, with 42 lines in your changes missing coverage. Please review.

Project coverage is 22.96%. Comparing base (a206054) to head (06c09a3).

❗ Current head 06c09a3 differs from pull request most recent head 7671369. Consider uploading reports for the commit 7671369 to get more accurate results

Files	Patch %	Lines
src/adjoint.jl	2.32%	42 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##             main     #449       +/-   ##
===========================================
- Coverage   66.12%   22.96%   -43.17%     
===========================================
  Files          27       28        +1     
  Lines        2146     2147        +1     
===========================================
- Hits         1419      493      -926     
- Misses        727     1654      +927


ChrisRackauckas · 2023-12-20T21:02:47Z

src/adjoint.jl

+    # Forward Solve
+    sol = solve!(cache, alg, args...; kwargs...)
+
+    function ∇solve!(∂sol)


Don't we technically have to deepcopy in here?

I guess so; it can be problematic if there are two subsequent solve calls on the cache.
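
The concern can be illustrated with a toy example. This is a sketch under assumptions: `ToyCache` and `toy_solve_with_pullback` are hypothetical stand-ins, not the LinearSolve.jl cache or the `∇solve!` closure from this PR. It shows only that a pullback holding the live cache sees later in-place changes, while one holding a deep copy does not.

```julia
using LinearAlgebra

# Hypothetical stand-in for a linear solve cache; not the LinearSolve.jl type.
mutable struct ToyCache
    A::Matrix{Float64}
    b::Vector{Float64}
end

# Forward solve plus a pullback closure. `snapshot` controls whether the
# closure holds a deep copy of the cache or the live, mutable cache.
function toy_solve_with_pullback(cache::ToyCache; snapshot::Bool)
    held = snapshot ? deepcopy(cache) : cache
    x = held.A \ held.b
    pullback(x̄) = held.A' \ x̄   # uses whatever `held.A` contains at call time
    return x, pullback
end

cache = ToyCache([2.0 0.0; 0.0 4.0], [1.0, 1.0])

_, pb_live = toy_solve_with_pullback(cache; snapshot = false)
_, pb_copy = toy_solve_with_pullback(cache; snapshot = true)

# A second solve reuses the cache and overwrites the matrix in place.
cache.A .= [1.0 0.0; 0.0 1.0]

x̄ = [1.0, 1.0]
λ_live = pb_live(x̄)   # computed with the overwritten A
λ_copy = pb_copy(x̄)   # computed with the A from the first solve
```

Here `λ_copy` is the adjoint solution for the first solve, whereas `λ_live` silently corresponds to the second matrix. That is the failure mode a `deepcopy` would guard against, at the cost of extra memory per solve.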

ChrisRackauckas · 2023-12-20T21:07:51Z

> In the following case the cache stores the correct gradients but they are not propagated to A and b. @ChrisRackauckas any idea how to fix this?

Is this not just an inherent limitation of Zygote with mutation? I would presume we just need to stay away from that and only support solve with ChainRulesCore (CRC).

avik-pal added 2 commits December 20, 2023 09:25

Setup to handle adjoints

7e61692

Finish part of the implementation

06c09a3

avik-pal mentioned this pull request Dec 20, 2023

Fix literal_getproperty for LinearSolution SciML/SciMLBase.jl#566

Merged

ChrisRackauckas reviewed Dec 20, 2023

Merge branch 'main' of github.com:SciML/LinearSolve.jl into ap/adjoint

7671369

avik-pal force-pushed the ap/adjoint branch from dc8d7e6 to 7671369 Compare February 24, 2024 19:50

avik-pal added 3 commits February 24, 2024 15:48

Allow special solver for adjoint

c153903

Add compat entries

34995f6

Fix HYPRE

7c1f1b2

avik-pal force-pushed the ap/adjoint branch 2 times, most recently from 4198c86 to 2493dca Compare February 24, 2024 21:42

More tests and some safety

6432716

avik-pal force-pushed the ap/adjoint branch from 2493dca to 6432716 Compare February 24, 2024 21:44

ChrisRackauckas approved these changes Feb 25, 2024

Up min SciMLBase compat

e937e67

avik-pal force-pushed the ap/adjoint branch from 360bc11 to e937e67 Compare February 25, 2024 15:38

ChrisRackauckas merged commit 7b090b4 into main Feb 25, 2024
10 of 16 checks passed

ChrisRackauckas deleted the ap/adjoint branch February 25, 2024 17:26

ChrisRackauckas mentioned this pull request Feb 25, 2024

Using preconditioners in the adjoints of a linear solve #476

Open
