Derivative of Residual Sum-of-Square: Where's the sum? #1294

Closed
CMoebus opened this issue Oct 4, 2024 · 13 comments

Comments

@CMoebus

CMoebus commented Oct 4, 2024

Hi, I am new to Symbolics. I want to derive the gradient of the residual sum of squares. My code is here:
using Symbolics

let
    #---------------------------------------------------------------------
    # residual sum of squares
    function mySumOfSquares(ys, xs, ϕ0, ϕ1)
        ysHat = ϕ0 .+ ϕ1 .* xs
        residuals = ys - ysHat
        mySoSq = sum(residual -> residual^2, residuals, init=0.0)
    end # function mySumOfSquares
    #---------------------------------------------------------------------
    @variables xss, yss, ϕ0s, ϕ1s
    Dϕ0 = Differential(ϕ0s)
    Dϕ1 = Differential(ϕ1s)
    println(simplify(expand_derivatives(Dϕ0(mySumOfSquares(yss, xss, ϕ0s, ϕ1s)))))
    println(simplify(expand_derivatives(Dϕ1(mySumOfSquares(yss, xss, ϕ0s, ϕ1s)))))
    #---------------------------------------------------------------------
end # let
The LaTeX for the hand-calculated gradient is here:
Now let's compute the gradient vector for a given set of parameters of the loss function $L$:

$L = \sum_{i=1}^N \left(y_i - \hat y_i\right)^2 = \sum_{i=1}^N \left(y_i - (\phi_0 + \phi_1 x_i)\right)^2 = \sum_{i=1}^N \left(y_i - \phi_0 - \phi_1 x_i\right)^2$


$\nabla L = \frac{\partial L}{\partial \boldsymbol{\phi}} =
\left[
\begin{array}{c}
\frac{\partial L}{\partial \phi_0} \\
\frac{\partial L}{\partial \phi_1}
\end{array}
\right] =
\left[
\begin{array}{c}
\sum_{i=1}^N \left(-2 y_i + 2\phi_0 + 2\phi_1 x_i\right) \\
\sum_{i=1}^N \left(-2 x_i y_i + 2\phi_0 x_i + 2\phi_1 x_i^2\right)
\end{array}
\right] =
\left[
\begin{array}{c}
-2 \sum_{i=1}^N \left(y_i - (\phi_0 + \phi_1 x_i)\right) \\
-2 \sum_{i=1}^N x_i \left(y_i - (\phi_0 + \phi_1 x_i)\right)
\end{array}
\right]$
The partial derivatives computed by Symbolics.jl are correct, but the sum is missing. What was my mistake?
All the best, Claus Möbus

@ChrisRackauckas
Member

The sum of a scalar variable is just the scalar.
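
(A minimal sketch of what that means for the code above, assuming nothing beyond the original post: with scalar @variables the broadcasts are no-ops, so the "residuals" are a single scalar and the sum collapses to one term.)

using Symbolics

@variables xss, yss, ϕ0s, ϕ1s               # scalar symbolic variables, as in the original post
ysHat = ϕ0s .+ ϕ1s .* xss                    # broadcasting over scalars is a no-op: still a scalar
residuals = yss - ysHat                      # a single scalar residual, not a vector
rss = sum(r -> r^2, residuals, init=0.0)     # iterating a scalar yields itself once, so rss == residuals^2
Dϕ0 = Differential(ϕ0s)
println(expand_derivatives(Dϕ0(rss)))        # derivative of one squared residual: correct, but no Σ over i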

@CMoebus
Author

CMoebus commented Oct 5, 2024

Sorry Chris, the gradient vector of the residual sum of squares of a simple univariate regression contains two elements, and both are sums (see here: https://uol.de/f/2/dept/informatik/ag/lks/download/Probabilistic_Programming/JULIA/Pluto.jl/Machine_Learning/UnderstandingDeepLearning/UDL_20240920_6_2_2_GradientDescent_II.html?v=1728143690, and Prince's book, eqs. 6.7 and 6.8, p. 80: https://github.com/udlbook/udlbook/releases/download/v4.0.4/Understanding_Deep_Learning.pdf).
All the best, Claus

@ChrisRackauckas
Member

@variables xss, yss, ϕ0s, ϕ1s: these are scalar variables. Did you mean to make any of them arrays?

@CMoebus
Author

CMoebus commented Oct 5, 2024 via email

@ChrisRackauckas
Member

Define n and then @variables xss[1:n], yss[1:n], ϕ0s, ϕ1s?

@CMoebus
Author

CMoebus commented Oct 6, 2024

Sorry, the error message is now: "Differentiation with array expressions is not yet supported". I uploaded the Julia/Pluto code here: https://uol.de/f/2/dept/informatik/ag/lks/download/Probabilistic_Programming/JULIA/Pluto.jl/Machine_Learning/UnderstandingDeepLearning/UDL_20240920_6_2_2_GradientDescent_II.html?v=1728143690

@CMoebus
Author

CMoebus commented Oct 9, 2024

I tried various code variants (https://uol.de/f/2/dept/informatik/ag/lks/download/Probabilistic_Programming/JULIA/Pluto.jl/Machine_Learning/UnderstandingDeepLearning/UDL_20240920_6_2_2_GradientDescent_II.html?v=1728406870), but my impression is that Symbolics.jl is presently unable to calculate derivatives when array expressions appear in the code. And that is always the case when you have statistical models at hand. Sorry for that. All the best. Claus

@ChrisRackauckas
Member

You'd have to scalarize it. Symbolics.scalarize(expr)
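
(A hedged sketch of what scalarizing might look like here, with an assumed small N; Symbolics.scalarize expands an array expression into its elementwise scalar terms, after which the gradient can be taken.)

using Symbolics

N = 3                                                    # assumed illustrative size
@variables x[1:N], y[1:N], ϕ0, ϕ1
residuals = Symbolics.scalarize(y .- (ϕ0 .+ ϕ1 .* x))    # Vector{Num} of N scalar residuals
rSS = sum(r -> r^2, residuals)                           # scalar residual sum of squares
grad_rSS = Symbolics.gradient(rSS, [ϕ0, ϕ1])             # two partials, each a sum of N terms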

@CMoebus
Author

CMoebus commented Oct 9, 2024 via email

@CMoebus
Author

CMoebus commented Oct 12, 2024


After some trial and error I came up with this code snippet, which generated a correct but very low-level answer that has to be abstracted by hand:
let
    @variables x[1:N], y[1:N], ϕ0, ϕ1
    rSS = sum((y[i] - (ϕ0 + ϕ1*x[i]))^2 for i in 1:N)
    grad_rSS = Symbolics.gradient(rSS, [ϕ0, ϕ1])
    simplify(grad_rSS)
end # let

But this abstraction should be provided by Symbolics.jl! How can I activate this process? Do you know an answer?

@ChrisRackauckas
Member

julia> using Symbolics

julia> N = 10
10

julia> @variables x[1:N], y[1:N], ϕ0, ϕ1
4-element Vector{Any}:
   x[1:10]
   y[1:10]
 ϕ0
 ϕ1

julia> rSS = sum((y[i]- (ϕ0 + ϕ1*x[i]))^2 for i in 1:N)
(y[1] - ϕ0 - x[1]*ϕ1)^2 + (y[10] - ϕ0 - x[10]*ϕ1)^2 + (y[2] - ϕ0 - x[2]*ϕ1)^2 + (y[3] - ϕ0 - x[3]*ϕ1)^2 + (y[4] - ϕ0 - x[4]*ϕ1)^2 + (y[5] - ϕ0 - x[5]*ϕ1)^2 + (y[6] - ϕ0 - x[6]*ϕ1)^2 + (y[7] - ϕ0 - x[7]*ϕ1)^2 + (y[8] - ϕ0 - x[8]*ϕ1)^2 + (y[9] - ϕ0 - x[9]*ϕ1)^2

julia> grad_rSS = Symbolics.gradient(rSS, [ϕ0, ϕ1])
2-element Vector{Num}:
                                                    -2(y[1] - ϕ0 - x[1]*ϕ1) - 2(y[10] - ϕ0 - x[10]*ϕ1) - 2(y[2] - ϕ0 - x[2]*ϕ1) - 2(y[3] - ϕ0 - x[3]*ϕ1) - 2(y[4] - ϕ0 - x[4]*ϕ1) - 2(y[5] - ϕ0 - x[5]*ϕ1) - 2(y[6] - ϕ0 - x[6]*ϕ1) - 2(y[7] - ϕ0 - x[7]*ϕ1) - 2(y[8] - ϕ0 - x[8]*ϕ1) - 2(y[9] - ϕ0 - x[9]*ϕ1)
 -2x[1]*(y[1] - ϕ0 - x[1]*ϕ1) - 2x[10]*(y[10] - ϕ0 - x[10]*ϕ1) - 2x[2]*(y[2] - ϕ0 - x[2]*ϕ1) - 2x[3]*(y[3] - ϕ0 - x[3]*ϕ1) - 2x[4]*(y[4] - ϕ0 - x[4]*ϕ1) - 2x[5]*(y[5] - ϕ0 - x[5]*ϕ1) - 2x[6]*(y[6] - ϕ0 - x[6]*ϕ1) - 2x[7]*(y[7] - ϕ0 - x[7]*ϕ1) - 2x[8]*(y[8] - ϕ0 - x[8]*ϕ1) - 2x[9]*(y[9] - ϕ0 - x[9]*ϕ1)

julia> simplify(grad_rSS)
2-element Vector{Num}:
                                                    -2(y[1] - ϕ0 - x[1]*ϕ1) - 2(y[10] - ϕ0 - x[10]*ϕ1) - 2(y[2] - ϕ0 - x[2]*ϕ1) - 2(y[3] - ϕ0 - x[3]*ϕ1) - 2(y[4] - ϕ0 - x[4]*ϕ1) - 2(y[5] - ϕ0 - x[5]*ϕ1) - 2(y[6] - ϕ0 - x[6]*ϕ1) - 2(y[7] - ϕ0 - x[7]*ϕ1) - 2(y[8] - ϕ0 - x[8]*ϕ1) - 2(y[9] - ϕ0 - x[9]*ϕ1)
 -2x[1]*(y[1] - ϕ0 - x[1]*ϕ1) - 2x[10]*(y[10] - ϕ0 - x[10]*ϕ1) - 2x[2]*(y[2] - ϕ0 - x[2]*ϕ1) - 2x[3]*(y[3] - ϕ0 - x[3]*ϕ1) - 2x[4]*(y[4] - ϕ0 - x[4]*ϕ1) - 2x[5]*(y[5] - ϕ0 - x[5]*ϕ1) - 2x[6]*(y[6] - ϕ0 - x[6]*ϕ1) - 2x[7]*(y[7] - ϕ0 - x[7]*ϕ1) - 2x[8]*(y[8] - ϕ0 - x[8]*ϕ1) - 2x[9]*(y[9] - ϕ0 - x[9]*ϕ1)

@CMoebus
Author

CMoebus commented Oct 13, 2024 via email

@ChrisRackauckas
Copy link
Member

I have no idea what you're saying.

CMoebus closed this as completed Oct 13, 2024