Correctly handling the case λmax = 0. #53

barankarakus · 2020-09-06T20:40:37Z

Fixes #51.

Two changes:

Change to computeλ to ensure λmax = 0 leads to an output of [0] and
not [NaN, ..., NaN].
Change to fit! to ensure the case where autoλ = true and λmax = 0 is
handled correctly (rather than throwing an error).

Two changes: 1) Change to computeλ to ensure λmax = 0 leads to an output of [0] and not [NaN, ..., NaN]. 2) Change to fit! to ensure the case where autoλ = true and λmax = 0 is handled correctly (rather than throwing an error).

coveralls · 2020-09-06T20:58:59Z

Pull Request Test Coverage Report for Build 202

19 of 19 (100.0%) changed or added relevant lines in 2 files are covered.
7 unchanged lines in 3 files lost coverage.
Overall coverage increased (+6.6%) to 91.049%

Files with Coverage Reduction	New Missed Lines	%
src/segselect.jl	1	89.23%
src/coordinate_descent.jl	3	93.63%
src/Lasso.jl	3	88.05%

Totals
Change from base Build 198:	6.6%
Covered Lines:	885
Relevant Lines:	972

💛 - Coveralls

AsafManela

Thanks for this change.
It seems like the test case is basically one where there is no variation in y.
Do you think you could add a test for this case?

src/Lasso.jl

Changing spelling of 'regularisation'.

barankarakus · 2020-09-20T22:06:13Z

Added tests (and some more minor changes). Let me know if anything else needs done!

AsafManela · 2020-09-22T22:18:23Z

src/Lasso.jl

@@ -209,6 +209,10 @@ const MAX_DEV_FRAC = 0.999
 # Compute automatic λ values based on λmax and λminratio
 function computeλ(λmax, λminratio, α, nλ)
    λmax /= α
+    if isapprox(λmax, 0; atol=1e-10)  # then assuming λmax = 0


This is tricky because I think lambda is not unitless, so if it is small or not depends on the data given.
How does glmnet in R (or GLMNet.jl) handle this case?

The reason I've changed the equality check to an isapprox() check is due to floating point arithmetic leading to a lambdamax that should actually be zero being very close to zero but non-zero instead. Simple example when this happens is a design matrix X with entries sampled from U[0, 1] and y a non-zero vector with identical entries.

I agree, the data could be such that lambdamax is genuinely very small but non-zero.

That said, I think it would be very rare to encounter such data in practice... especially since lambdamax (for the linear model) scales linearly with X and y, and we tend to standardise these.

I see two approaches going forward:

Keep this check as is - the case where it would fail to produce correct output basically never occurs anyway.

Revert back to the equality check. The real case in which the package failed was the case where lambdamax was exactly zero, anyway. Moreover, even if lambdamax should be zero but instead is a very small number, there is no major problem: the solver works very fast and it is clear from the output that every value of lambda yields zero active coefficients.

I'll leave it to you to decide 😃.

Additionally: I'm not sure how glmnet in R or Julia handles this.

AsafManela · 2020-09-22T22:19:21Z

test/lasso.jl

+    return true
+end
+
+@test zero_variation_test() == true


Maybe use @test_log instead?

Also, any idea why the tests stopped working in julia v1.0?

Maybe use @test_log instead?

I agree. Will implement tomorrow.

Also, any idea why the tests stopped working in julia v1.0?

Unfortunately nope!

Correctly handling the case λmax = 0.

3ccd5f0

Two changes: 1) Change to computeλ to ensure λmax = 0 leads to an output of [0] and not [NaN, ..., NaN]. 2) Change to fit! to ensure the case where autoλ = true and λmax = 0 is handled correctly (rather than throwing an error).

barankarakus changed the title ~~Correctly handling the case λmax = 0.~~ Correctly handling the case λmax = 0; fixes #51 Sep 6, 2020

barankarakus changed the title ~~Correctly handling the case λmax = 0; fixes #51~~ Correctly handling the case λmax = 0. Sep 6, 2020

AsafManela requested changes Sep 20, 2020

View reviewed changes

src/Lasso.jl Outdated Show resolved Hide resolved

barankarakus and others added 3 commits September 20, 2020 21:44

Update src/Lasso.jl

f4a3923

Changing spelling of 'regularisation'.

Replacing equality with approximate equality.

5636fed

Added test for case: zero variation in y.

5ac04f7

AsafManela requested changes Sep 22, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correctly handling the case λmax = 0. #53

Correctly handling the case λmax = 0. #53

barankarakus commented Sep 6, 2020 •

edited

Loading

coveralls commented Sep 6, 2020 •

edited

Loading

AsafManela left a comment

barankarakus commented Sep 20, 2020

AsafManela Sep 22, 2020

barankarakus Sep 22, 2020

barankarakus Sep 22, 2020

AsafManela Sep 22, 2020

AsafManela Sep 22, 2020

barankarakus Sep 22, 2020

barankarakus Sep 22, 2020

Correctly handling the case λmax = 0. #53

Are you sure you want to change the base?

Correctly handling the case λmax = 0. #53

Conversation

barankarakus commented Sep 6, 2020 • edited Loading

coveralls commented Sep 6, 2020 • edited Loading

Pull Request Test Coverage Report for Build 202

💛 - Coveralls

AsafManela left a comment

Choose a reason for hiding this comment

barankarakus commented Sep 20, 2020

AsafManela Sep 22, 2020

Choose a reason for hiding this comment

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

AsafManela Sep 22, 2020

Choose a reason for hiding this comment

AsafManela Sep 22, 2020

Choose a reason for hiding this comment

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

barankarakus Sep 22, 2020

Choose a reason for hiding this comment

barankarakus commented Sep 6, 2020 •

edited

Loading

coveralls commented Sep 6, 2020 •

edited

Loading