Add "a" parameter to softplus() #83 #85
Conversation
I'm not sure how widely used this variant is (and whether there are other commonly used alternatives; the issue also mentions Liu and Furber 2016?). If it's added, we should make sure to test it and to also add support for it in the ChainRules, InverseFunctions, and ChangesOfVariables extensions.
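For reference, a rough sketch of what such tests might look like, assuming the PR adds two-argument methods `softplus(x, a)` and `invsoftplus(y, a)` with the parameterization `log1pexp(a*x)/a` (an assumption on my part; the final implementation may differ):

```julia
using Test
using LogExpFunctions

# Hypothetical test sketch; assumes softplus(x, a) == log1pexp(a*x)/a and that
# invsoftplus(y, a) is its inverse, as proposed in #83.
@testset "generalized softplus" begin
    for x in (-3.0, -0.5, 0.0, 0.5, 3.0), a in (0.5, 1.0, 2.0)
        @test softplus(x, a) ≈ log1pexp(a * x) / a
        @test invsoftplus(softplus(x, a), a) ≈ x
    end
    # the default parameter should recover the existing one-argument form
    @test softplus(1.2, 1) ≈ softplus(1.2)
end
```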
src/basicfuns.jl
@@ -165,9 +165,14 @@ Return `log(1+exp(x))` evaluated carefully for largish `x`.
This is also called the ["softplus"](https://en.wikipedia.org/wiki/Rectifier_(neural_networks))
transformation, being a smooth approximation to `max(0,x)`. Its inverse is [`logexpm1`](@ref).

The generalized `softplus` function (Wiemann et al., 2024) takes an additional optional parameter `a` that controls …
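For concreteness, this is roughly the shape of the function being discussed, assuming the Wiemann et al. (2024) parameterization `softplus(x, a) = log1pexp(a*x) / a` with inverse `logexpm1(a*y) / a` (my reading of the proposal, with placeholder names, not necessarily the exact code in this PR):

```julia
# Sketch only: generalized softplus and its inverse in terms of the existing
# numerically careful primitives log1pexp and logexpm1.
using LogExpFunctions: log1pexp, logexpm1

gen_softplus(x::Real, a::Real) = log1pexp(a * x) / a     # a = 1 recovers log1pexp(x)
gen_invsoftplus(y::Real, a::Real) = logexpm1(a * y) / a
```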
I assume there exist earlier references for this function?
I went through Liu and Furber to double-check.
From my understanding (ML is not my field), they validate "noisy softplus" as an improvement over other activation functions for neurons in NNs.
However, it seems like they named the `a` parameter sigma. Their plot looks similar but different in terms of values (?)

> I'm not sure how widely used this variant is

I share your concern here; I'm also careful not to add niche features to such a base package and increase the maintenance burden.
I can't say how commonly the generalized version is already used; its development seems fairly recent.
However, I can see its usefulness in quite a lot of cases: the default softplus only becomes close to the identity for x > 2, and from experience we often do model parameters smaller than that (typical sigmas in neuroscience/psychology are roughly between 0 and 1), so using adjusted softplus links would make sense in these contexts. I suppose it's a tradeoff between the complexity of the feature and its (potential) usage.
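To make that point concrete, a small illustration (using the same assumed parameterization `log1pexp(a*x)/a` and a placeholder name):

```julia
using LogExpFunctions: log1pexp

softplus_a(x, a) = log1pexp(a * x) / a  # assumed parameterization, placeholder name

softplus_a(0.5, 1)   # ≈ 0.974  -- far from the identity for a small parameter value
softplus_a(0.5, 10)  # ≈ 0.5007 -- a sharper link is already close to x
```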
Co-authored-by: David Widmann <[email protected]>
I think the remaining items here are:
- Include the new docstrings in the documentation
- Add tests for `softplus` and `invsoftplus`
- Add support for InverseFunctions for `softplus` and `invsoftplus` and test it (a sketch follows below)
- Add support for ChangesOfVariables for `softplus` and `invsoftplus` and test it

I think ChainRules support should not be needed since `log1pexp` and `log1mexp` are already supported, and we can expect AD to differentiate through the remaining parts of the functions.
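For the InverseFunctions item, a hedged sketch of what the extension could look like, assuming the parameter is curried with `Base.Fix2` (i.e. users write `Base.Fix2(softplus, a)`); the PR may of course choose a different mechanism:

```julia
# Hypothetical sketch of InverseFunctions support for the two-argument methods.
import InverseFunctions
using LogExpFunctions: softplus, invsoftplus

# Base.Fix2 stores the fixed second argument (the parameter a) in the field `x`.
InverseFunctions.inverse(f::Base.Fix2{typeof(softplus)}) = Base.Fix2(invsoftplus, f.x)
InverseFunctions.inverse(f::Base.Fix2{typeof(invsoftplus)}) = Base.Fix2(softplus, f.x)
```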
Can you clarify?
Kind bump
Sorry, I missed your previous comment.
Since this PR adds new functions, we should also add definitions of …
I am not sure how to specify the ChangesOfVariables one 🤔
Kind bump
Since there is no preexisting …
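Regarding how to specify the ChangesOfVariables one: a possible sketch, under the same `Base.Fix2` assumption and the assumed parameterization `softplus(x, a) = log1pexp(a*x)/a`, whose derivative is `logistic(a*x)`, so the log-abs-det-Jacobian is `-log1pexp(-a*x)`:

```julia
# Hypothetical sketch of ChangesOfVariables support for the parameterized softplus.
import ChangesOfVariables
using LogExpFunctions: softplus, log1pexp

function ChangesOfVariables.with_logabsdet_jacobian(f::Base.Fix2{typeof(softplus)}, x::Real)
    a = f.x
    # d/dx softplus(x, a) = logistic(a*x), so log|J| = log(logistic(a*x)) = -log1pexp(-a*x)
    return f(x), -log1pexp(-a * x)
end
```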
I had re-reviewed the PR but apparently forgotten to submit the review on GH.
LGTM
Following up on the issues related to an exp link function (TuringLang/Turing.jl#2310) reinforced the idea that a softplus link could actually be a good alternative. However, I feel like implementing its generalized version (#83) would be key (useful when modelling small parameters), so here is my shot at it.