Substantial updates to tutorial 01_gaussian-mixture-model #439
Conversation
Looks good, I made a few comments 🙂
> Where we sum the components with `logsumexp` from the [`StatsFuns.jl` package](https://github.com/JuliaStats/StatsFuns.jl).
>
> The manually incremented likelihood can be added to the log-probability with `Turing.@addlogprob!`, giving us the following model:
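For context (the diff itself isn't quoted here), a sketch of the kind of model this refers to, assuming a two-component mixture with unit variance; names like `gmm_addlogprob` are illustrative, not the tutorial's exact code:

```julia
using Turing, Distributions, StatsFuns, LinearAlgebra

# Hypothetical reconstruction of the @addlogprob! variant under discussion:
# the marginalized mixture likelihood is computed manually with logsumexp
# and added to the model's log-probability.
@model function gmm_addlogprob(x)
    μ ~ MvNormal(zeros(2), I)
    w ~ Dirichlet(2, 1.0)
    dists = [Normal(μ[k], 1.0) for k in 1:2]
    for i in eachindex(x)
        lls = [log(w[k]) + logpdf(dists[k], x[i]) for k in 1:2]
        Turing.@addlogprob! logsumexp(lls)
    end
end
```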
IMO we should not recommend the use of `Turing.@addlogprob!`, since it's so easy to misuse and to get (silently) wrong results: it operates completely outside of the `~` logic in Turing/DynamicPPL. Instead, I think one should usually use `~` with a (possibly custom) distribution.
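A sketch of that alternative, assuming the same two-component setup as in the sketch above:

```julia
using Turing, Distributions, LinearAlgebra

# Same model, but the marginalized mixture likelihood goes through `~`,
# so Turing/DynamicPPL sees it as an ordinary observation statement.
@model function gmm_marginalized(x)
    μ ~ MvNormal(zeros(2), I)
    w ~ Dirichlet(2, 1.0)
    dists = [Normal(μ[k], 1.0) for k in 1:2]
    for i in eachindex(x)
        x[i] ~ MixtureModel(dists, w)  # replaces the manual @addlogprob! line
    end
end
```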
Sounds good! I initially wasn't going to include that section, for basically the reasons you bring up, but I ended up including it (even though I don't actually sample from that model) to motivate what's going on with the `MixtureModel` logpdf.

I can replace it with a custom distribution (although this might be a little long for a model that's really just exposition), or omit it entirely.
> Now, re-running our model, we can see that the assigned means are consistent across chains:

```julia
chains = sample(model, sampler, MCMCThreads(), nsamples, nchains; discard_initial = burn);
```
Maybe let's keep the tutorial simple and avoid surprising warnings in single-threaded environments:

```diff
- chains = sample(model, sampler, MCMCThreads(), nsamples, nchains; discard_initial = burn);
+ chains = sample(model, sampler, nsamples, nchains; discard_initial = burn);
```
Actually un-resolving this because I don't think it works? As it is right now, I'm not sure Turing allows multiple chains without specifying a type of parallelism.

The documentation, if it's current, seems to suggest I should do something like:

```julia
chains = mapreduce(c -> sample(model_fun, sampler, 1000), chainscat, 1:num_chains)
```

I'm not sure if that's worth it just to get rid of the warning; let me know what you think, though.
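For what it's worth, one alternative sketch, assuming the pinned Turing version supports AbstractMCMC's `MCMCSerial` ensemble algorithm (an assumption worth checking against the manifest):

```julia
# Assumes MCMCSerial is available in the pinned Turing/AbstractMCMC version:
# it samples nchains chains sequentially, with no threads or processes,
# so no multithreading warning should be emitted.
chains = sample(model, sampler, MCMCSerial(), nsamples, nchains; discard_initial = burn);
```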
```julia
# Sample a class assignment for observation xi, for fixed μ and w.
function sample_class(xi)
    lvec = [(logpdf(d, xi) + log(w[i])) for (i, d) in enumerate(dists)]
    return rand(Categorical(exp.(lvec .- logsumexp(lvec))))
end
```
This should be defined outside of the model and probably use `softmax` or `softmax!` directly.
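A sketch of what that might look like, assuming `softmax` from StatsFuns.jl and passing `dists` and `w` in explicitly (the signature is illustrative):

```julia
using Distributions, StatsFuns

# Standalone version, per the suggestion above: sample a class assignment
# for observation xi given component distributions `dists` and weights `w`.
function sample_class(xi, dists, w)
    lvec = [logpdf(d, xi) + log(w[i]) for (i, d) in enumerate(dists)]
    return rand(Categorical(softmax(lvec)))  # softmax normalizes the log-weights stably
end
```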
- fix sample call (Co-authored-by: David Widmann <[email protected]>)
- remove use of MCMCThread() (Co-authored-by: David Widmann <[email protected]>)
- Remove Bijectors import (Co-authored-by: David Widmann <[email protected]>)
I think maybe this should be closed and revisited when #441 is done? FWIW, the current thing that's keeping this frozen is the multithreading stuff. If we want to stay away from

```julia
chains = sample(model, sampler, MCMCThreads(), nsamples, nchains; discard_initial = burn);
```

to avoid warnings in single-threaded environments, we'll need to update a bunch of tutorials, because this pattern is pretty common across all of them.
Thanks, @JasonPekos, for the PR. Would you like to migrate your changes here to #441?

Yup, will do.
First PR! Hope everything is ok.

As discussed in the Slack, this PR adds the following significant changes to this tutorial:

- `ordered()` from `Bijectors.jl`, making the model identifiable (currently it is multimodal, and the seed is just lucky); see the sketch after this list
- `Turing.@addlogprob!`
- `~ MixtureModel(dists, weights)`
- `generated_quantities()`
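A minimal sketch of the `ordered()` change from the first bullet, assuming two components with unit variance (the exact model in the diff may differ):

```julia
using Turing, Bijectors, Distributions, LinearAlgebra

# Sketch of the identifiability fix: swapping the unconstrained prior for
# an ordered one forces μ[1] ≤ μ[2] and removes the label-switching mode.
@model function gmm_ordered(x)
    μ ~ ordered(MvNormal(zeros(2), I))  # was: μ ~ MvNormal(zeros(2), I)
    w ~ Dirichlet(2, 1.0)
    dists = [Normal(μ[k], 1.0) for k in 1:2]
    for i in eachindex(x)
        x[i] ~ MixtureModel(dists, w)
    end
end
```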
There are also a few minor changes: