Adding U-Net for diffusion model #33

Open
hongkai-dai opened this issue Apr 17, 2023 · 6 comments
hongkai-dai (Collaborator)

It seems that most diffusion papers use a U-Net (or a U-Net with FiLM conditioning for conditional inputs) instead of an MLP for the diffusion model. We can consider adding our own U-Net.
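
For concreteness, FiLM conditioning just predicts a per-channel scale and shift for intermediate features from the conditioning vector. A minimal PyTorch sketch of the idea (class and shape conventions are illustrative, not from this repo):

```python
import torch
import torch.nn as nn

class FiLM(nn.Module):
    """Feature-wise linear modulation: scale and shift feature maps
    using parameters predicted from a conditioning vector."""

    def __init__(self, cond_dim: int, num_channels: int):
        super().__init__()
        # Predict a per-channel scale (gamma) and shift (beta) from the condition.
        self.proj = nn.Linear(cond_dim, 2 * num_channels)

    def forward(self, x: torch.Tensor, cond: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, length); cond: (batch, cond_dim)
        gamma, beta = self.proj(cond).chunk(2, dim=-1)
        # Broadcast the (batch, channels) modulation over the length axis.
        return gamma.unsqueeze(-1) * x + beta.unsqueeze(-1)
```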

hongkai-dai self-assigned this Apr 17, 2023

hjsuh94 (Owner) commented Apr 17, 2023

My impression is that a U-Net will only be useful when we deal with images. For vector data, I'm not sure how much inductive bias it will provide.

But we should definitely add it for pixel-domain examples!

hongkai-dai (Collaborator, Author)

Sounds good! I was checking Janner's code and saw that they use a U-Net over sequences of state/action pairs: https://github.com/jannerm/diffuser/blob/main/diffuser/models/temporal.py.
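
Roughly, their temporal U-Net applies 1-D convolutional residual blocks along the trajectory axis, conditioned on the diffusion-time embedding. A simplified sketch of one such block (not their exact code; names and defaults are illustrative):

```python
import torch
import torch.nn as nn

class TemporalResBlock(nn.Module):
    """1-D conv residual block over the time axis, conditioned on a
    diffusion-time embedding (simplified from the style of jannerm/diffuser)."""

    def __init__(self, in_ch: int, out_ch: int, time_dim: int, kernel: int = 5):
        super().__init__()
        self.conv1 = nn.Conv1d(in_ch, out_ch, kernel, padding=kernel // 2)
        self.conv2 = nn.Conv1d(out_ch, out_ch, kernel, padding=kernel // 2)
        self.time_proj = nn.Linear(time_dim, out_ch)
        self.skip = nn.Conv1d(in_ch, out_ch, 1) if in_ch != out_ch else nn.Identity()
        self.act = nn.Mish()

    def forward(self, x: torch.Tensor, t_emb: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, horizon); t_emb: (batch, time_dim)
        h = self.act(self.conv1(x))
        # Inject the diffusion-time embedding, broadcast over the horizon.
        h = h + self.time_proj(t_emb).unsqueeze(-1)
        h = self.act(self.conv2(h))
        return h + self.skip(x)
```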

Do you mind if I add a preliminary U-Net implementation as an exercise? I am having trouble fitting a good score function with my MLP on the cart-pole example, so I am trying to debug what is happening. One candidate fix is to switch to a different network structure.

hjsuh94 (Owner) commented Apr 17, 2023

That sounds good to me! I think data stabilization is a good test to see if the score function was trained correctly.

I have also noticed that the score function is a bit fickle to train compared to standard regression.

hongkai-dai (Collaborator, Author)

Sorry, what do you mean by data stabilization? Currently I test the learned score function by running Langevin dynamics, zₜ₊₁ = zₜ + (ε/2)·s_θ(zₜ) + √ε·noise, and checking whether, after many Langevin steps (like 1000), z looks like a sample from the training data distribution. Is that what you mean?
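
In code, the test loop looks roughly like this (a minimal sketch; `score_fn`, the step size `eps`, and the step count are placeholders):

```python
import torch

def langevin_sample(score_fn, z0: torch.Tensor, eps: float = 1e-3,
                    num_steps: int = 1000) -> torch.Tensor:
    """Unadjusted Langevin dynamics:
    z_{t+1} = z_t + (eps / 2) * s_theta(z_t) + sqrt(eps) * noise."""
    z = z0.clone()
    for _ in range(num_steps):
        noise = torch.randn_like(z)
        z = z + 0.5 * eps * score_fn(z) + (eps ** 0.5) * noise
    # If the score is well trained, z should look like a draw from the data distribution.
    return z
```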

hjsuh94 (Owner) commented Apr 17, 2023

That's exactly right, although I've been simply using standard gradient descent!

hongkai-dai (Collaborator, Author)

Got it, thanks! I will try the version zₜ₊₁ = zₜ + (ε/2)·s_θ(zₜ) without the noise term; since s_θ approximates ∇ log p(z), I think that corresponds to the standard gradient descent (on −log p)?
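
In code, that would just drop the noise term from the loop above (same illustrative placeholders):

```python
def score_ascent(score_fn, z0, eps=1e-3, num_steps=1000):
    # Deterministic steps along the score: gradient ascent on the learned
    # log-density, i.e. gradient descent on -log p(z).
    z = z0.clone()
    for _ in range(num_steps):
        z = z + 0.5 * eps * score_fn(z)
    return z
```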
