-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding U-Net for diffusion model #33
Comments
My impression is that U-Net will only be useful when we deal with images. For vector data, I'm not sure how much it will provide inductive bias. But we should definitely add it for pixel-domain examples! |
Sounds good! I was checking Janner's code and saw that they use U-Net for their state/action pairs https://github.com/jannerm/diffuser/blob/main/diffuser/models/temporal.py. You mind if I add some preliminary implementation on U-Net as an exercise? I am having some problem to fit a good score function with my MLP on the cart-pole example, so I am trying to debug what is happening. One candidate is to switch to a different network structure. |
That sounds good to me! I think data stabilization is a good test to see if the score function was trained correctly. I have also noticed that the score function is a bit fickle to train compared to standard regression. |
Sorry what do you mean by data stabilization? Currently I test the learned score function by applying Lagenvin dynamics zₜ₊₁ = zₜ + ε/2*s_θ(zₜ)+√ε * noise, and see when I take many Langevin dynamics (like 1000 steps) does |
That's exactly right, although I've been simply using standard gradient descent! |
Got it, thanks! I will try the version zₜ₊₁ = zₜ + ε/2*s_θ(zₜ) without the noise term, I think that corresponds to the standard gradient descent? |
It seems that most diffusion papers use U-Net (or U-Net with FiLM structure for conditional input) instead of MLP for the diffusion model. We can consider adding our own U-Net.
The text was updated successfully, but these errors were encountered: