Losses for conditional diffusion models #15

Open
vinayak-sharan opened this issue May 20, 2024 · 6 comments

@vinayak-sharan

Hey, thanks for the videos and code. I am experimenting with conditional LDMs.

Do you happen to have loss plots or logs of the loss? I have a feeling that the loss is decreasing really slowly or not decreasing at all.
Could you let me know whether you saw a similar loss curve? Here is a screenshot for reference.

Screenshot 2024-05-20 at 13 54 27
@explainingai-code
Owner

explainingai-code commented May 21, 2024

Hi @vinayak-sharan,
Could you tell me which dataset you are training on and what conditioning (text/mask/class) you are using?

@vinayak-sharan
Author

I am training on CelebHQ with both masks and text as conditions.

@explainingai-code
Owner

I don't have any logs, but for this case (CelebHQ conditioned on masks and text) you should get decent generation output by 50 epochs.
By any chance, have you generated samples from the currently trained model?
Also, how was the autoencoder output? Were you able to train the autoencoder part so that the reconstructions are decent enough?

@vinayak-sharan
Author

Hey Tushar, I trained the LDM for 200 epochs and plotted the loss. The VQ-VAE samples are quite good, but the LDM samples are not what I expected :D

Looking at the loss over the epochs, I noticed that it starts increasing again after about 100 epochs. That surprised me: since this is the training loss, it should keep decreasing (overfitting, if anything).

epoch_199

VQ-VAE samples:
current_autoencoder_sample_781

LDM samples:
x0_0
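
For reference, a rough sketch of how a per-epoch mean training loss like this could be logged and plotted; the training step itself is only indicated in comments, and none of the names here are the repo's actual variables:

```python
# Hypothetical sketch: collect per-step losses, average them per epoch,
# and plot the result so slow decreases or late increases stand out.
import matplotlib.pyplot as plt

num_epochs = 200
epoch_losses = []
for epoch in range(num_epochs):
    step_losses = []
    # for batch in dataloader:                  # the usual LDM training step goes here:
    #     loss = mse(noise_pred, target_noise)  #   noise-prediction MSE
    #     loss.backward(); optimizer.step(); optimizer.zero_grad()
    #     step_losses.append(loss.item())
    epoch_losses.append(sum(step_losses) / max(len(step_losses), 1))

plt.plot(epoch_losses)
plt.xlabel('epoch')
plt.ylabel('mean train loss')
plt.savefig('train_loss.png')
```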

@vinayak-sharan
Author

Here are the checkpoints in case you are interested: https://drive.google.com/drive/folders/1N2lRCFKz-fshPs3hzIV7ym_gs9kkYmTT?usp=sharing

@explainingai-code
Owner

I was never able to train for more than 100 epochs (because of compute limitations), but I think the increase in loss should be reduced by adding learning-rate decay, so maybe try that (e.g. with a scheduler along the lines of the sketch below).
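
A minimal sketch of what that could look like in PyTorch, assuming a standard Adam-based training loop; the model here is just a stand-in, and StepLR is only one of several reasonable schedulers:

```python
# Sketch: step-based learning-rate decay wrapped around an existing training loop.
import torch

model = torch.nn.Linear(8, 8)  # placeholder for the actual LDM UNet
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
# Halve the learning rate every 50 epochs; CosineAnnealingLR is another common choice.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=50, gamma=0.5)

for epoch in range(200):
    # ... run the usual training epoch here (forward, loss, backward, optimizer.step()) ...
    scheduler.step()  # decay the learning rate once per epoch
    print(epoch, scheduler.get_last_lr())
```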
But more importantly, the overall loss decrease is very small, and with mask conditioning I was able to get higher-quality outputs with fewer epochs of training (at least for common poses like the first and last images of your samples).
Could you train a model with just mask conditioning, and during training double-check in the data loader that each mask is indeed the correct one, by comparing the input image and its mask (see the sketch below)?
Also, if you made any modifications to the code/config, could you share those as well?
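
For the image/mask check, something along these lines could work; `dataloader` is assumed to yield `(image, mask)` batches with one-hot mask channels, so the unpacking would need to be adapted to however the repo's CelebHQ dataset actually returns its conditioning:

```python
# Hypothetical sanity check: dump a few image/mask pairs from the data loader
# to disk and visually confirm that each mask matches its image.
import torchvision

for i, (image, mask) in enumerate(dataloader):  # 'dataloader' is assumed, not the repo's name
    if i >= 4:
        break
    # image: (B, 3, H, W); mask: (B, C, H, W) one-hot class channels (assumed layout)
    torchvision.utils.save_image(image[0], f'check_{i}_image.png')
    # Collapse the one-hot mask to a grayscale label map just for viewing.
    label_map = mask[0].float().argmax(dim=0, keepdim=True) / max(mask.shape[1] - 1, 1)
    torchvision.utils.save_image(label_map, f'check_{i}_mask.png')
```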
