How to condition based on multiple features? #14

Open

vinodrajendran001 opened this issue May 14, 2024 · 2 comments

@vinodrajendran001
I would like to condition the model on multiple features. In my case I have a lot of columns, say A, B, C and D; some of the columns are categorical and some are numerical. I want to implement stable diffusion conditioned on all the columns together. Please advise what modifications I need to make.

Thanks.

@explainingai-code
Owner

Hello @vinodrajendran001,

I think, based on your requirements, you can use class embeddings for the categorical features and timestep-style embeddings for the numerical ones.

Let's go through the categorical conditions first. Assume you have two categorical variables, A and B, each with 3 classes (A1/A2/A3 and B1/B2/B3).
You can use two separate class embeddings, one for each variable, here (https://github.com/explainingai-code/StableDiffusion-PyTorch/blob/main/models/unet_cond_base.py#L61), say class_a_emb and class_b_emb, and then, based on your input, add the conditioning embedding values to the timestep embedding here (https://github.com/explainingai-code/StableDiffusion-PyTorch/blob/main/models/unet_cond_base.py#L155). So for this case, rather than adding just class_emb, you would add both the a_emb and b_emb values.
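A minimal sketch of that first option, with hypothetical names (class_a_emb, class_b_emb, a_idx, b_idx) and an embedding dimension that matches the UNet's timestep embedding; the repo's actual variable names and layer setup may differ:

```python
import torch
import torch.nn as nn

t_emb_dim = 512  # must match the UNet's timestep embedding dimension

# One embedding table per categorical variable (3 classes each)
class_a_emb = nn.Embedding(3, t_emb_dim)
class_b_emb = nn.Embedding(3, t_emb_dim)

def add_categorical_cond(t_emb, a_idx, b_idx):
    # t_emb: (B, t_emb_dim) timestep embedding
    # a_idx, b_idx: (B,) LongTensors with class indices for A and B
    return t_emb + class_a_emb(a_idx) + class_b_emb(b_idx)
```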

You could also combine the classes (assuming every data point has values for both variables). Then you have just one class embedding, but its entries correspond to the combinations (A1B1, A1B2, A1B3, ..., A3B1, A3B2, A3B3). This makes the changes simpler (in fact I think everything will work out of the box and no modification is needed), but it assumes you have values for both variables for all training data, and during generation you would not be able to generate, say, an image conditioned on A1 alone.
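A sketch of the combined-class variant, again with hypothetical names; the only extra piece is mapping the pair of class indices to a single combined index:

```python
import torch
import torch.nn as nn

t_emb_dim = 512
num_a, num_b = 3, 3  # classes per categorical variable

# Single embedding table over all 3 x 3 = 9 combinations (A1B1 ... A3B3)
combined_class_emb = nn.Embedding(num_a * num_b, t_emb_dim)

def combined_index(a_idx, b_idx):
    # Map (a, b) class-index pairs to a single index in [0, num_a * num_b)
    return a_idx * num_b + b_idx

# t_emb = t_emb + combined_class_emb(combined_index(a_idx, b_idx))
```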

For numerical conditions, you can convert the values to positional embeddings using (https://github.com/explainingai-code/StableDiffusion-PyTorch/blob/main/models/blocks.py#L5) and then use something like the timestep embedding projection (https://github.com/explainingai-code/StableDiffusion-PyTorch/blob/main/models/unet_cond_base.py#L148C9-L149C35), with a separate t_proj-style layer for the numerical conditioning field rather than reusing the timestep one. Then, just like the class embedding, add the result to the timestep embedding. Though if your numerical field values do not cover the entire range from min to max, it might make more sense to bin them and convert them to classes as well.
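A self-contained sketch of the numerical path. The sinusoidal helper below is written out inline so the example runs on its own; the repo's own embedding helper in blocks.py may have a different signature, and num_proj is a hypothetical extra projection, separate from the existing t_proj:

```python
import torch
import torch.nn as nn

t_emb_dim = 512  # must be even for the sin/cos split below

def sinusoidal_embedding(values, emb_dim):
    # values: (B,) float tensor holding the numerical condition
    half = emb_dim // 2
    freqs = torch.exp(
        -torch.log(torch.tensor(10000.0)) * torch.arange(half, dtype=torch.float32) / half
    )
    args = values.float()[:, None] * freqs[None, :]
    return torch.cat([torch.sin(args), torch.cos(args)], dim=-1)  # (B, emb_dim)

# Separate projection MLP for the numerical condition (timesteps keep their own t_proj)
num_proj = nn.Sequential(
    nn.Linear(t_emb_dim, t_emb_dim),
    nn.SiLU(),
    nn.Linear(t_emb_dim, t_emb_dim),
)

# numerical_emb = num_proj(sinusoidal_embedding(numeric_vals, t_emb_dim))
```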

So, assuming one additional numerical condition and the two categorical conditions above, you would ultimately do this:

time_step_emb = time_step_emb + class_emb + numerical_emb

This combined time_step_emb is then passed to the down blocks and up blocks here (https://github.com/explainingai-code/StableDiffusion-PyTorch/blob/main/models/unet_cond_base.py#L167).
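Putting the pieces together, the conditioning part of the forward pass would look roughly like this (hypothetical names, reusing the sketches above, not the repo's exact code):

```python
# time_step_emb: timestep embedding after the existing t_proj, shape (B, t_emb_dim)
# a_idx, b_idx: (B,) class indices; numeric_vals: (B,) float condition values
class_emb = class_a_emb(a_idx) + class_b_emb(b_idx)
numerical_emb = num_proj(sinusoidal_embedding(numeric_vals, t_emb_dim))
time_step_emb = time_step_emb + class_emb + numerical_emb

# time_step_emb then flows into the down/mid/up blocks exactly as the plain
# timestep embedding does today.
```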

@vinodrajendran001
Author

That's a good idea. Let me try to translate your inputs into code and experiment.

Thanks :-)
