Code to replicate the results in "Generative Interpretable Visual Design: Using Disentanglement for Visual Conjoint Analysis."
If you find this work useful, please use the following citation information in plaintext or BibTeX format:
Sisodia, A, Burnap, A, and Kumar, V (2024). Generative Interpretable Visual Design: Using Disentanglement for Visual Conjoint Analysis. Journal of Marketing Research (Accepted).
@article{sisodia2024generative,
title={Generative Interpretable Visual Design: Using Disentanglement for Visual Conjoint Analysis},
author={Sisodia, Ankit and Burnap, Alex and Kumar, Vineet},
journal={Journal of Marketing Research},
year={2024},
publisher={SAGE Publications Sage CA: Los Angeles, CA},
url={https://doi.org/10.1177/00222437241276736},
}
This code will reproduce Figures 5, 6, 10, 12, G.1, and G.2 and Tables 2, 6, 7, D.1, E.1, and F.1 of the paper.
This codebase was built using the PyTorch neural net framework. However, it also depends on over 100 other Python packages, as well as GPU libraries including CUDA runtimes and cuDNN kernel support. We cannot guarantee this code will run and replicate unless the same environment and library versions are used. To install these packages (including the same library versions), please use the following:
conda env create -f disentanglement_env.yml
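Then activate the environment before running any commands. The environment name is defined inside disentanglement_env.yml; the name used below is an assumption:
conda activate disentanglement_env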
To download the dataset, please contact the authors. Copy the data files into the ./hyperparameter_selection/data/watches/ and ./post_model_search/data/watches/ directories.
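A minimal copy sketch, assuming the three .npz files (named as in the file listing below) sit in the current directory:

# sketch: place the dataset files in both expected locations
import shutil

for f in ["christies.npz", "christies_test1.npz", "christies_test2.npz"]:
    shutil.copy(f, f"./hyperparameter_selection/data/watches/{f}")
    shutil.copy(f, f"./post_model_search/data/watches/{f}")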
Go to ./hyperparameter_selection and run the disentanglement model with a unique name for each run. For example, in the below command, the supervisory signal is brand, the seed is set to 1, and the model name is brand_s1:
python main.py --sup_signal brand -s 1 --name brand_s1 --btcvae-B 18 --btcvae-M 50
In the above command, the seed, --btcvae-B, and --btcvae-M values vary across the hyperparameter search (a command-generation sketch follows the list below), and --sup_signal can take any of the following values:
discreteprice
brand
circa
material
movement
discreteprice_brand
discreteprice_circa
discreteprice_material
discreteprice_movement
brand_circa
brand_material
brand_movement
circa_material
circa_movement
material_movement
discreteprice_brand_circa
discreteprice_brand_material
discreteprice_brand_movement
discreteprice_circa_material
discreteprice_circa_movement
discreteprice_material_movement
brand_circa_material
brand_circa_movement
brand_material_movement
circa_material_movement
discreteprice_brand_circa_material
discreteprice_brand_circa_movement
discreteprice_brand_material_movement
discreteprice_circa_material_movement
brand_circa_material_movement
discreteprice_brand_circa_material_movement
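Rather than launching each run by hand, the full grid of commands can be generated with a short driver script. A minimal sketch with made-up grid points (the actual 25x15 grid over --btcvae-B and --btcvae-M is described in the paper), giving each run a unique --name per grid point (the naming scheme here is an assumption):

# sketch: print hyperparameter-search commands for a few illustrative settings
import itertools

signals = ["brand", "circa", "brand_circa"]  # subset of the signals listed above
seeds = range(1, 11)                         # 10 random seeds
btcvae_B_grid = [10, 18, 50]                 # placeholder grid values
btcvae_M_grid = [1, 25, 50]                  # placeholder grid values

for sig, s, B, M in itertools.product(signals, seeds, btcvae_B_grid, btcvae_M_grid):
    print(f"python main.py --sup_signal {sig} -s {s} "
          f"--name {sig}_s{s}_B{B}_M{M} --btcvae-B {B} --btcvae-M {M}")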
Each such command will create a directory results/<model-name>/, which will contain:
- model.pt: The model at the end of training.
- specs.json: The parameters used to run the program (default and modified with CLI).
- train_losses.csv: All (sub-)losses computed during training on the train and validation dataset.
- test_losses.log: All (sub-)losses computed at the end of training on the test1 and test2 dataset.
- filename_test1.csv: filenames of all watches in the test1 dataset.
- filename_test2.csv: filenames of all watches in the test2 dataset.
- filename_train.csv: filenames of all watches in the train dataset.
- mean_params_test1.csv: mean visual characteristics of all watches in the test1 dataset.
- mean_params_test2.csv: mean visual characteristics of all watches in the test2 dataset.
- mean_params_train.csv: mean visual characteristics of all watches in the train dataset.
For each combination of seed, supervisory signal, and hyperparameter values, read the value of sup_loss_test from the first JSON object in the file ending in test_losses.log in the directory results/<model-name>/, and use it to select the optimal hyperparameter values.
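A minimal parsing sketch, assuming the first line of each test_losses.log is a JSON object containing a sup_loss_test key:

# sketch: collect sup_loss_test across all runs and sort by it
import glob
import json

losses = {}
for path in glob.glob("results/*/test_losses.log"):
    with open(path) as f:
        losses[path] = json.loads(f.readline())["sup_loss_test"]  # first JSON object

for path, loss in sorted(losses.items(), key=lambda kv: kv[1]):
    print(path, loss)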
Go to ./post_model_search and run the disentanglement model at the optimal hyperparameter values. For the watch dataset, execute the commands listed in execute_step2.txt to use the values listed in the paper. For example, execute the following command:
python main.py --sup_signal brand_circa_movement -s 10 --name brand_circa_movement_s10 --btcvae-B 50 --btcvae-M 1
The above command will create a directory results/brand_circa_movement_s10/, which will contain:
- model.pt: The model at the end of training.
- specs.json: The parameters used to run the program (default and modified with CLI).
- train_losses.csv: All (sub-)losses computed during training on the train and validation dataset.
- test_losses.log: All (sub-)losses computed at the end of training on the test1 and test2 dataset.
- brand_circa_movement_s10_filename_test1.csv: filenames of all watches in the test1 dataset.
- brand_circa_movement_s10_filename_test2.csv: filenames of all watches in the test2 dataset.
- brand_circa_movement_s10_filename_train.csv: filenames of all watches in the train dataset.
- brand_circa_movement_s10_mean_params_test1.csv: mean visual characteristics of all watches in the test1 dataset.
- brand_circa_movement_s10_mean_params_test2.csv: mean visual characteristics of all watches in the test2 dataset.
- brand_circa_movement_s10_mean_params_train.csv: mean visual characteristics of all watches in the train dataset.
python main_viz.py --name brand_circa_movement_s10
The above command will create brand_circa_movement_s10_posterior_traversals.png (Figure 5 and Figure G.1a) in results/brand_circa_movement_s10/.
python main_viz.py --name circa_s10
The above command will create circa_s10_posterior_traversals.png (Figure G.1b) in results/circa_s10/.
python main_viz.py --name unsupervised_s10
The above command will create unsupervised_s10_posterior_traversals.png (Figure G.1c) in results/unsupervised_s10/.
python main_viz.py --name ae_s10
The above command will create ae_s10_posterior_traversals.png (Figure G.2a) in results/ae_s10/.
python main_viz.py --name vae_s10
The above command will create vae_s10_posterior_traversals.png (Figure G.2b) in results/vae_s10/.
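To render all five traversal figures in one pass, a small driver loop can help (model names as in the commands above):

# sketch: run main_viz.py for every trained model listed above
import subprocess

for name in ["brand_circa_movement_s10", "circa_s10", "unsupervised_s10",
             "ae_s10", "vae_s10"]:
    subprocess.run(["python", "main_viz.py", "--name", name], check=True)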
Copy the files stored in the results/<model_name>/ directories with filenames ending in mean_params_test2.csv to the calculate_udr folder.
Go to ./calculate_udr to compare the UDRs for different supervisory signals. Switch to an R environment and execute the script udr_calculation.R with the supervisory signal as the argument. For example:
Rscript udr_calculation.R --sup_signal='brand'
The results will be appended to the file udr.log. This replicates the results in Table F.1 of the paper.
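For intuition, UDR scores pairs of runs that differ only in seed by how well their latent dimensions align one-to-one. A rough Python sketch of that core idea (udr_calculation.R implements the actual scoring used in the paper; this is not its exact formula):

# rough sketch of the UDR idea: compare mean latents from two seeds of one model
import numpy as np

def udr_pair(za, zb):
    # za, zb: (n_samples, n_latents) arrays, e.g. loaded from *mean_params_test2.csv
    k = za.shape[1]
    corr = np.abs(np.corrcoef(za.T, zb.T)[:k, k:])  # cross-correlation block
    # a disentangled pair has one dominant entry per row and per column
    row = corr.max(axis=1) / (corr.sum(axis=1) + 1e-8)
    col = corr.max(axis=0) / (corr.sum(axis=0) + 1e-8)
    return (row.mean() + col.mean()) / 2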
Go to ./r_script. Switch to an R environment and run the R Markdown script replication_script.Rmd to produce Tables 2, D.1, and E.1 and Figure 6 of the paper.
See the directory /conjoint_analysis_and_ideal_point_design for an example IPython notebook that runs the hierarchical Bayesian estimation and produces Tables 6-7 and Figures 10-12 in the paper.
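For intuition about what the notebook estimates: a choice model with respondent-level part-worths shrunk toward population-level means. Below is a minimal MAP-style sketch in PyTorch with made-up dimensions (illustrative only; the repository's run_HB_conjoint.py implements the full hierarchical Bayesian estimation):

# sketch: hierarchical multinomial-logit conjoint fitted with a MAP objective
import torch
import torch.nn.functional as F

R, T, A, K = 50, 10, 3, 6           # respondents, tasks, alternatives, attributes (made up)
X = torch.randn(R, T, A, K)         # attribute levels of each alternative
y = torch.randint(0, A, (R, T))     # observed choices

mu = torch.zeros(K, requires_grad=True)         # population-level part-worth means
log_sigma = torch.zeros(K, requires_grad=True)  # log population-level sds
beta = torch.zeros(R, K, requires_grad=True)    # respondent-level part-worths

opt = torch.optim.Adam([mu, log_sigma, beta], lr=0.05)
for _ in range(2000):
    opt.zero_grad()
    utilities = torch.einsum("rtak,rk->rta", X, beta)  # choice utilities
    nll = F.cross_entropy(utilities.reshape(-1, A), y.reshape(-1), reduction="sum")
    # Normal shrinkage prior on beta around mu (Gaussian negative log-density)
    prior = (((beta - mu) / log_sigma.exp()) ** 2 / 2 + log_sigma).sum()
    (nll + prior).backward()
    opt.step()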
Note that generating the ideal point design requires inputting the "ideal point" embedding values into the generative model from the disentanglement portion of this codebase (this applies to Figure 11). Modify the torch tensor in the save_cbc_images function in the ./post_model_search/utils/visualize.py file to save the ideal point watch image.
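The edit is along these lines; the values below are placeholders for the estimated ideal-point embedding, and the tensor's width must match the model's number of latent dimensions:

# sketch of the edit inside save_cbc_images (placeholder values, not the paper's estimates)
import torch

ideal_point = torch.tensor([[0.8, -1.2, 0.3, 1.5, -0.7, 0.0]])
# the function's existing decode/save logic then renders this latent as the ideal-point watch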
Below is a description of each file in the repository:
./README.md: README file
./disentanglement_env.yml: Environment File
./calculate_udr/udr.log: Sample Output of udr_calculation.R
./calculate_udr/udr_calculation.R: Script to Calculate UDR
./calculate_udr/*mean_params_test2.csv: mean visual characteristics of all watches in the test2 dataset.
./conjoint_analysis_and_ideal_point_design/conjoint_analysis_benchmark_models/data_generated.py: Data Generation for Conjoint
./conjoint_analysis_and_ideal_point_design/conjoint_analysis_benchmark_models/models.py: Prediction Models
./conjoint_analysis_and_ideal_point_design/conjoint_analysis_benchmark_models/requirements_benchmark_models.yml: Environment File
./conjoint_analysis_and_ideal_point_design/conjoint_analysis_benchmark_models/train_generated_watch_prediction.py: Watch Prediction Training Functions
./conjoint_analysis_and_ideal_point_design/data_conjointanalysis/CBCwatchexercise_simple_responses_generated.csv: Conjoint Survey Response CSV File
./conjoint_analysis_and_ideal_point_design/data_conjointanalysis/conj_gen_file_mapping_AnkitThresholds.csv: CSV file to map to actual visual characteristic quantified levels
./conjoint_analysis_and_ideal_point_design/data_conjointanalysis/FullConjointData_generated_mapped_variables.csv: Conjoint Survey Raw File
./conjoint_analysis_and_ideal_point_design/data_conjointanalysis/disentanglement_example.png: Disentanglement Example
./conjoint_analysis_and_ideal_point_design/Example_Python_Notebook_with_Results_and_Plots.ipynb: Conjoint Interactive Python Notebook
./conjoint_analysis_and_ideal_point_design/data.py: Data Generation for Conjoint
./conjoint_analysis_and_ideal_point_design/HB_conjoint_requirements.yml: Environment File
./conjoint_analysis_and_ideal_point_design/run_HB_conjoint.py: Hierarchical Bayesian Conjoint Prediction File
./hyperparameter_selection/__pycache__/*: bytecode cache files automatically generated by python
./hyperparameter_selection/data/watches/christies.npz: Training Data
./hyperparameter_selection/data/watches/christies_test1.npz: Test1 Data
./hyperparameter_selection/data/watches/christies_test2.npz: Test2 Data
./hyperparameter_selection/dataset/__pycache__/*: bytecode cache files automatically generated by python
./hyperparameter_selection/dataset/datasets.py: for processing data
./hyperparameter_selection/models/__pycache__/*: bytecode cache files automatically generated by python
./hyperparameter_selection/models/initialization.py: initializing the neural network
./hyperparameter_selection/models/losses.py: computing the neural network losses
./hyperparameter_selection/models/math.py: helper file with useful math functions
./hyperparameter_selection/models/modelIO.py: helper file for reading/writing model
./hyperparameter_selection/models/regression.py: supervised layer
./hyperparameter_selection/models/vae.py: code for setting up the VAE
./hyperparameter_selection/training/__pycache__/*: bytecode cache files automatically generated by python
./hyperparameter_selection/training/evaluate.py: code to evaluate the trained model
./hyperparameter_selection/training/training.py: code to train the model
./hyperparameter_selection/utils/__pycache__/*: bytecode cache files automatically generated by python
./hyperparameter_selection/utils/__init__.py: package initialization file
./hyperparameter_selection/utils/helpers.py: helper functions
./hyperparameter_selection/utils/visualize.py: code to visualize the learned visual characteristics
./hyperparameter_selection/utils/viz_helpers.py: helper functions for visualization
./hyperparameter_selection/hyperparam.ini: configuration file for hyperparameters
./hyperparameter_selection/example_commands.txt: file with example commands
./hyperparameter_selection/main.py: main python execution file
./post_model_search/__pycache__/*: bytecode cache files automatically generated by python
./post_model_search/data/watches/christies.npz: Training Data
./post_model_search/data/watches/christies_test1.npz: Test1 Data
./post_model_search/data/watches/christies_test2.npz: Test2 Data
./post_model_search/dataset/__pycache__/*: bytecode cache files automatically generated by python
./post_model_search/dataset/datasets.py: for processing data
./post_model_search/models/__pycache__/*: bytecode cache files automatically generated by python
./post_model_search/models/initialization.py: initializing the neural network
./post_model_search/models/losses.py: computing the neural network losses
./post_model_search/models/math.py: helper file with useful math functions
./post_model_search/models/modelIO.py: helper file for reading/writing model
./post_model_search/models/regression.py: supervised layer
./post_model_search/models/vae.py: code for setting up the VAE
./post_model_search/results/<model_name>/*: files generated on executing the model with name <model_name>
./post_model_search/training/__pycache__/*: bytecode cache files automatically generated by python
./post_model_search/training/evaluate.py: code to evaluate the trained model
./post_model_search/training/training.py: code to train the model
./post_model_search/utils/__pycache__/*: bytecode cache files automatically generated by python
./post_model_search/utils/__init__.py: package initialization file
./post_model_search/utils/helpers.py: helper functions
./post_model_search/utils/visualize.py: code to visualize the learned visual characteristics
./post_model_search/utils/viz_helpers.py: helper functions for visualization
./post_model_search/hyperparam.ini: configuration file for hyperparameters
./post_model_search/execute_step2.txt: file with example commands
./post_model_search/main.py: main python execution file
./post_model_search/main_viz.py: main python execution file to generate images for conjoint analysis
We used a 64GB GPU to run each model instance with 100 epochs, batch size 64, and learning rate 5e-4. Each model run takes about 8 minutes. We trained 10 random seeds x 32 supervisory signal combinations x 25x15 grid values = 120,000 models, which amounts to roughly 16,000 hours of model training.
Feel free to use this code for your research. If you find this code useful, please use the following citation information in plaintext or BibTeX format:
Sisodia, A., Burnap, A., & Kumar, V. (2024). Generative Interpretable Visual Design: Using Disentanglement for Visual Conjoint Analysis. Journal of Marketing Research, 0(0). https://doi.org/10.1177/00222437241276736
@article{sisodia2024generative,
title={Generative Interpretable Visual Design: Using Disentanglement for Visual Conjoint Analysis},
author={Sisodia, Ankit and Burnap, Alex and Kumar, Vineet},
journal={Journal of Marketing Research},
year={2024},
publisher={SAGE Publications Sage CA: Los Angeles, CA},
url={https://doi.org/10.1177/00222437241276736},
}
Portions of this codebase were built on elements from the following open source projects; please see their respective licenses and cite accordingly: