gradient_descent_learning

This is the code for analysing the data from a rerun of https://osf.io/preprints/psyarxiv/75e4t/ .

The code contains:

Initial data wrangling
A maximum likelihood model that outputs the strategy a participant was using for a trial
A Hidden Markov Model that infers hidden states based on the output of the maximum likelihood model

Data details `/data`

This directory contains the initial data and preprocessing pipeline.

datawrangle.py: Collects all participant responses into a single csv file

exclusion.py: Creates file of participant IDs to be excluded from experiment (see script comments)

fullcode.csv: Code file for decoding randomisation

cleaned_pilotdata.csv: output from datawrangle.py. Contains participant trials along with randomisation coding.

exclusions.csv: output from exclusion.py. Contains excluded participant IDs

/pilotdata: contains initial raw data from 10 participants.

Likelihood Model `/likelihood_model`

model_fitting.py

This model finds the optimal strategy for a given participant trial.

Each participant has four observations per trial (e.g., [21211, 21121, 21221, 11211]).

The script first decodes the strong and weak features and then based on the trial observations finds the strategy that is most likely to result in the given observations.

The model outputs a csv file /likel_results/model_fit_results.csv:

This contains the likelihoods and guessing parameter (alpha) values for each strategy.

It also contains three columns with alternative ways of deriving the best_strategy:

best_strategy_guess: If multiple strategies have the same likelihood, the label "guessing" is set as the value for that trial
best_strategy_rand: If multiple strategies have the same likelihood, a strategy is selected at random from the equally likely strategies
best_strategies: If multiple strategies have the same likelihood, a list containing all the equally likely strategies is set as the value for that trial

Hidden Markov Model `/hmm`

Data preprocessing

hmm_data.py: Reads in model_fit_results.csv and generates three .csv files with the following templates:

under /hmm_data:

FOR: /hmm_data_best_strategy_rand.csv & /hmm_data_best_strategies.csv

Strong	Weak1	Weak 2	Prototype
0/1	0/1	0/1	0/1

FOR: /hmm_data_best_strategy_guess.csv

Strong	Weak1	Weak 2	Prototype	Guessing
0/1	0/1	0/1	0/1	0/1

HMM search

hmm_search.py: Searches for the optimal number of hidden states based on AIC and BIC. Shows a plot with AIC and BIC values for each number of hidden states.

HMM fit

hmm_test.py: Fits 1000 randomly initialised HMMs with a set number of hidden states.

Outputs:

The best model (/hmm_results/best_model_states_X.pkl)
Inferred states for each trial (/hmm_results/hmm_results_states_X.csv).

Plots `/plots`

alpha_plot.py: Plots the average guessing parameter (alpha) across trials for all strategies

strategy_plot.py: Plots the average use of strategies (likelihood model) across trials

hmm_plot.py: Plots the HMM model parameters as a graph

prototype_plot.py: Plots the proportion of participants selecting the "prototype bug" across trials

Analysis Procedure

Initial Data Setup Run /data/datawrangle.py. This creates a single dataframe from all participant data with columns for ID, trial number, choices for given trial (1-4), randomisation spreadsheet, and the bug_id the test trial was asking to identify. Output: /data/cleaned_pilotdata.csv
Maximum Likelihood Model fit Run /likelihood_model/model_fitting.py. This runs the maximum likelihood model which calculates the most likely strategy a participant was using, based on their selections for a given trial. Output: /likelihood_model/likel_results/model_fit_results.csv
Participant Exclusion Run /data/exclusion.py to exclude participants. Participants are excluded based on two criteria:
1. Not selecting the prototype (most predictive) bug on the final test trial.
2. If the maximum likelihood model suggested the participant was guessing on the final test trial (alpha = 0).
  
  Output: /data/exclusions.csv (list of excluded participant IDs)
HMM Data Setup Run /hmm/hmm_data.py. This generates an input file for the HMM from the maximum likelihood model output. Output: /hmm/hmm_data/hmm_best_(...).csv (filename depends on the number of participants and method for dealing with equally likely strategies for a trial)
HMM Model Search Run /hmm/hmm_search.py. This plots the BIC and AIC values across different number of hidden states. Use the lowest BIC value to select the best number of states. Output: /hmm/hmm_results/hmm_search_(...).png (filename depends on number on the number of participants and method for dealing with equally likely strategies for a trial)
HMM Model Fit Run /hmm/hmm_test.py. This fits a HMM with a specified number of states to a /hmm/hmm_data/... file. The best model is then used to predict states for the same input data. Output: /hmm/hmm_results/best_model_(...).pkl and /hmm/hmm_results/hmm_result_(...).csv
Plot Results Run /plots/alpha_plot.py, /plots/strategy_plot.py, /plots/prototype_plot.py, and /plots/hmm_plot.py. Output: /plots/images/...

Name		Name	Last commit message	Last commit date
Latest commit History 106 Commits
ANN		ANN
data		data
hmm		hmm
likelihood_model		likelihood_model
plots		plots
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gradient_descent_learning

Data details `/data`

Likelihood Model `/likelihood_model`

Hidden Markov Model `/hmm`

Data preprocessing

HMM search

HMM fit

Plots `/plots`

Analysis Procedure

About

Releases

Packages

Languages

AkillesU/gradient_descent_learning

Folders and files

Latest commit

History

Repository files navigation

gradient_descent_learning

Data details /data

Likelihood Model /likelihood_model

Hidden Markov Model /hmm

Data preprocessing

HMM search

HMM fit

Plots /plots

Analysis Procedure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Data details `/data`

Likelihood Model `/likelihood_model`

Hidden Markov Model `/hmm`

Plots `/plots`

Packages