Procedural Dilemma Generation for Moral Reasoning in Humans and Language Models
This is a supporting repository for our paper "Procedural Dilemma Generation for Moral Reasoning in Humans and Language Models" (2024, CogSci).
Preregistrations for all experiments are available on the Open Science Framework (OSF):
- Experiment 1 - Good/Harm Judgments
- Experiment 2 - Permissibility and Intention Judgments
├── data
│ ├── conditions_mild_harm_mild_good (50 scenarios)
│ ├── conditions_severe_harm_severe_good (10 scenarios, only used in Experiment 1)
│ └── results
├── prolific-exp-1
├── prolific-exp-2
└── src
├── prompts
├── stage_1.py
└── stage_2.py
data
contains the conditionsconditions_mild_harm_mild_good
including matched mild harm and mild good outcomes. We used to first 10 scenarios from each condition for our comparison in the paper (80 items). We also includeconditions_severe_harm_severe_good
which includes conditions with matched severe harm and severe good outcomes.results
include model responses including chain-of-thought examples
prolific-exp-1
andprolific-exp-2
include the experimental stimuli and formatted participant data frames (_long_format.csv
)src
includes prompts and code (stage_1.py
andstage_2.py
) for generating items.docs
contains all the experiment code. You can preview the experiments below:- Experiment 1 - Good/Harm Judgments
- Experiment 2 - Permissibility and Intention Judgments