This is my (Daniel Young's) project for COMS E6998 002 Natural Language Generation and Summarization. It is exploring underlying biases in large language models via. Contrast Consistent Search.
- Hidden states are generated via. LLM in
hs.ipynb
- CCS model and logistic regression models are trained on hidden states in
train.ipynb
- Perplexities can be generated using
msp.ipynb
- Cross-result analysis can be done with
analysis.ipynb
- Results are saved in structure
<Bias Type>/<Model Name>
- Hidden states from
hs.ipynb
are stored insaved/
- Results for CCS and perplexity analysis from
train.ipynb
andmsp.ipynb
are stored inresults/crowspairs
andresults/msp
&results/perp
- Professions data is from Man is to Computer Programmer as Woman is to Homemaker saved in
professions.json
- CrowS-Pairs data is from CrowS-Pairs paper We filter out antistereos.
- The CrowS-Pairs dataset has a lot of errors that needs to be fixed.