- When prompting a language model (LM), users frequently expect the model to adhere to a set of behavioral principles across diverse tasks, such as producing insightful content while avoiding harmful or biased language. Instilling such principles into …
+ As AI systems like language models are increasingly integrated into decision-making processes affecting people's lives, it's critical to ensure that these systems have sound moral reasoning. To test whether they do, we need to develop systematic …
- As AI systems like language models are increasingly integrated into decision-making processes affecting people's lives, it's critical to ensure that these systems have sound moral reasoning. To test whether they do, we need to develop systematic …
+ This paper examines the transformative potential of Counterfactual World Simulation Models (CWSMs). CWSMs use pieces of multi-modal evidence, such as the CCTV footage or sound recordings of a road accident, to build a high-fidelity 3D reconstruction …
diff --git a/docs/publication_types/3/index.xml b/docs/publication_types/3/index.xml
index 1c53827..14e9ad6 100644
--- a/docs/publication_types/3/index.xml
+++ b/docs/publication_types/3/index.xml
@@ -84,15 +84,6 @@
-
- Self-supervised alignment with mutual information: Learning to follow principles without preference labels
- https://cicl.stanford.edu/publication/franken2024sami/
- Mon, 22 Apr 2024 00:00:00 +0000
-
- https://cicl.stanford.edu/publication/franken2024sami/
-
-
-
Procedural dilemma generation for evaluating moral reasoning in humans and language models
https://cicl.stanford.edu/publication/franken2024rails/
diff --git a/docs/publication_types/3/page/2/index.html b/docs/publication_types/3/page/2/index.html
index 1465bb3..57ef31f 100644
--- a/docs/publication_types/3/page/2/index.html
+++ b/docs/publication_types/3/page/2/index.html
@@ -238,15 +238,6 @@
-
- This paper examines the transformative potential of Counterfactual World Simulation Models (CWSMs). CWSMs use pieces of multi-modal evidence, such as the CCTV footage or sound recordings of a road accident, to build a high-fidelity 3D reconstruction …
-
-
+
+ When choosing how to describe what happened, we have a number of causal verbs at our disposal. In this paper, we develop a model-theoretic formal semantics for nine causal verbs that span the categories of CAUSE, ENABLE, and PREVENT. We use …
+
+