Skip to content
View paul-rottger's full-sized avatar

Highlights

  • Pro

Block or report paul-rottger

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. xstest xstest Public

    Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"

    Jupyter Notebook 85 9

  2. hatecheck-data hatecheck-data Public

    Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data

    57 11

  3. msts-multimodal-safety msts-multimodal-safety Public

    Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"

    Jupyter Notebook 12 2

  4. hatecheck-experiments hatecheck-experiments Public

    Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code

    Jupyter Notebook 11 3

  5. efficient-low-resource-hate-detection efficient-low-resource-hate-detection Public

    Röttger et al. (EMNLP 2022): "Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages"

    Jupyter Notebook 7 1

  6. issuebench issuebench Public

    Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"

    Jupyter Notebook 6