
Add potential safety precaution: curated failsafe #178

Open
NatalieZelenka opened this issue Jul 5, 2023 · 0 comments
Labels
feedback Feedback on Data Hazards

Comments


NatalieZelenka commented Jul 5, 2023

This safety precaution may apply to multiple labels, for example "Danger of Misuse" or "Reinforces Existing Bias".

For example, if you're building a chatbot, you could add a process after the LLM that checks whether certain keywords appear in its output, and ensure that such responses are not shared if they are, e.g., racist, encouraging suicide, or otherwise inappropriate.
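A minimal sketch of what such a post-LLM failsafe could look like (the keyword list and fallback message here are illustrative placeholders, not a real moderation list):

```python
# Sketch of a curated-keyword failsafe applied to LLM output.
# BLOCKED_KEYWORDS and FALLBACK are hypothetical placeholders; a real
# deployment would use a carefully curated, maintained list.
BLOCKED_KEYWORDS = {"slur_example", "harmful_phrase_example"}

FALLBACK = "Sorry, I can't share that response."

def failsafe_filter(llm_output: str) -> str:
    """Return the LLM output only if it contains no blocked keywords,
    otherwise return a safe fallback message."""
    lowered = llm_output.lower()
    if any(keyword in lowered for keyword in BLOCKED_KEYWORDS):
        return FALLBACK
    return llm_output
```

Simple substring matching like this is easy to evade, which is part of why such failsafes are only one layer of precaution rather than a complete fix.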

Sometimes these kinds of fixes are a bit annoying, as they paper over deeper issues in using data science/AI (and it's also possible in this example that they are vulnerable to prompt injection), but I also think this is one type of precaution to be aware of.

For example, in the phenotype predictor, perhaps it would have been useful to have a curated list of phenotypes that are sensible to predict. This could have made the thing I was building less open to misuse.
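The same idea works as an allowlist rather than a blocklist: refuse to run a prediction unless the target is on a curated list. A sketch, with hypothetical phenotype names (not taken from the actual predictor):

```python
# Sketch of a curated allowlist failsafe: predictions run only for
# phenotypes on a vetted list. The phenotype names are hypothetical.
ALLOWED_PHENOTYPES = {"height", "eye colour"}

def check_phenotype(phenotype: str) -> None:
    """Raise an error for any phenotype not on the curated list,
    blocking the prediction before it runs."""
    if phenotype.lower() not in ALLOWED_PHENOTYPES:
        raise ValueError(f"Prediction for '{phenotype}' is not permitted.")
```

An allowlist fails closed: anything not explicitly vetted is refused, which is generally safer for misuse prevention than trying to enumerate every bad case.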

@NatalieZelenka NatalieZelenka added the feedback Feedback on Data Hazards label Jul 5, 2023