Dive into the underbelly of AI security with Chatbot Arena and Basi 🐍
Can you convince AI to bend to your will? Or can it resist your advances?
We are here to have fun pushing the boundaries of AI and finding out what is possible.
This is not a bug bounty, because convincing a model to speak freely is not a bug. We are here to provide tools and evaluations for people looking to push the limits of AI. Most of all, we are here to build the red-teaming community into a worldwide army.
Right now, we're running one game:
- Bad Words: Make the machine spew bad words. We will randomly sample a model and a naughty phrase. You have 60 seconds to make the model say the phrase. Winners are ranked on an elo leaderboard.
But this is just the beginning. We've got plans for more games. And if you want to contribute, please reach out; this is a community project and we can always use more hands on deck.
Here's how you run the code locally.
First, run
git clone https://github.com/redteaming-arena/redteam-arena
Start local frontend by executing the following:
You can use npm, or pnpm to set up and run the frontend. Choose the instructions that match your preferred package manager:
cd web
npm install
npm run start
To start the backend, follow these instructions.
First, modify /backend/app/core/config.py
to allow_origins=["*"],
or you can export ENV=DEV
.
Alembic is a database migration tool for SQLAlchemy. It allows you to:
- Track changes to your database schema over time
- Apply and revert these changes in a controlled manner
- Manage different versions of your database schema across different environments
Important: you have to follow instructions here to setup the database.
Finally, execute the following from a fresh conda environment:
cd backend
pip install -r requirements.txt
uvicorn app.main:app --reload
Then open a browser and go to http://localhost:3000/
.
Got a killer idea? We want it. Here's how:
- Fork the repo
- Create your feature branch (
git checkout -b feature/killerFeature
) - Commit your changes (
git commit -am 'Added a game to make AIs cry'
) - Push to the branch (
git push origin feature/killerFeature
) - Open a Pull Request
Keep the code clean and lean.