Athina is an Observability and Experimentation platform for AI teams.
This SDK is an open-source repository of 50+ preset evals. You can also use custom evals.
This SDK also serves as a companion to Athina IDE where you can prototype pipelines, run experiments and evaluations, and compare datasets.
Follow this notebook for a quick start guide.
To get an Athina API key, sign up at https://app.athina.ai
These evals can be run programmatically, or via the UI on Athina IDE.
Compare datasets side-by-side (Docs)
Once a dataset is logged to Athina IDE, you can also compare it against another dataset.
Once you run evals using Athina, they will be visible in Athina IDE where you can run experiments, evals, and compare datasets side-by-side.