Skip to content

👋 Welcome to Athina AI

Athina is building monitoring and evaluation tools for LLM developers.

Sign Up | Website | Contact

  • Evals SDK: Open-source framework for evaluating LLMs (Python + CLI)
  • Platform: Monitor your production inferences, and automatically run evals

hero

Open-Source SDK for Evals

athina-ai/athina-evals

Documentation | Quick Start | Running Evals

We have a library of preset evaluators, but you can also write custom evaluators within the Athina framework.

Example Preset Evals:

  • Context Contains Enough Information: Detect bad or insufficient retrievals.
  • Does Response Answer Query: Detect incomplete or irrelevant responses.
  • Response Faithfulness: Detect when responses are deviating from the provided context.
  • Summarization Accuracy: Detect hallucinations and mistakes in summaries
  • Grading Criteria: If X, then fail. Otherwise pass.
  • Custom Evals: Custom prompt for LLM-powered evaluation.
  • RAGAS: A set of evaluators that return RAGAS metrics.

Results can also be viewed and tracked on our platform. develop-view

Monitoring & Evaluations Platform for LLM Inferences

Documentation | Demo Video | Sign Up

  • UI for monitoring and visibility into your LLM inferences.
  • Run evals automatically against logged inferences in production.
  • Track cost, token usage, response times, feedback, pass rate and other eval metrics.
  • Analytics segmented by Customer ID, Model, Prompt, Environment, and More.
  • Topic Classification
  • Data Exports
  • ... and more

Contact [email protected] if you have any questions.

Pinned Loading

  1. athina-evals athina-evals Public

    Python SDK for running evaluations on LLM generated responses

    Python 223 13

Repositories

Showing 10 of 14 repositories
  • athina-evals Public

    Python SDK for running evaluations on LLM generated responses

    athina-ai/athina-evals’s past year of commit activity
    Python 223 13 1 5 Updated Nov 26, 2024
  • athina-ai/athina-deploy’s past year of commit activity
    Shell 3 0 0 0 Updated Nov 26, 2024
  • athina-client Public

    A light weight version of athina SDK

    athina-ai/athina-client’s past year of commit activity
    Python 0 0 0 0 Updated Nov 22, 2024
  • athina-logger Public

    SDK to log LLM inference calls to Athina

    athina-ai/athina-logger’s past year of commit activity
    Python 2 2 0 1 Updated Oct 30, 2024
  • athina-ai/athina-docs’s past year of commit activity
    MDX 1 MIT 0 0 2 Updated Oct 29, 2024
  • ai-research-papers Public

    Summaries of AI Research Papers

    athina-ai/ai-research-papers’s past year of commit activity
    10 2 0 0 Updated Jun 29, 2024
  • athina-ai/athina-evals-ci’s past year of commit activity
    Python 2 0 0 0 Updated Feb 23, 2024
  • ragas Public Forked from explodinggradients/ragas

    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

    athina-ai/ragas’s past year of commit activity
    Python 0 Apache-2.0 749 0 0 Updated Feb 5, 2024
  • athina-sdk Public

    LLM Testing SDK that helps you write and run tests to monitor your LLM app in production

    athina-ai/athina-sdk’s past year of commit activity
    Python 132 1 1 1 Updated Jan 22, 2024
  • ariadne Public

    LLM Evals for Text Summarization and RAG use-cases.

    athina-ai/ariadne’s past year of commit activity
    Python 35 Apache-2.0 0 0 0 Updated Jan 22, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…