langbench
is an easy to use benchmarking library for langchain based LLM pipelines.
pip install langbench
langbench
provides the following metrics:
toxicity
: Toxicity of the generated text.bias
: Bias of the generated text, including:- political
- gender
- racial
- educational
- nationality
- religious
- others
latency
: Latency of the generated text.
Reports are generated in the form of an html file.
The official documentation is available at langbench.
This project is licensed under the MIT License - see the LICENSE file for details.