The Inference Perf project aims to provide a GenAI inference performance benchmarking tool. It came out of wg-serving and is sponsored by SIG Scalability. See the proposal for more info.
This project is currently in development.
This repository uses the PDM Python package manager for dependency management.
- Set up a virtual environment with `pdm` and install the dependencies: `make all-deps`
- Run the inference-perf CLI: `pdm run inference-perf`
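For reference, a minimal end-to-end sketch of the steps above, assuming PDM is installed via pip and the repository has already been cloned (the repository URL shown is an assumption for illustration):

```sh
# Install the PDM package manager (assumes pip is available)
pip install pdm

# Clone the repository (URL assumed for illustration) and enter it
git clone https://github.com/kubernetes-sigs/inference-perf.git
cd inference-perf

# Create the virtual environment and install dependencies via the Makefile target
make all-deps

# Run the inference-perf CLI inside the PDM-managed environment
pdm run inference-perf
```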
Our community meeting is held weekly on Thursdays at 11:30 PDT (Zoom Link, Meeting Notes).
We currently use the #wg-serving Slack channel for communications.
Contributions are welcome; thanks for joining us!
Participation in the Kubernetes community is governed by the Kubernetes Code of Conduct.