From e7c392c5da1b74cc2d203d0519db7c803d7988ca Mon Sep 17 00:00:00 2001 From: EricLBuehler Date: Wed, 6 Mar 2024 14:52:07 -0500 Subject: [PATCH] Add mistralrs mention --- README.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/README.md b/README.md index 75eba95..88504f8 100644 --- a/README.md +++ b/README.md @@ -23,6 +23,13 @@ X-LoRA is easily applied to any HuggingFace Transformers model. Please see our w See the [examples](examples) folder for some examples of how to get started with X-LoRA. +## Efficent Inference Support +[Mistral.rs](https://github.com/EricLBuehler/mistral.rs) is an inference framework which supports X-LoRA! To use it, follow the installation instructions and run the following command to start up an X-LoRA inference platform! + +`./mistralrs-server --port 1234 x-lora-mistral -o ordering.json` + +Base and X-LoRA Huggingface model IDs may be specified through command line switches to use your own models. Please see the Github page for further details. + ## Installation Pending a pip release, run the following command to install X-LoRA.