Skip to content

Create 25-25-04-GenAIComps-001-SGLang-Support.md #369

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
89 changes: 89 additions & 0 deletions community/rfcs/25-25-04-GenAIComps-001-SGLang-Support.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,89 @@
# Support SGLang in OPEA



This RFC is for supporting SGLang as one of the inference engine in OPEA.



## Author(s)



[<gooxidalao>](<(https://github.com/gooxidalao?tab=repositories)>)



## Status



`Under Review`



## Objective



SGLang is a fast serving framework for large language models and vision language models. It makes your interaction with models faster and more controllable by co-designing the backend runtime and frontend language, while achieving higher throughput than competitive solution. Supporting SGLang in OPEA provides user with a faster and more efficient option to interact with the large language models.



This RFC discusses about why and how shall we add SGLang as one of the inference backends in OPEA.



## Motivation



SGLang's Python-based DSL frontend and highly optimized backend enable fast inference and structured output generation, making it a powerful tool for efficient execution of large language model programs. OPEA as a framework that harnesses the best innovations across the ecosystem shall adopt the engine to provide user with more options.



## Design Proposal



The proposed code structure for SGLang is:



```

GenAIComps/

├── third_parties/

└── sglang/ # the folder containing the deployment file and docker script

├── README.md

├── deployment/

└── src/

```



## Compatibility



n/a.



## Miscellaneous



List other information user and developer may care about, such as:



- Engineering Impact:

- provide one extra option for inference engine