
<div align="center" id="sglangtop"> <img src="https://raw.githubusercontent.com/sgl-project/sglang/main/assets/logo.png" alt="logo" width="400" margin="10px"></img>


</div>

| Blog | Documentation | Join Slack | Join Bi-Weekly Development Meeting | Slides |


About

SGLang is a fast serving framework for large language models and vision language models. It makes your interaction with models faster and more controllable by co-designing the backend runtime and frontend language.

Getting Started

Install SGLang: See https://sgl-project.github.io/start/install.html

Send requests: See https://sgl-project.github.io/start/send_request.html
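As a rough illustration of what sending a request can look like, here is a minimal sketch using only the Python standard library. It assumes a server is already running locally on port 30000 and exposes a native `/generate` endpoint that accepts a JSON body with `text` and `sampling_params` fields. The port, the endpoint path, the field names, and the helper names `build_generate_request` and `send` are assumptions made for this example, so check the page linked above for the authoritative request format.

```python
import json
import urllib.request

# Assumption: the server listens on localhost:30000. Adjust to your setup.
BASE_URL = "http://localhost:30000"


def build_generate_request(prompt, max_new_tokens=32, temperature=0.0,
                           base_url=BASE_URL):
    """Build (but do not send) a POST request for the assumed /generate endpoint."""
    payload = {
        "text": prompt,
        # Assumed field names; verify them against the official docs.
        "sampling_params": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
    }
    return urllib.request.Request(
        url=f"{base_url}/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=30.0):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With a server running, `send(build_generate_request("The capital of France is"))` would return the decoded JSON response. Building the request separately from sending it makes the payload easy to inspect or log before any network call happens.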

Backend: SGLang Runtime (SRT)

See https://sgl-project.github.io/backend/backend.html

Frontend: Structured Generation Language (SGLang)

See https://sgl-project.github.io/frontend/frontend.html

Benchmark And Performance

Learn more in our release blogs: v0.2 blog, v0.3 blog

Roadmap

Development Roadmap (2024 Q4)

Citation And Acknowledgment

Please cite our paper, SGLang: Efficient Execution of Structured Language Model Programs, if you find the project useful. We also learned from the design of, and reused code from, the following projects: Guidance, vLLM, LightLLM, FlashInfer, Outlines, and LMQL.