sglang
Here are 66 public repositories matching this topic...
Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.
Updated Feb 28, 2026 - Python
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enabling zero-shot voice cloning from short audio references.
Updated Feb 27, 2026 - Python
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
Updated Feb 27, 2026 - Python
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Updated Feb 27, 2026 - Python
MOVA: Towards Scalable and Synchronized Video-Audio Generation
Updated Feb 19, 2026 - Python
High-quality Chinese speech synthesis and voice cloning services, built on models such as SparkTTS and OrpheusTTS.
Updated May 18, 2025 - Python
Open Model Engine (OME) -- Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton
Updated Feb 27, 2026 - Go
OpenClaw-RL: Personalize openclaw simply by talking to it
Updated Feb 27, 2026 - TypeScript
Easy, advanced inference platform for large language models on Kubernetes. Star to support our work!
Updated Jan 26, 2026 - Go
gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.
Updated Feb 27, 2026 - Python
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
Updated Feb 27, 2026 - Go
Efficient LLM inference on Slurm clusters.
Updated Feb 23, 2026 - Python
Shepherd Model Gateway
Updated Feb 28, 2026 - Rust
An open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.
Updated Feb 27, 2026 - Go
A tool for benchmarking LLMs on Modal
Updated Aug 29, 2025 - Python