Skip to content
@sgl-project

sgl-project

Pinned Loading

  1. sglang sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python 23.4k 4.4k

  2. sgl-learning-materials sgl-learning-materials Public

    Materials for learning SGLang

    740 57

  3. ome ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go 368 61

  4. genai-bench genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    Python 263 49

  5. SpecForge SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python 679 153

  6. sglang-jax sglang-jax Public

    JAX backend for SGL

    Python 235 68

Repositories

Showing 10 of 22 repositories
  • sglang-jax Public

    JAX backend for SGL

    sgl-project/sglang-jax’s past year of commit activity
    Python 235 Apache-2.0 68 94 (8 issues need help) 29 Updated Feb 9, 2026
  • sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    sgl-project/sglang’s past year of commit activity
    Python 23,433 Apache-2.0 4,369 633 (30 issues need help) 1,535 Updated Feb 9, 2026
  • SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    sgl-project/SpecForge’s past year of commit activity
    Python 679 MIT 153 55 (1 issue needs help) 20 Updated Feb 9, 2026
  • ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    sgl-project/ome’s past year of commit activity
    Go 368 Apache-2.0 61 33 (2 issues need help) 43 Updated Feb 9, 2026
  • sgl-kernel-xpu Public

    SGLang kernel library for Intel XPU

    sgl-project/sgl-kernel-xpu’s past year of commit activity
    Python 17 MIT 17 0 11 Updated Feb 8, 2026
  • sgl-project.github.io Public

    This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang

    sgl-project/sgl-project.github.io’s past year of commit activity
    HTML 101 25 10 1 Updated Feb 9, 2026
  • whl Public

    Kernel Library Wheel for SGLang

    sgl-project/whl’s past year of commit activity
    HTML 17 MIT 7 1 1 Updated Feb 9, 2026
  • DeepGEMM Public Forked from deepseek-ai/DeepGEMM

    DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

    sgl-project/DeepGEMM’s past year of commit activity
    Cuda 22 MIT 820 0 0 Updated Feb 9, 2026
  • mini-sglang Public

    A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

    sgl-project/mini-sglang’s past year of commit activity
    Python 3,388 418 10 18 Updated Feb 8, 2026
  • FlashMLA Public Forked from deepseek-ai/FlashMLA

    FlashMLA: Efficient Multi-head Latent Attention Kernels

    sgl-project/FlashMLA’s past year of commit activity
    C++ 0 MIT 987 0 0 Updated Feb 8, 2026