[Roadmap] SGLang-Diffusion (26 Q2) #23035

@mickqian

Performance Improvements

  • LTX-2 model series (one-stage and two-stage pipelines)
  • low precision (nvfp4, mxfp4, fp8)
  • graph-level optimization
  • optimizations on consumer-level GPUs
  • kernel fusion instead of compiler-driven fusion

Features

Disaggregation (already supported)

  • Fully disaggregated pipeline, using Mooncake as the transfer engine and compatible with all models
  • Extensibility: make it easier to define and adjust the boundaries of disaggregated pipeline stages
  • Hybrid Parallelism: support complex, advanced parallelism within pipeline stages

Model support

Community Contributions

We welcome all forms of community contribution — from bug reports and documentation improvements
to kernel optimizations, model integration, and new feature ideas.

We will be actively posting tasks in the Slack channel #collab-diffusion; feel free to pick them up.

If you're interested in participating, please check:

Your improvements, whether small or large, can help make SGLang-Diffusion serving faster, easier, and more versatile.
