Performance Improvements
- LTX-2 model series (one-stage and two-stage pipelines)
- low precision (nvfp4, mxfp4, fp8)
- graph-level optimization
- optimizations on consumer-level GPUs
- kernel fusion instead of compiler-driven fusion
Features
Disaggregation (already supported)
- Fully disaggregated pipeline, with Mooncake as the transfer engine and compatibility with all models
- Extensibility: make it easier to adjust and define the boundaries of disaggregated pipeline stages
- Hybrid Parallelism: support complex and advanced parallelism within pipeline stages
Model support
Community Contributions
We welcome all forms of community contribution — from bug reports and documentation improvements
to kernel optimizations, model integration, and new feature ideas.
We will be actively posting tasks in the Slack channel #collab-diffusion; feel free to pick them up.
If you're interested in participating, please check:
Your improvements, no matter how small or large, can help make SGLang-Diffusion serving faster, easier, and more versatile.