[diffusion] fix LTX2 resident defaults and stage profiling by mickqian · Pull Request #25596 · sgl-project/sglang

mickqian · 2026-05-18T09:17:12Z

What changed

Keep unset auxiliary components resident for high-memory LTX-2.3 two-stage resident mode.
Preserve explicit --layerwise-offload-components and explicit component offload args.
Record pipeline stage metrics with the registered stage name when a stage has one, so duplicate stage classes no longer overwrite each other in perf logs.

Why

High-memory resident mode keeps both LTX2 DiTs on GPU. The previous auto defaults could still apply non-DiT layerwise offload to text/image encoders or VAE, so the mode was not fully resident for unset auxiliary placement.
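The default resolution described above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: `OffloadArgs`, `resolve_offload_defaults`, the flag names, and the `ASSUMED_AUTO_LAYERWISE` list are all hypothetical stand-ins, and `None` is assumed to mean "not set by the user".

```python
from dataclasses import dataclass, replace
from typing import List, Optional

# Hypothetical flag names; the real sglang server args may be named differently.
AUX_OFFLOAD_FLAGS = (
    "text_encoder_cpu_offload",
    "image_encoder_cpu_offload",
    "vae_cpu_offload",
)
# Assumed stand-in for the previous automatic non-DiT layerwise default.
ASSUMED_AUTO_LAYERWISE = ["text_encoder", "image_encoder", "vae"]


@dataclass
class OffloadArgs:
    # None means the user did not set the option explicitly.
    text_encoder_cpu_offload: Optional[bool] = None
    image_encoder_cpu_offload: Optional[bool] = None
    vae_cpu_offload: Optional[bool] = None
    layerwise_offload_components: Optional[List[str]] = None


def resolve_offload_defaults(
    args: OffloadArgs, high_memory_resident: bool
) -> OffloadArgs:
    """Fill in unset options only; explicit user values are never overwritten."""
    resolved = replace(args)  # shallow copy so the caller's args stay untouched
    if not high_memory_resident:
        # Original mode: keep the automatic layerwise default when unset.
        if resolved.layerwise_offload_components is None:
            resolved.layerwise_offload_components = list(ASSUMED_AUTO_LAYERWISE)
        return resolved
    # High-memory resident mode: unset auxiliary components stay on the GPU.
    for flag in AUX_OFFLOAD_FLAGS:
        if getattr(resolved, flag) is None:
            setattr(resolved, flag, False)
    if resolved.layerwise_offload_components is None:
        resolved.layerwise_offload_components = []
    return resolved
```

In this sketch an explicit `layerwise_offload_components=["vae"]` survives in resident mode, while a fully unset configuration resolves to no auxiliary offload at all.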

The profiler also used only the Python class name, so repeated stage classes such as the two LTX2 LoRA switch stages collapsed into one LTX2LoRASwitchStage metric.
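A small sketch of why keying metrics by class name loses data, and how preferring a registered name avoids it. The `Stage` base class, the `registered_name` attribute, `StageProfiler`, and the stage names used below are hypothetical; only the `LTX2LoRASwitchStage` class name comes from the description above.

```python
import time
from contextlib import contextmanager
from typing import Dict, Iterator, Optional


class Stage:
    """Hypothetical stage base class with an optional registered name."""

    def __init__(self, registered_name: Optional[str] = None) -> None:
        self.registered_name = registered_name


class LTX2LoRASwitchStage(Stage):
    pass


def metric_name(stage: Stage) -> str:
    # Prefer the registered name; fall back to the Python class name.
    return stage.registered_name or type(stage).__name__


class StageProfiler:
    def __init__(self) -> None:
        self.durations: Dict[str, float] = {}

    @contextmanager
    def record(self, stage: Stage) -> Iterator[None]:
        start = time.perf_counter()
        try:
            yield
        finally:
            # Same key twice means the later stage overwrites the earlier one.
            self.durations[metric_name(stage)] = time.perf_counter() - start


profiler = StageProfiler()
# Two instances of the same class, distinguished only by registered name.
for stage in (
    LTX2LoRASwitchStage("lora_switch_stage1"),
    LTX2LoRASwitchStage("lora_switch_stage2"),
):
    with profiler.record(stage):
        pass  # stage work would run here
```

With class-name keys both iterations would write to `LTX2LoRASwitchStage`; with registered names the profiler keeps two separate entries.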

Validation

Added unit coverage for high-memory LTX-2.3 resident defaults, original-mode default layerwise behavior, explicit layerwise preservation, and registered stage-name profiling.
Not run locally per sglang-diffusion development policy; CI should validate.

CI States

Latest PR Test (Base): ⏳ Run #26044398041
Latest PR Test (Extra): ⚠️ Not enabled -- add run-ci-extra label to opt in.

gemini-code-assist

Code Review

This pull request introduces logic to automatically keep auxiliary components (text encoder, image encoder, and VAE) resident in GPU memory when running LTX-2.3 in two-stage 'resident' mode on high-memory CUDA devices. It also updates the pipeline stage profiling to use registered stage names and adds comprehensive unit tests for these changes. Feedback was provided to unify the high-memory detection logic by including a specific check for H200 devices, ensuring consistency with existing device-specific configurations.

gemini-code-assist · 2026-05-18T09:19:34Z

+    def _uses_ltx23_high_memory_resident_two_stage_mode(self) -> bool:
+        if (
+            self.ltx2_two_stage_device_mode != "resident"
+            or not self._is_ltx23_two_stage_pipeline()
+            or not current_platform.is_cuda()
+        ):
+            return False
+        return (
+            current_platform.get_device_total_memory() / BYTES_PER_GB
+            >= LTX2_RESIDENT_AUTO_ENABLE_MEM_GB
+        )


The high-memory check in _uses_ltx23_high_memory_resident_two_stage_mode is inconsistent with the logic used in _resolve_default_ltx2_two_stage_device_mode (lines 478-481). Specifically, it is missing the check for the H200 device name, which is also considered a high-memory platform regardless of the exact reported memory value. Unifying this logic ensures that auxiliary components are correctly kept resident on all high-memory platforms.

    def _uses_ltx23_high_memory_resident_two_stage_mode(self) -> bool:
        if (
            self.ltx2_two_stage_device_mode != "resident"
            or not self._is_ltx23_two_stage_pipeline()
            or not current_platform.is_cuda()
        ):
            return False
        device_name = str(current_platform.get_device_name(0)).upper()
        device_total_memory_gb = (
            current_platform.get_device_total_memory() / BYTES_PER_GB
        )
        return (
            "H200" in device_name
            or device_total_memory_gb >= LTX2_RESIDENT_AUTO_ENABLE_MEM_GB
        )
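The device part of the suggested check can be exercised without a GPU by passing the device name and memory in as plain values. This is only a sketch: `is_high_memory_device` is a hypothetical helper, `BYTES_PER_GB` is assumed to be 1024**3, and the threshold is a caller-supplied parameter because the actual value of `LTX2_RESIDENT_AUTO_ENABLE_MEM_GB` is not shown in this thread.

```python
BYTES_PER_GB = 1024**3  # assumption; the project constant may be defined differently


def is_high_memory_device(
    device_name: str, total_memory_bytes: int, threshold_gb: float
) -> bool:
    """Return True for H200 devices or any device at or above the threshold."""
    total_gb = total_memory_bytes / BYTES_PER_GB
    return "H200" in device_name.upper() or total_gb >= threshold_gb
```

Keeping the predicate free of platform calls makes the H200 special case easy to cover in a unit test, for example by passing an H200 name with a memory value just under the threshold.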

gemini-code-assist · 2026-05-18T12:03:54Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

mickqian · 2026-05-18T12:05:05Z

/tag-and-rerun-ci

…project#25596)

Fix LTX2 resident defaults and stage profiling

ec5bc3b

github-actions Bot added the diffusion SGLang Diffusion label May 18, 2026

gemini-code-assist Bot reviewed May 18, 2026


Use registered stage names for profiling

05ce51c

mickqian marked this pull request as ready for review May 18, 2026 12:03

mickqian requested review from ping1jing2 and yhyang201 as code owners May 18, 2026 12:03

github-actions Bot added the run-ci label May 18, 2026

mickqian added 2 commits May 18, 2026 20:45

Fix lightweight stage profiling tests

851b56d

Preserve class stage names for perf metrics

be97510

amd-bot mentioned this pull request May 19, 2026

[CI Monitor] Daily Report - 2026-05-19 bingxche/sglang-ci-bot#77

Open

mickqian merged commit a7b3ced into sgl-project:main May 19, 2026
93 of 99 checks passed

fzyzcjy mentioned this pull request May 19, 2026

Sandbox: verify full main CI is green on latest main (do not merge) #25647

Closed

4 tasks

Shunkangz pushed a commit to Shunkangz/sglang that referenced this pull request May 27, 2026

[diffusion] fix: fix LTX2 resident defaults and stage profiling (sgl-…

f421d6e

…project#25596)

alphabetc1 pushed a commit to alphabetc1/sglang that referenced this pull request Jun 4, 2026

[diffusion] fix: fix LTX2 resident defaults and stage profiling (sgl-…

922514b

…project#25596)


mickqian merged 4 commits into sgl-project:main from mickqian:codex/ltx2-resident-profiler-fix-20260518
