Skip to content

[diffusion][CI] Add CI for MOVA model inference#20430

Merged
sglang-npu-bot merged 10 commits intosgl-project:mainfrom
CloudRipple:main
Mar 24, 2026
Merged

[diffusion][CI] Add CI for MOVA model inference#20430
sglang-npu-bot merged 10 commits intosgl-project:mainfrom
CloudRipple:main

Conversation

@CloudRipple
Copy link
Copy Markdown
Contributor

Motivation

This pull request adds support for testing the MOVA-360p video generation model in the multimodal generation test suite. The changes include introducing the model constant, defining sampling parameters, and adding several test cases for different GPU configurations.

Modifications

Model integration:

  • Added the constant DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST for the MOVA-360p model in test_utils.py.
  • Included DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST in the list of available models in testcase_configs.py.

Sampling and test case setup:

  • Defined MOVA_I2V_360P_sampling_params with prompt, image, and video settings for MOVA-360p tests.
  • Added a single GPU test case for MOVA-360p to ONE_GPU_CASES_A.
  • Added multiple two-GPU test cases for MOVA-360p with different parallelism configurations to TWO_GPU_CASES_A.

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@github-actions github-actions Bot added the diffusion SGLang Diffusion label Mar 12, 2026
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Summary of Changes

Hello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request integrates the MOVA-360p video generation model into the existing multimodal generation test suite. It establishes the necessary configurations and introduces a series of test cases to validate the model's functionality and performance across different GPU setups, ensuring robust CI coverage for this new model.

Highlights

  • MOVA-360p Model Integration: The DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST constant was introduced and registered within the test configuration.
  • Sampling Parameters Defined: MOVA_I2V_360P_sampling_params were added, specifying prompt, image, video settings, and frame rates for MOVA-360p tests.
  • Comprehensive Test Cases Added: New test cases were included for MOVA-360p, covering both single-GPU and multiple two-GPU configurations with varying parallelism settings (TP, ring, ulysses degrees).

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Changelog
  • python/sglang/multimodal_gen/test/server/testcase_configs.py
    • Imported the DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST constant.
    • Defined MOVA_I2V_360P_sampling_params for video generation.
    • Added a DiffusionTestCase for MOVA-360p on a single GPU.
    • Added three DiffusionTestCase entries for MOVA-360p on two GPUs, exploring different tensor parallelism, ring, and ulysses degrees.
  • python/sglang/multimodal_gen/test/test_utils.py
    • Declared DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST with its model path.
Activity
  • The author completed several checklist items, including code formatting, adding unit tests, and adhering to SGLang code style guidance.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request correctly adds CI tests for the new MOVA-360p model, including configurations for both single and multi-GPU setups with various parallelism strategies. The changes are functional and align with the goal of expanding test coverage. I have a couple of suggestions to enhance code organization and maintainability, primarily by adding a grouping comment for the new model constant and refactoring duplicated test case configurations to improve clarity and reduce redundancy.

Comment on lines +804 to +841
DiffusionTestCase(
"mova_360p_tp2",
DiffusionServerArgs(
model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST,
modality="video",
num_gpus=2,
tp_size=2,
dit_layerwise_offload=True,
),
MOVA_I2V_360P_sampling_params,
run_perf_check=False,
),
DiffusionTestCase(
"mova_360p_ring1_uly2",
DiffusionServerArgs(
model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST,
modality="video",
num_gpus=2,
ring_degree=1,
ulysses_degree=2,
dit_layerwise_offload=True,
),
MOVA_I2V_360P_sampling_params,
run_perf_check=False,
),
DiffusionTestCase(
"mova_360p_ring2_uly1",
DiffusionServerArgs(
model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST,
modality="video",
num_gpus=2,
ring_degree=2,
ulysses_degree=1,
dit_layerwise_offload=True,
),
MOVA_I2V_360P_sampling_params,
run_perf_check=False,
),
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

There is significant duplication in the DiffusionServerArgs for the new MOVA 2-GPU test cases. To improve maintainability, consider defining a base configuration object for the common arguments and then use dataclasses.replace to create the specific variations for each test case. This would make it easier to update common parameters in the future.

For example:

# Define a base config before the list
base_mova_2gpu_args = DiffusionServerArgs(
    model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST,
    modality="video",
    num_gpus=2,
    dit_layerwise_offload=True,
)

# ...

TWO_GPU_CASES_A = [
    # ... other cases
    DiffusionTestCase(
        "mova_360p_tp2",
        dataclasses.replace(base_mova_2gpu_args, tp_size=2),
        MOVA_I2V_360P_sampling_params,
        run_perf_check=False,
    ),
    # ... other mova cases using replace
]

Comment thread python/sglang/multimodal_gen/test/test_utils.py
num_gpus=1,
dit_layerwise_offload=True,
),
MOVA_I2V_360P_sampling_params,
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could we reuse the existing sampling_params?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok, we fixed this @mickqian

@CloudRipple CloudRipple requested a review from mickqian March 17, 2026 06:56
@mickqian
Copy link
Copy Markdown
Collaborator

/tag-and-rerun-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/tag-and-rerun-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

4 similar comments
@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/tag-and-rerun-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

3 similar comments
@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

6 similar comments
@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/tag-and-rerun-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

4 similar comments
@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/tag-and-rerun-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

3 similar comments
@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@yhyang201
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

@sglang-npu-bot sglang-npu-bot merged commit a3ed2e4 into sgl-project:main Mar 24, 2026
84 of 89 checks passed
@ping1jing2
Copy link
Copy Markdown
Collaborator

i merged it as Mick approved and all CIs passed

adityavaid pushed a commit to adityavaid/sglang that referenced this pull request Mar 24, 2026
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
adityavaid pushed a commit to adityavaid/sglang that referenced this pull request Mar 24, 2026
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
johnnycxm pushed a commit to johnnycxm/sglang that referenced this pull request Mar 25, 2026
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
johnnycxm pushed a commit to johnnycxm/sglang that referenced this pull request Mar 25, 2026
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
JustinTong0323 pushed a commit to JustinTong0323/sglang that referenced this pull request Apr 7, 2026
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

diffusion SGLang Diffusion run-ci

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants