[diffusion][CI] Add CI for MOVA model inference#20430
[diffusion][CI] Add CI for MOVA model inference#20430sglang-npu-bot merged 10 commits intosgl-project:mainfrom
Conversation
Summary of ChangesHello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request integrates the MOVA-360p video generation model into the existing multimodal generation test suite. It establishes the necessary configurations and introduces a series of test cases to validate the model's functionality and performance across different GPU setups, ensuring robust CI coverage for this new model. Highlights
🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console. Changelog
Activity
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
There was a problem hiding this comment.
Code Review
This pull request correctly adds CI tests for the new MOVA-360p model, including configurations for both single and multi-GPU setups with various parallelism strategies. The changes are functional and align with the goal of expanding test coverage. I have a couple of suggestions to enhance code organization and maintainability, primarily by adding a grouping comment for the new model constant and refactoring duplicated test case configurations to improve clarity and reduce redundancy.
| DiffusionTestCase( | ||
| "mova_360p_tp2", | ||
| DiffusionServerArgs( | ||
| model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST, | ||
| modality="video", | ||
| num_gpus=2, | ||
| tp_size=2, | ||
| dit_layerwise_offload=True, | ||
| ), | ||
| MOVA_I2V_360P_sampling_params, | ||
| run_perf_check=False, | ||
| ), | ||
| DiffusionTestCase( | ||
| "mova_360p_ring1_uly2", | ||
| DiffusionServerArgs( | ||
| model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST, | ||
| modality="video", | ||
| num_gpus=2, | ||
| ring_degree=1, | ||
| ulysses_degree=2, | ||
| dit_layerwise_offload=True, | ||
| ), | ||
| MOVA_I2V_360P_sampling_params, | ||
| run_perf_check=False, | ||
| ), | ||
| DiffusionTestCase( | ||
| "mova_360p_ring2_uly1", | ||
| DiffusionServerArgs( | ||
| model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST, | ||
| modality="video", | ||
| num_gpus=2, | ||
| ring_degree=2, | ||
| ulysses_degree=1, | ||
| dit_layerwise_offload=True, | ||
| ), | ||
| MOVA_I2V_360P_sampling_params, | ||
| run_perf_check=False, | ||
| ), |
There was a problem hiding this comment.
There is significant duplication in the DiffusionServerArgs for the new MOVA 2-GPU test cases. To improve maintainability, consider defining a base configuration object for the common arguments and then use dataclasses.replace to create the specific variations for each test case. This would make it easier to update common parameters in the future.
For example:
# Define a base config before the list
base_mova_2gpu_args = DiffusionServerArgs(
model_path=DEFAULT_MOVA_360P_MODEL_NAME_FOR_TEST,
modality="video",
num_gpus=2,
dit_layerwise_offload=True,
)
# ...
TWO_GPU_CASES_A = [
# ... other cases
DiffusionTestCase(
"mova_360p_tp2",
dataclasses.replace(base_mova_2gpu_args, tp_size=2),
MOVA_I2V_360P_sampling_params,
run_perf_check=False,
),
# ... other mova cases using replace
]| num_gpus=1, | ||
| dit_layerwise_offload=True, | ||
| ), | ||
| MOVA_I2V_360P_sampling_params, |
There was a problem hiding this comment.
could we reuse the existing sampling_params?
…_360P_sampling_params with TI2V_sampling_params (#8)
|
/tag-and-rerun-ci |
|
/tag-and-rerun-ci |
|
/rerun-failed-ci |
4 similar comments
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/tag-and-rerun-ci |
|
/rerun-failed-ci |
3 similar comments
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
6 similar comments
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/tag-and-rerun-ci |
|
/rerun-failed-ci |
4 similar comments
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/tag-and-rerun-ci |
|
/rerun-failed-ci |
3 similar comments
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
/rerun-failed-ci |
|
i merged it as Mick approved and all CIs passed |
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Luo <139519292+0-693@users.noreply.github.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Motivation
This pull request adds support for testing the
MOVA-360pvideo generation model in the multimodal generation test suite. The changes include introducing the model constant, defining sampling parameters, and adding several test cases for different GPU configurations.Modifications
Model integration:
DEFAULT_MOVA_360P_MODEL_NAME_FOR_TESTfor the MOVA-360p model intest_utils.py.DEFAULT_MOVA_360P_MODEL_NAME_FOR_TESTin the list of available models intestcase_configs.py.Sampling and test case setup:
MOVA_I2V_360P_sampling_paramswith prompt, image, and video settings for MOVA-360p tests.ONE_GPU_CASES_A.TWO_GPU_CASES_A.Accuracy Tests
Benchmarking and Profiling
Checklist
Review Process
/tag-run-ci-label,/rerun-failed-ci,/tag-and-rerun-ci