Skip to content

[Chore][CI] Split k3 multiprocess tests into parallel pipeline steps#2914

Merged
sammshen merged 1 commit intoLMCache:devfrom
sammshen:split-k3-mp-tests
Mar 30, 2026
Merged

[Chore][CI] Split k3 multiprocess tests into parallel pipeline steps#2914
sammshen merged 1 commit intoLMCache:devfrom
sammshen:split-k3-mp-tests

Conversation

@sammshen
Copy link
Copy Markdown
Contributor

No description provided.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request refactors the multiprocess CI tests into parallel Buildkite steps and introduces run-single-test.sh for individual test orchestration. Feedback identifies a likely typo in the model name, the omission of step priority in the new pipeline configuration, and a recommendation to use stricter shell options for script robustness.

export VLLM_BASELINE_PORT="${VLLM_BASELINE_PORT:-9000}"
export MAX_WAIT_SECONDS="${MAX_WAIT_SECONDS:-300}"
export BUILD_ID="${BUILDKITE_BUILD_ID:-local_$$}"
export MODEL="${MODEL:-Qwen/Qwen3-14B}"
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

Qwen/Qwen3-14B appears to be a typo, as Qwen 2.5 is the current version and Qwen 3 has not been released. Please verify if Qwen/Qwen2.5-14B was intended to avoid CI failures during model download.

Suggested change
export MODEL="${MODEL:-Qwen/Qwen3-14B}"
export MODEL="${MODEL:-Qwen/Qwen2.5-14B}"

Comment on lines +10 to +11
timeout_in_minutes: 30
agents: { queue: "k8s" }
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The priority: 1 setting from the original step has been omitted in the new parallel steps. This may cause these tests to be scheduled with lower priority than before. Consider adding it back to each step in the group to maintain consistent CI performance.

        timeout_in_minutes: 30
        priority: 1
        agents: { queue: "k8s" }

#
# Each invocation is self-contained: launches servers, runs one test, cleans up.
# This mirrors the comprehensive tests' run-single-config.sh pattern.
set -o pipefail
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

For better robustness and consistency with run.sh, use set -euo pipefail. This ensures the script exits immediately if a command fails or an unset variable is used, which is particularly important for an orchestrator script.

Suggested change
set -o pipefail
set -euo pipefail

@sammshen sammshen added the full Run comprehensive tests on this PR label Mar 30, 2026
@sammshen sammshen requested review from Oasis-Git and deng451e March 30, 2026 22:57
@sammshen sammshen enabled auto-merge (squash) March 30, 2026 23:09
@sammshen sammshen merged commit 27a8c1e into LMCache:dev Mar 30, 2026
41 of 43 checks passed
jooho-XCENA pushed a commit to xcena-dev/LMCache that referenced this pull request Apr 2, 2026
…MCache#2914)

[CI] Split k3 multiprocess tests into parallel pipeline steps

Co-authored-by: Samuel Shen <slshen@uchciago.edu>
jooho-XCENA pushed a commit to xcena-dev/LMCache that referenced this pull request Apr 2, 2026
…MCache#2914)

[CI] Split k3 multiprocess tests into parallel pipeline steps

Co-authored-by: Samuel Shen <slshen@uchciago.edu>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

full Run comprehensive tests on this PR

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants