[Temp CI Patch]: torch version for UT #2856

Merged
sammshen merged 4 commits into LMCache:dev from sammshen:torch-11-UT-patch
Mar 24, 2026

Conversation

@sammshen sammshen commented Mar 23, 2026

torch 2.11 was just released, built on CUDA 13.0, but our unit-test CI machine still has CUDA 12.1. This is a temporary fix to unblock CI.
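
The mismatch described above can be illustrated with a minimal shell sketch. It compares only CUDA major versions, which is a simplification (actual compatibility also depends on the installed driver), and the helper names `cuda_major` and `needs_newer_cuda` are hypothetical, not part of LMCache or the CI pipeline.

```shell
# Hypothetical helpers, for illustration only.
# cuda_major VERSION: print the major component, e.g. "13" for "13.0".
cuda_major() {
  printf '%s\n' "${1%%.*}"
}

# needs_newer_cuda BUILD_CUDA HOST_CUDA: succeed when the wheel's CUDA major
# version is greater than the host's (simplified, major-only comparison).
needs_newer_cuda() {
  [ "$(cuda_major "$1")" -gt "$(cuda_major "$2")" ]
}

# Values from the PR description: wheel built on CUDA 13.0, CI host on CUDA 12.1.
if needs_newer_cuda "13.0" "12.1"; then
  echo "torch wheel targets a newer CUDA major version than the CI host"
fi
```

On a real runner, the two inputs would have to come from the installed wheel and the host toolchain. How the CI machine reports them is not stated in this PR.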


Note

Low Risk
Low risk: only adjusts CI dependency installation for NVIDIA runners; main risk is CI flakes if the pinned torch range is too narrow or conflicts with other requirements.

Overview
Updates the Buildkite unit-test pipeline to pin torch to >=2.6.0,<2.11.0 on NVIDIA (CUDA) runners, ensuring compatibility with the CI machines' CUDA 12.x drivers.

AMD/ROCm installation behavior is unchanged.
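
A minimal sketch of the idea in the overview, assuming the pipeline can tell NVIDIA and AMD runners apart. The `torch_requirement` function and the `cuda`/`rocm` labels are hypothetical. Only the specifier `torch>=2.6.0,<2.11.0` comes from this PR, and the real pipeline step may be structured differently.

```shell
# Hypothetical helper: print the torch requirement to install for a runner type.
torch_requirement() {
  case "$1" in
    cuda) echo "torch>=2.6.0,<2.11.0" ;;  # pin introduced by this PR
    rocm) echo "" ;;                      # assumption: no extra pin, behavior unchanged
    *)    echo "unknown runner type: $1" >&2; return 1 ;;
  esac
}

req="$(torch_requirement cuda)"
if [ -n "$req" ]; then
  # Print the command instead of running it, so the sketch has no side effects.
  echo "would run: uv pip install \"$req\""
fi
```

Keeping the specifier in one place makes the temporary pin easier to find and remove once the runners are upgraded.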

Written by Cursor Bugbot for commit bc60cfb.

Signed-off-by: Samuel Shen <slshen@uchciago.edu>
@gemini-code-assist

Summary of Changes

Hello, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request implements a temporary fix for CI failures by restricting the torch version used in the Unit Test CI environment. The recent release of torch 2.11, which requires CUDA 13.0, was causing build issues because the CI machines are currently configured with CUDA 12.1. By pinning the torch version, the CI can now successfully run unit tests without encountering incompatible dependency errors.

Highlights

  • CI Compatibility: Pinned the torch library version in the CI pipeline to ensure compatibility with CUDA 12.1 drivers, preventing issues with the newly released torch 2.11 which requires CUDA 13.0.

@gemini-code-assist gemini-code-assist Bot left a comment

Code Review

This pull request introduces a temporary patch to the Buildkite CI pipeline to pin the torch version, aiming to resolve a compatibility issue with the CI machine's CUDA version. While the intention is correct, the implementation has a critical flaw: the specified torch version range torch>=2.6.0,<2.11.0 is invalid and will break the build. My review includes a comment with a suggested fix that uses a valid version specifier and also improves maintainability by adding a TODO comment to track the removal of this temporary fix. This is important to prevent accumulating technical debt in the CI configuration.

Comment thread .buildkite/pipeline.yml
Comment on lines +18 to +19
# Pin torch to a version compatible with the CI machine's CUDA 12.x driver
uv pip install "torch>=2.6.0,<2.11.0"

critical

The specified torch version range torch>=2.6.0,<2.11.0 is invalid as no public torch versions exist in this range, which will cause the CI step to fail.

Additionally, since this is a temporary patch, it's a best practice to add a TODO comment with a reference to a tracking issue. This makes the temporary nature of the fix explicit and helps ensure it's removed later to avoid technical debt. The suggested change below corrects the version pin and adds a TODO for tracking.

        # TODO(CI): Remove torch pin after upgrading runners beyond CUDA 12.1. See issue #<issue_number>.
        uv pip install "torch<2.4.0"
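
Whether a given release falls inside a specifier such as `>=2.6.0,<2.11.0` can be checked with a small sketch like the one below. It assumes a `sort` that supports `-V` (GNU coreutils and recent BSD versions), handles only plain numeric versions rather than full PEP 440 syntax, and uses hypothetical helper names.

```shell
# version_lt A B: succeed when version A sorts strictly before version B.
# Assumes `sort -V` (version sort) is available.
version_lt() {
  [ "$1" = "$2" ] && return 1
  first="$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)"
  [ "$first" = "$1" ]
}

# in_range VERSION LOWER UPPER: succeed when LOWER <= VERSION < UPPER.
in_range() {
  ! version_lt "$1" "$2" && version_lt "$1" "$3"
}

# Candidate version strings checked against the bounds from the PR's pin.
for v in 2.5.1 2.6.0 2.10.0 2.11.0; do
  if in_range "$v" 2.6.0 2.11.0; then
    echo "$v satisfies >=2.6.0,<2.11.0"
  else
    echo "$v is outside >=2.6.0,<2.11.0"
  fi
done
```

Version sort matters here: a plain string comparison would order `2.10.0` before `2.6.0`.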

deng451e self-requested a review Mar 23, 2026 21:37
sammshen enabled auto-merge (squash) Mar 23, 2026 22:01
ApostaC added the integration_testing label (Run the functional test for container image) Mar 23, 2026
ApostaC added the full label (Run comprehensive tests on this PR) Mar 24, 2026
sammshen merged commit dcff86a into LMCache:dev Mar 24, 2026
26 of 27 checks passed
realAaronWu pushed a commit to realAaronWu/LMCache that referenced this pull request Mar 26, 2026
UT patch

Signed-off-by: Samuel Shen <slshen@uchciago.edu>
Co-authored-by: Samuel Shen <slshen@uchciago.edu>
deng451e pushed a commit to deng451e/LMCache that referenced this pull request Mar 27, 2026
UT patch

Signed-off-by: Samuel Shen <slshen@uchciago.edu>
Co-authored-by: Samuel Shen <slshen@uchciago.edu>
jooho-XCENA pushed a commit to xcena-dev/LMCache that referenced this pull request Apr 2, 2026
UT patch

Signed-off-by: Samuel Shen <slshen@uchciago.edu>
Co-authored-by: Samuel Shen <slshen@uchciago.edu>
Labels

full: Run comprehensive tests on this PR
integration_testing: Run the functional test for container image
