[CI] Disable TRANSFORMERS_OFFLINE for nightly vLLM benchmark runs #176553
huydhn wants to merge 1 commit into pytorch:main
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/176553
Note: links to docs will display an error until the docs builds have been completed. ⏳ 25 Pending, 1 Unrelated Failure as of commit 067833d with merge base 9dfac50. FLAKY: the following job failed but was likely due to flakiness present on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Nightly scheduled runs should be able to download from HuggingFace directly rather than being restricted to the local cache. This adds an `is_nightly` input to the reusable workflow, set to true when the caller detects `github.event_name == 'schedule'`. Manual dispatch runs continue to use the cache (TRANSFORMERS_OFFLINE=1). Includes a comment explaining the rationale for the nightly override. Signed-off-by: Huy Do <huydhn@gmail.com>
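The wiring can be sketched roughly as follows. The `is_nightly` input and the `github.event_name == 'schedule'` check come from the description above; the file names, job name, and the exact place where `TRANSFORMERS_OFFLINE` is set are illustrative assumptions, not copied from the PR:

```yaml
# Caller workflow (file name illustrative): pass is_nightly only for scheduled runs
jobs:
  benchmark:
    uses: ./.github/workflows/_vllm-benchmark.yml
    with:
      # True only for scheduled (nightly) runs; false for manual dispatch
      is_nightly: ${{ github.event_name == 'schedule' }}
---
# Reusable workflow (file name illustrative): accept the input and gate the flag
on:
  workflow_call:
    inputs:
      is_nightly:
        type: boolean
        default: false
env:
  # Nightly runs go online so the HuggingFace cache can be refreshed;
  # manual dispatch runs keep TRANSFORMERS_OFFLINE=1 and use the local cache.
  TRANSFORMERS_OFFLINE: ${{ inputs.is_nightly && '0' || '1' }}
```

Gating on `github.event_name` in the caller (rather than inside the reusable workflow) keeps the reusable workflow usable from contexts where the event name alone does not capture the intent.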
789b1dc to 067833d
@pytorchbot merge -f 'CI only tweak, should be fine'
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…torch#176553) Instead of always using offline mode, we need a way to periodically refresh the local cache. This job runs only once per day, so rate limits shouldn't be an issue. I observed [a failure in trunk](https://github.com/pytorch/pytorch/actions/runs/22625578355/job/65575251092#step:15:13837) for `pytorch/gemma-3-12b-it-int4` where the local cache no longer seems to work; not sure why, it could be due to the recent transformers version update. The failure goes away when I turn off TRANSFORMERS_OFFLINE to refresh the local cache. Pull Request resolved: pytorch#176553 Approved by: https://github.com/zou3519