
[Dependabot] Update(deps): Bump transformers from 4.54.0 to 4.55.2 in /.ci/docker/ci_commit_pins#160807

Closed
dependabot[bot] wants to merge 1 commit into main from
dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.55.2

Conversation


@dependabot dependabot bot commented on behalf of github Aug 16, 2025

Bumps transformers from 4.54.0 to 4.55.2.

Release notes

Sourced from transformers's releases.

Patch release 4.55.2: for FA2 users!

Patch release 4.55.2!

only affects FA2 generations!

😢 Well sorry everyone, sometimes shit can happen... 4.55.1 was broken because of 🥁 a git merge conflict. I cherry-picked huggingface/transformers#40002 without having huggingface/transformers#40029, thus `from ..modeling_flash_attention_utils import prepare_fa_kwargs_from_position_ids` is missing, and since this is a slow test, nothing caught it.

Will work to remediate and write the post-mortem when yanking the release.
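The failure described above (a name missing from a module after an incomplete cherry-pick) is the kind of error a cheap import smoke test can surface before a slow test suite runs. The following is a minimal sketch of that idea, not the transformers project's actual remediation. The helper name `find_broken_imports` is hypothetical, and the module names are stand-ins so the example runs without transformers installed.

```python
import importlib


def find_broken_imports(module_names):
    """Try to import each module and return {name: error} for the ones that fail.

    A missing symbol in a `from x import y` statement raises ImportError at
    import time, so importing the module is enough to detect it.
    """
    failures = {}
    for name in module_names:
        try:
            importlib.import_module(name)
        except ImportError as exc:  # ModuleNotFoundError is a subclass of ImportError
            failures[name] = f"{type(exc).__name__}: {exc}"
    return failures


# Stand-in module names (assumptions for illustration only).
failures = find_broken_imports(["json", "hypothetical_missing_module_xyz"])
for module, error in failures.items():
    print(f"{module} failed to import: {error}")
```

In a real project the list would hold the package's own submodules, so a check like this could run in a fast CI job.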

Patch release 4.55.1:

Mostly focused on stabilizing MXFP4 for the GPT OSS model!

Bug Fixes & Improvements

CI & Build

v4.55.0: New openai GPT OSS model!

Welcome GPT OSS, the new open-source model family from OpenAI!

For more detailed information about this model, we recommend reading the following blogpost: https://huggingface.co/blog/welcome-openai-gpt-oss

GPT OSS is a hugely anticipated open-weights release by OpenAI, designed for powerful reasoning, agentic tasks, and versatile developer use cases. It comprises two models: a big one with 117B parameters (gpt-oss-120b), and a smaller one with 21B parameters (gpt-oss-20b). Both are mixture-of-experts (MoEs) and use a 4-bit quantization scheme (MXFP4), enabling fast inference (thanks to fewer active parameters, see details below) while keeping resource usage low. The large model fits on a single H100 GPU, while the small one runs within 16GB of memory and is perfect for consumer hardware and on-device applications.

Overview of Capabilities and Architecture

  • 21B and 117B total parameters, with 3.6B and 5.1B active parameters, respectively.
  • 4-bit quantization scheme using mxfp4 format. Only applied on the MoE weights. As stated, the 120B fits in a single 80 GB GPU and the 20B fits in a single 16GB GPU.
  • Reasoning, text-only models; with chain-of-thought and adjustable reasoning effort levels.
  • Instruction following and tool use support.
  • Inference implementations using transformers, vLLM, llama.cpp, and ollama.
  • Responses API is recommended for inference.
  • License: Apache 2.0, with a small complementary use policy.
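The memory claims above can be sanity-checked with back-of-the-envelope arithmetic. This sketch assumes MXFP4 stores each weight as a 4-bit element plus one 8-bit scale shared by a block of 32 elements (about 4.25 bits per weight); that block layout is an assumption not stated in the release notes. It also treats every parameter as quantized, whereas the notes say only the MoE weights are, and it ignores activations and KV cache, so the real footprint is somewhat higher than these figures.

```python
# Assumed MXFP4 layout: 4-bit elements with one 8-bit shared scale per 32 elements.
MXFP4_BITS_PER_WEIGHT = 4 + 8 / 32  # 4.25 bits


def approx_weight_gb(total_params, bits_per_weight=MXFP4_BITS_PER_WEIGHT):
    """Rough weight storage in GB (1 GB = 1e9 bytes) at the given bit width."""
    return total_params * bits_per_weight / 8 / 1e9


# Parameter counts are taken from the release notes above.
large_gb = approx_weight_gb(117e9)  # gpt-oss-120b
small_gb = approx_weight_gb(21e9)   # gpt-oss-20b

print(f"gpt-oss-120b: ~{large_gb:.1f} GB of weights (80 GB GPU)")
print(f"gpt-oss-20b:  ~{small_gb:.1f} GB of weights (16 GB GPU)")
```

Under these assumptions both estimates land below the stated GPU capacities, which is consistent with the "fits in a single 80 GB GPU" and "fits in a single 16GB GPU" statements.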

... (truncated)

Commits

Dependabot compatibility score

You can trigger a rebase of this PR by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

cc @seemethere @malfet @pytorch/pytorch-dev-infra @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

Note
Automatic rebases have been disabled on this pull request as it has been open for over 30 days.

@dependabot dependabot bot added dependencies Pull requests that update a dependency file module: ci Related to continuous integration module: inductor open source python Pull requests that update Python code topic: not user facing topic category labels Aug 16, 2025
@dependabot dependabot bot requested a review from jeffdaily as a code owner August 16, 2025 05:57

pytorch-bot bot commented Aug 16, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160807

Note: Links to docs will display an error until the docs builds have been completed.

❌ 9 New Failures, 9 Unrelated Failures

As of commit 7bdf8ce with merge base a9fabeb (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@zou3519 zou3519 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Aug 19, 2025
pytorchmergebot pushed a commit that referenced this pull request Aug 19, 2025
After #160635, I can see dependabot creating the PR to bump the `transformers` version at #160807. This is a good start, but there are several tweaks we need:

1. Run inductor tests on the PR, including one round of perf benchmark, which is always needed. So, we need the `ciflow/inductor` label and a `pull_request` trigger for the benchmark.
2. Per @anijain2305's feedback, we don't need to update the patch version. So, I added a rule to ignore it. Again, we would need to test this out after this lands.
Pull Request resolved: #160935
Approved by: https://github.com/anijain2305
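To make the intent of the "ignore patch version" rule concrete, here is a small sketch of how a version bump can be classified by semver component. It illustrates the idea only and is neither Dependabot's implementation nor the configuration added in #160935. The helper names are hypothetical, and it assumes plain `MAJOR.MINOR.PATCH` version strings.

```python
def parse_version(version):
    """Split a plain 'MAJOR.MINOR.PATCH' string into a tuple of ints."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def classify_update(old, new):
    """Return 'major', 'minor', 'patch', or 'none' for an old -> new bump."""
    old_v, new_v = parse_version(old), parse_version(new)
    for label, old_part, new_part in zip(("major", "minor", "patch"), old_v, new_v):
        if old_part != new_part:
            return label
    return "none"


def should_open_pr(old, new, ignored=("patch",)):
    """Hypothetical policy: skip bumps whose update type is in `ignored`."""
    kind = classify_update(old, new)
    return kind != "none" and kind not in ignored


# The bump in this PR changes the minor component, so it would still get a PR.
print(classify_update("4.54.0", "4.55.2"), should_open_pr("4.54.0", "4.55.2"))
# A patch-only bump would be skipped under this policy.
print(classify_update("4.55.1", "4.55.2"), should_open_pr("4.55.1", "4.55.2"))
```

The first case matches the `update-type: version-update:semver-minor` metadata recorded in this PR's commit message.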

huydhn commented Aug 20, 2025

@pytorchbot rebase

@huydhn huydhn requested review from anijain2305 and huydhn August 20, 2025 17:52
@pytorchmergebot
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

Bumps [transformers](https://github.com/huggingface/transformers) from 4.54.0 to 4.55.2.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](huggingface/transformers@v4.54.0...v4.55.2)

---
updated-dependencies:
- dependency-name: transformers
  dependency-version: 4.55.2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
@pytorchmergebot
Collaborator

Successfully rebased dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.55.2 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.55.2 && git pull --rebase)

@pytorchmergebot pytorchmergebot force-pushed the dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.55.2 branch from 4896161 to 7bdf8ce Compare August 20, 2025 17:57
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
After pytorch#160635, I can see dependabot creating the PR to bump the `transformers` version at pytorch#160807. This is a good start, but there are several tweaks we need:

1. Run inductor tests on the PR, including one round of perf benchmark, which is always needed. So, we need the `ciflow/inductor` label and a `pull_request` trigger for the benchmark.
2. Per @anijain2305's feedback, we don't need to update the patch version. So, I added a rule to ignore it. Again, we would need to test this out after this lands.
Pull Request resolved: pytorch#160935
Approved by: https://github.com/anijain2305

huydhn commented Sep 19, 2025

Closing in favor of #162063

@huydhn huydhn closed this Sep 19, 2025

dependabot bot commented on behalf of github Sep 19, 2025

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.

@dependabot dependabot bot deleted the dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.55.2 branch September 19, 2025 17:59

Labels

ciflow/inductor ciflow/inductor-periodic dependencies Pull requests that update a dependency file module: ci Related to continuous integration module: inductor open source python Pull requests that update Python code topic: not user facing topic category triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module


3 participants