[cutlass-4][take 2] upgrade to cutlass 4.2.1 #164159

Closed
henrylhtsang wants to merge 1 commit into pytorch:main from henrylhtsang:export-D83492704

@henrylhtsang
Contributor

@henrylhtsang henrylhtsang commented Sep 29, 2025

@pytorch-bot

pytorch-bot bot commented Sep 29, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/164159

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit aa16e5e with merge base 566ea4e:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot
Contributor

@henrylhtsang has exported this pull request. If you are a Meta employee, you can view the originating diff in D83492704.

pytorch-bot bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Sep 29, 2025
@@ -421,7 +421,7 @@ def get_max_alignment(inductor_layout: Layout) -> int:
offset = inductor_layout.offset

def is_static_int(number):
Collaborator

Suggested change
- def is_static_int(number):
+ def is_static_int(number: object) -> Type[int | sympy.Integer]:

Otherwise this will hide the type information of the isinstance call...

@@ -234,7 +234,6 @@ def _get_arg_from_node(
# node's memory, a stride tuple, the datatype
# Once again, need to check for local class type for stride tuple
if str(arg_ty) in {
Collaborator

This should be an equality check now
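
To illustrate the point: once a set literal shrinks to a single entry, `in {...}` and `==` give the same answer, but `==` states the intent directly. A minimal sketch with a hypothetical type name (the actual string compared in `_get_arg_from_node` is not shown in the hunk):

```python
# Hypothetical name, for illustration only; not the string used in the PR.
EXPECTED_TYPE_NAME = "<class 'example.StrideTuple'>"


def matches_with_set(arg_ty_name: str) -> bool:
    # Membership test against a one-element set: correct, but it reads as if
    # several alternatives were still possible.
    return arg_ty_name in {EXPECTED_TYPE_NAME}


def matches_with_equality(arg_ty_name: str) -> bool:
    # Equivalent check once only one candidate remains.
    return arg_ty_name == EXPECTED_TYPE_NAME
```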

@@ -173,7 +174,6 @@ def _render_argument_type(
# Fragile, but this is the only way to guarantee t is expected type because t is a local class
def is_nested_visitor_type(t: type) -> bool:
    return ".".join([t.__module__, t.__qualname__]) in {
Collaborator

same here...
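
For context on why the code compares qualified names instead of using `isinstance`: a class defined inside a function cannot be imported by callers, and the class statement runs again on every call, producing a new class object each time. A minimal sketch with hypothetical names (`make_visitor_type`, `Visitor`), not taken from CUTLASS:

```python
def make_visitor_type() -> type:
    # Each call executes the class statement again and creates a new class object.
    class Visitor:
        pass

    return Visitor


def qualified_name(t: type) -> str:
    # Same construction as in the hunk above: module plus qualified name.
    return ".".join([t.__module__, t.__qualname__])


first = make_visitor_type()
second = make_visitor_type()

# Two distinct class objects, so an identity or isinstance check across calls fails...
same_object = first is second
# ...but both carry the same qualified name, which ends in
# "make_visitor_type.<locals>.Visitor".
same_name = qualified_name(first) == qualified_name(second)
```

The name comparison is fragile because any rename or restructuring upstream silently changes the string, which matches the comment in the hunk.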

Comment on lines +40 to 42
    import cutlass_cppgen as python_cutlass # type: ignore[import-untyped, import-not-found] # noqa: F401
else:
    import cutlass_cppgen as python_cutlass # type: ignore[import-untyped, import-not-found] # noqa: F401
Collaborator

Suggested change
-     import cutlass_cppgen as python_cutlass # type: ignore[import-untyped, import-not-found] # noqa: F401
- else:
-     import cutlass_cppgen as python_cutlass # type: ignore[import-untyped, import-not-found] # noqa: F401
+ import cutlass_cppgen as python_cutlass # type: ignore[import-untyped, import-not-found] # noqa: F401

@henrylhtsang
Contributor Author

@Skylion007 unfortunately vllm side is very tricky. vllm-project/vllm#24673

@Skylion007
Collaborator

> @Skylion007 unfortunately vllm side is very tricky. vllm-project/vllm#24673

Hmm, the other PR is currently merging though and missing some of these other import changes? So CUTLASS inductor will be broken after the other PR lands? Or are you saying VLLM will need to use an old version of CUTLASS temporarily?

@henrylhtsang
Contributor Author

> > @Skylion007 unfortunately vllm side is very tricky. vllm-project/vllm#24673
>
> Hmm, the other PR is currently merging though and missing some of these other import changes? So CUTLASS inductor will be broken after the other PR lands? Or are you saying VLLM will need to use an old version of CUTLASS temporarily?

I think there are two upgrades going on:

Skylion007 added the topic: not user facing and release notes: cuda labels and removed the topic: not user facing label Sep 29, 2025
Summary: Pull Request resolved: pytorch#164159

Test Plan: Sandcastle

Differential Revision: D83492704
@facebook-github-bot
Contributor

@henrylhtsang has exported this pull request. If you are a Meta employee, you can view the originating Diff in D83492704.

@facebook-github-bot
Contributor

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@pytorchmergebot
Collaborator

Merge failed

Reason: 2 jobs have failed, first few of them are: inductor / unit-test / inductor-halide-build / build, inductor / unit-test / inductor-cpu-build / build

Details for Dev Infra team Raised by workflow job

@henrylhtsang
Contributor Author

@pytorchbot merge -i

@pytorchmergebot
Collaborator

Merge started

Your change will be merged while ignoring the following 2 checks: inductor / unit-test / inductor-halide-build / build, inductor / unit-test / inductor-cpu-build / build

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

pytorchmergebot pushed a commit that referenced this pull request Oct 6, 2025
…4589)

Differential Revision: D83809105

Handle reviews from #164159

Pull Request resolved: #164589
Approved by: https://github.com/Skylion007
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Oct 21, 2025
Test Plan: Sandcastle

Differential Revision: D83492704

Pull Request resolved: pytorch#164159
Approved by: https://github.com/Skylion007, https://github.com/mlazos
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Oct 21, 2025
5 participants