fix: replace add_identity by add_cast for type cast #3563
peri044 merged 2 commits into pytorch:main from
Conversation
Thanks @junstar92 for the contribution. Instead of modifying the FX path, we should import these utilities from the dynamo path, since that is what is actively being developed. Can you modify this change so that prepend_ones is imported from dynamo/conversion/converter_utils instead?

from torch_tensorrt.dynamo.conversion.converter_utils import (
    has_dynamic_shape,
    prepend_ones,
    set_layer_name,
)
@junstar92 Thanks for your contribution! As @peri044 mentioned, we have switched our attention to the Dynamo path. In this PR, instead of importing from fx, can you change
TensorRT/py/torch_tensorrt/dynamo/conversion/impl/slice/ops.py
Lines 26 to 30 in f09be72
to
from torch_tensorrt.dynamo.conversion.converter_utils import (
has_dynamic_shape,
prepend_ones,
set_layer_name,
)
and change the usages to ctx accordingly?
Besides, I noticed that you are using from torch.export._trace import _export instead of from torch.export import export in your repro. May I know the reason?
LGTM apart from the changes mentioned above.
@peri044 @zewenli98 Thanks for the suggestions. As you mentioned, I changed fx's conversion utilities to dynamo's.
There's no special reason, it's just how I've been doing it. |
-    layer_i = network.add_identity(input)
-    layer_i.set_output_type(0, cast_type)
+    layer_i = network.add_cast(input, cast_type)
     set_layer_name(layer_i, target, f"{name}_dtype_change")
Thanks for the quick change @junstar92. LGTM as such. Just a minor change: since we now use cast_trt_tensor in py/torch_tensorrt/dynamo/conversion/converter_utils.py and the above change is related to it, you could change the comment there from
Adds an Identity layer to the network which performs the conversion
if the input's dtype is different from the cast type. Otherwise returns
input unchanged
to something like
Adds a Cast layer to the network to convert the input tensor to the specified dtype.
If the input tensor already has the desired dtype, it is returned unchanged.
Otherwise, a Cast layer is added to perform the conversion
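The early-return behaviour that the reworded docstring describes can be sketched in plain Python; cast_sketch below is a made-up illustration (Python values and constructors stand in for TRT tensors and Cast layers), not the actual cast_trt_tensor implementation:

```python
def cast_sketch(value, cast_type):
    """Illustrative stand-in for the cast_trt_tensor behaviour:
    return the input unchanged if it already has the desired type,
    otherwise perform the conversion (a Cast layer in the real code)."""
    if type(value) is cast_type:
        return value  # already the desired dtype: nothing added to the network
    return cast_type(value)  # stand-in for adding an explicit cast layer
```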
Thanks for the feedback. I updated the comment for cast_trt_tensor as you mentioned.
peri044 left a comment:
One minor comment; mostly looks good.
| """ | ||
| layer_i = network.add_identity(input) | ||
| layer_i.set_output_type(0, cast_type) | ||
| layer_i = network.add_cast(input, cast_type) |
Can you use the cast_trt_tensor function for this instead?
This is a patch for FX, but it looks like cast_trt_tensor is only in dynamo?
@peri044 As @zewenli98 mentioned, cast_trt_tensor is in the Dynamo path, so the FX path would need to import dynamo.conversion.converter_utils. Is this what you intended? If not, would you prefer me to implement cast_trt_tensor just like in the Dynamo path and use it instead of type_cast?
No, the subpackages should remain as distinct as possible. IMO this implementation is fine for FX as essentially all development is on the dynamo side now.
Also, @junstar92 please rebase with main. Some of the CI failures should be resolved.
Description
This PR updates the type_cast helper function to ensure compatibility with TensorRT's strongly typed network mode. type_cast used add_identity() followed by set_output_type() to perform the data type cast. However, in strongly typed mode, calling set_output_type() on the identity layer causes the error below. type_cast is called by the expand function in torch_tensorrt/dynamo/conversion/impl/slice/ops.py with a dynamic dimension index.

TensorRT/py/torch_tensorrt/dynamo/conversion/impl/slice/ops.py
Lines 232 to 237 in f09be72
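To make the old-vs-new call patterns concrete without constructing real TensorRT objects, here is a minimal mock; FakeNetwork, FakeLayer, and both type_cast variants are invented stand-ins that only mirror the shape of the converter calls, not the real tensorrt API:

```python
# Mock stand-ins for TensorRT objects, used only to contrast the two call patterns.
class FakeLayer:
    def __init__(self, kind):
        self.kind = kind
        self.output_type = None

    def set_output_type(self, index, dtype):
        # In TensorRT's strongly typed mode, this call on an identity
        # layer is what triggers the reported error.
        self.output_type = dtype

    def get_output(self, index):
        return f"{self.kind}-output"


class FakeNetwork:
    def add_identity(self, tensor):
        return FakeLayer("identity")

    def add_cast(self, tensor, dtype):
        layer = FakeLayer("cast")
        layer.output_type = dtype
        return layer


def old_type_cast(network, tensor, cast_type):
    # Previous approach: identity layer + set_output_type on its output.
    layer = network.add_identity(tensor)
    layer.set_output_type(0, cast_type)
    return layer.get_output(0)


def new_type_cast(network, tensor, cast_type):
    # New approach: an explicit cast layer that declares the conversion up front.
    layer = network.add_cast(tensor, cast_type)
    return layer.get_output(0)
```

Both variants return the casted output tensor; the difference is that add_cast states the target dtype at layer-construction time, which is what strongly typed networks require.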
The following code snippet reproduces the error:
To address this, the function now uses add_cast() to explicitly insert a cast layer that converts the input tensor to the desired cast_type.

If there was a specific reason for using add_identity(), please let me know, as this change assumes that the identity layer was not essential beyond type casting.

Type of change
Checklist: