better error message in load_state_dict when there are inconsistent tensor sizes #2151

Merged

soumith merged 1 commit into pytorch:master from greaber:master on Jul 19, 2017
Conversation

soumith approved these changes on Jul 19, 2017

soumith (Collaborator): thanks a lot @greaber. Better messages for all :)
xwang233 pushed a commit to xwang233/pytorch that referenced this pull request on Nov 9, 2022
jagadish-amd pushed a commit to jagadish-amd/pytorch that referenced this pull request on May 15, 2025

http://rocm-ci.amd.com/blue/organizations/jenkins/rocm-pytorch-manylinux-wheel-builder/detail/rocm-pytorch-manylinux-wheel-builder/2009/pipeline/131/

`/pytorch/.github/scripts/amd/package_triton_wheel.sh: line 54: syntax error in conditional expression`

Validation: http://rocm-ci.amd.com/job/mainline-pytorch2.6-manylinux-wheels/79/
mergennachin added a commit that referenced this pull request on Mar 4, 2026

…iton kernels

User-defined Triton kernels (via @triton.jit or @triton_op) that take bool tensor arguments produce incorrect results when compiled through AOTI. The root cause is that Triton's mangle_type maps torch.bool tensors to *i1/*u1 (1-bit pointers), but PyTorch stores bool tensors as uint8 (one byte per element). The compiled cubin kernel generates bit-packed loads for *i1/*u1 pointers, reading garbled data from the byte-addressed memory.

Inductor-generated kernels already work around this (Triton issue #2151) by adding .to(tl.int1) after loads and converting to int8 for stores. But user-defined kernels don't get these workarounds, since their code is user-written.

Fix: override *i1/*u1 -> *u8 in the mangle_type signature for user-defined kernels. This makes the compiled kernel use byte-addressed loads matching PyTorch's bool memory layout.
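To make the described workaround concrete, here is a minimal sketch of the load/store pattern the message attributes to Inductor-generated kernels, applied by hand to a user-defined kernel. The kernel, its name, and the uint8-view call site are all hypothetical; the sketch assumes a CUDA device and uses only Triton's public tl.load / tl.store / .to APIs.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def logical_not_kernel(in_ptr, out_ptr, n, BLOCK: tl.constexpr):
    # Hypothetical user-defined kernel illustrating the workaround.
    offs = tl.program_id(0) * BLOCK + tl.arange(0, BLOCK)
    mask = offs < n
    # Byte-addressed load, then reinterpret as a 1-bit boolean -- the
    # ".to(tl.int1) after load" pattern described above.
    x = tl.load(in_ptr + offs, mask=mask).to(tl.int1)
    y = x == 0  # logical NOT
    # Convert back to int8 before storing so the write matches PyTorch's
    # one-byte-per-element bool layout.
    tl.store(out_ptr + offs, y.to(tl.int8), mask=mask)

x = torch.tensor([True, False] * 8, device="cuda")
out = torch.empty_like(x)
# Passing uint8 views makes mangle_type emit *u8 instead of *i1, avoiding
# the bit-packed loads described above (a manual stand-in for the fix).
logical_not_kernel[(1,)](x.view(torch.uint8), out.view(torch.uint8),
                         x.numel(), BLOCK=16)
```

Note that this sketch applies the byte-view fix at the call site; per the message above, the actual fix rewrites the mangled signature inside AOTI so user kernels need no such change.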
I have often gotten the message "inconsistent tensor sizes" from load_state_dict, usually just because I am trying to load a checkpoint saved with a different version of the code or a different configuration. This patch makes it easier to locate the problem.
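For illustration, here is a minimal sketch of the failure mode this patch targets. The model shapes are made up, and the improved message shown in the comments is approximate (its exact wording has varied across PyTorch versions):

```python
import torch.nn as nn

# Hypothetical scenario: the checkpoint was saved from a model built with a
# different output size, so one parameter's shape no longer matches.
saved_model = nn.Linear(10, 20)    # configuration at checkpoint time
current_model = nn.Linear(10, 30)  # current configuration

checkpoint = saved_model.state_dict()

try:
    current_model.load_state_dict(checkpoint)
except RuntimeError as err:
    # Before this patch: a bare "inconsistent tensor sizes", with no hint
    # of which parameter is at fault. With it, the message names the
    # parameter and both shapes, along the lines of:
    #   While copying the parameter named weight, whose dimensions in the
    #   model are torch.Size([30, 10]) and whose dimensions in the
    #   checkpoint are torch.Size([20, 10]), ...
    print(err)
```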