
ONNX export constant folding breaks shared weight deduplication #108342

@fxmarty

Description


🐛 Describe the bug

Hi, it appears that using do_constant_folding=True in the ONNX export undoes some weight deduplication. For example, an nn.Linear weight will go from

[image]

&

[image]

to

[image]

effectively transposing the weight. Given that DeduplicateInitializersByDataPtr relies on the tensor size, the deduplication pass will fail when the shared weight has a different size (e.g. an embedding weight).

It seems to me that the initializer deduplication should happen before constant folding, and that constant folding should be applied only to non-shared weights.
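To illustrate the failure mode, here is a minimal sketch (not from the issue; names and the helper are hypothetical) of a dedup pass keyed on storage pointer and byte size, in the spirit of what DeduplicateInitializersByDataPtr checks. A tied weight shares storage and is deduplicated; a constant-folded, materialized transpose lives in new storage and is missed:

```python
import torch

def dedup_by_data_ptr(named_tensors):
    """Keep only the first initializer seen for each (data_ptr, nbytes) pair."""
    seen = set()
    kept = []
    for name, t in named_tensors:
        key = (t.data_ptr(), t.untyped_storage().nbytes())
        if key not in seen:
            seen.add(key)
            kept.append(name)
    return kept

w = torch.randn(4, 8)
tied = w                      # shared storage: same data_ptr as w
folded = w.t().contiguous()   # what constant folding materializes: new storage

# Tied weight is deduplicated; the folded copy is kept as a duplicate.
print(dedup_by_data_ptr([("emb.weight", w), ("lm_head.weight", tied)]))
print(dedup_by_data_ptr([("emb.weight", w), ("lm_head.weight", folded)]))
```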

WDYT @justinchuby @BowenBao?

Thank you!

Repro:

```
pip install optimum
optimum-cli export onnx -m bigscience/bloom-560m bloom_onnx --no-post-process
```

and inspect the output with Netron.

Versions

Reproduced on both PyTorch 2.0.1 and nightly.

Metadata

Assignees: no one assigned

Labels: module: onnx (Related to torch.onnx), triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Status: Reopened
