[BUG] test-electra.py failing with improper matmul shapes when num gpus > 1 

**Describe the bug**
test-electra.py fails with following error 

  File "/home/deepspeed/data/DeepSpeed/deepspeed/module_inject/layers.py", line 42, in forward
    output = torch.matmul(input, self.weight.transpose(-1, -2))
RuntimeError: mat1 and mat2 shapes cannot be multiplied (18x128 and 256x128)

**To Reproduce**
Steps to reproduce the behavior:
1. Inference Script : https://github.com/microsoft/DeepSpeedExamples/blob/master/inference/huggingface/test-electra.py
2. Packages: Deepspeed from master , ff427438, torch 1.12, cuda 11.6, transformers 4.21.2
3. deepspeed --num_gpus 2 test-electra.py

**Expected behavior**
<img width="1888" alt="image" src="https://user-images.githubusercontent.com/112720551/193950479-9500cc06-467e-4a00-95b5-2e9a54b53be2.png">

**ds_report output**
<img width="1043" alt="image" src="https://user-images.githubusercontent.com/112720551/193950990-19860b22-80e0-4bdb-9a91-8d7c3a0faf8b.png">

**Screenshots**
If applicable, add screenshots to help explain your problem.

**System info (please complete the following information):**
 - OS: Ubuntu 20.04.5 LTS
 - GPU count and types: 2x RTX A6000
 - Python version : Python 3.8.10

**Additional context**
This test does not fail with deepspeed 0.7.3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BUG] test-electra.py failing with improper matmul shapes when num gpus > 1 #2386

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[BUG] test-electra.py failing with improper matmul shapes when num gpus > 1 #2386

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions