remove outdated Float8Linear workarounds by vkuzo · Pull Request #2595 · pytorch/ao

vkuzo · 2025-07-24T13:51:32Z

Summary:

These workarounds are no longer needed after
#2356 and the corresponding
improvements in PyTorch core.

Test Plan:

torchtitan bench on llama 3 8b on 8 H100s:

before

rowwise
Median Tokens/Second (excluding step 1): 7013.0
Max Memory Usage: 37.19 GiB

gw_hp
Median Tokens/Second (excluding step 1): 7232.0
Max Memory Usage: 37.13 GiB

after

rowwise
Median Tokens/Second (excluding step 1): 6984.5
Max Memory Usage: 37.19 GiB

gw_hp
Median Tokens/Second (excluding step 1): 7319.5
Max Memory Usage: 37.13 GiB

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

vkuzo · 2025-07-24T13:51:33Z

Stack from ghstack (oldest at bottom):

-> remove outdated Float8Linear workarounds #2595

pytorch-bot · 2025-07-24T13:51:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2595

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: These workarounds are no longer needed after #2356 and the corresponding improvements in PyTorch core. Test Plan: torchtitan bench on llama 3 8b on 8 H100s: before rowwise Median Tokens/Second (excluding step 1): 7013.0 Max Memory Usage: 37.19 GiB gw_hp Median Tokens/Second (excluding step 1): 7232.0 Max Memory Usage: 37.13 GiB after rowwise Median Tokens/Second (excluding step 1): 6984.5 Max Memory Usage: 37.19 GiB gw_hp Median Tokens/Second (excluding step 1): 7319.5 Max Memory Usage: 37.13 GiB Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: ae11ea7 ghstack-comment-id: 3113561383 Pull Request resolved: #2595

[ghstack-poisoned]

Summary: These workarounds are no longer needed after #2356 and the corresponding improvements in PyTorch core. Test Plan: torchtitan bench on llama 3 8b on 8 H100s: before rowwise Median Tokens/Second (excluding step 1): 7013.0 Max Memory Usage: 37.19 GiB gw_hp Median Tokens/Second (excluding step 1): 7232.0 Max Memory Usage: 37.13 GiB after rowwise Median Tokens/Second (excluding step 1): 6984.5 Max Memory Usage: 37.19 GiB gw_hp Median Tokens/Second (excluding step 1): 7319.5 Max Memory Usage: 37.13 GiB Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: ae11ea7 ghstack-comment-id: 3113561383 Pull Request resolved: #2595

* Update [ghstack-poisoned] * Update [ghstack-poisoned]

vkuzo added 2 commits July 24, 2025 06:14

Update

1c1ad9c

[ghstack-poisoned]

Update

3fd7a06

[ghstack-poisoned]

vkuzo mentioned this pull request Jul 24, 2025

simplify Float8Linear #2594

Merged

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 24, 2025

vkuzo requested review from danielvegamyhre and drisspg July 24, 2025 13:52

vkuzo added the module: not user facing Use this tag if you don't want this PR to show up in release notes label Jul 24, 2025

danielvegamyhre approved these changes Jul 24, 2025

View reviewed changes

Update

a9c2c14

[ghstack-poisoned]

vkuzo changed the base branch from gh/vkuzo/94/head to main July 24, 2025 16:27

vkuzo merged commit f5b5567 into main Jul 24, 2025
21 of 24 checks passed

liangel-02 pushed a commit that referenced this pull request Aug 25, 2025

remove outdated Float8Linear workarounds (#2595)

1bcb03b

* Update [ghstack-poisoned] * Update [ghstack-poisoned]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove outdated Float8Linear workarounds#2595

remove outdated Float8Linear workarounds#2595
vkuzo merged 3 commits into
mainfrom
gh/vkuzo/95/head

vkuzo commented Jul 24, 2025

Uh oh!

vkuzo commented Jul 24, 2025 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Jul 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

vkuzo commented Jul 24, 2025

Uh oh!

vkuzo commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2595

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vkuzo commented Jul 24, 2025 •

edited

Loading

pytorch-bot Bot commented Jul 24, 2025 •

edited

Loading