simplify Float8Linear by vkuzo · Pull Request #2594 · pytorch/ao

vkuzo · 2025-07-24T13:14:51Z

Summary:

Removing code which we no longer need after
#2356 . torch.compile now does the
right thing automatically, and the relevant config has been deprecated.

Also fix links in float8 training benchmark README.md to point to
updated locations.

Test Plan:

./test/float8/test_everything.sh

// before
with-proxy TORCHTITAN_ROOT=~/local/torchtitan/ FLOAT8_RECIPE_WITH_BEST_SETTINGS="tensorwise" ./torchtitan_benchmark.sh
...
Median Tokens/Second (excluding step 1): 7999.0
Max Memory Usage: 36.68 GiB

// after
with-proxy TORCHTITAN_ROOT=~/local/torchtitan/ FLOAT8_RECIPE_WITH_BEST_SETTINGS="tensorwise" ./torchtitan_benchmark.sh
...
Median Tokens/Second (excluding step 1): 8038.5
Max Memory Usage: 36.68 GiB

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

vkuzo · 2025-07-24T13:14:52Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-07-24T13:14:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2594

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1c1ad9c with merge base 12ff479 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: Removing code which we no longer need after #2356 . `torch.compile` now does the right thing automatically, and the relevant config has been deprecated. Also fix links in float8 training benchmark README.md to point to updated locations. Test Plan: ``` ./test/float8/test_everything.sh ``` ``` // before with-proxy TORCHTITAN_ROOT=~/local/torchtitan/ FLOAT8_RECIPE_WITH_BEST_SETTINGS="tensorwise" ./torchtitan_benchmark.sh ... Median Tokens/Second (excluding step 1): 7999.0 Max Memory Usage: 36.68 GiB // after with-proxy TORCHTITAN_ROOT=~/local/torchtitan/ FLOAT8_RECIPE_WITH_BEST_SETTINGS="tensorwise" ./torchtitan_benchmark.sh ... Median Tokens/Second (excluding step 1): 8038.5 Max Memory Usage: 36.68 GiB ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 1211464 ghstack-comment-id: 3113439246 Pull Request resolved: #2594

danielvegamyhre · 2025-07-24T15:24:58Z

-   - bf16 + compile: `TORCHTITAN_ROOT=<path> ./float8_training_benchmark.sh`
-   - float8 tensorwise with float8 all-gather + compile: `TORCHTITAN_ROOT=<path> FLOAT8_RECIPE_WITH_BEST_SETTINGS="tensorwise" ./float8_training_benchmark.sh`
-   - float8 rowwise with bf16 all-gather + compile: `TORCHTITAN_ROOT=<path> FLOAT8_RECIPE_WITH_BEST_SETTINGS="rowwise" ./float8_training_benchmark.sh`
+3. From the `torchao/benchmarks/float8/training/` directory, you can run the following commands to reproduce the benchmarks above:


I think this should be ao/benchmarks/float8/training/ (repo is named ao and benchmarks dir is in the repo root dir). Same for other places in this PR

Update [ghstack-poisoned]

Update

1c1ad9c

[ghstack-poisoned]

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 24, 2025

vkuzo requested review from danielvegamyhre and drisspg July 24, 2025 13:15

vkuzo added the module: not user facing Use this tag if you don't want this PR to show up in release notes label Jul 24, 2025

vkuzo mentioned this pull request Jul 24, 2025

remove outdated Float8Linear workarounds #2595

Merged

danielvegamyhre approved these changes Jul 24, 2025

View reviewed changes

vkuzo merged commit c6de9b4 into main Jul 24, 2025
53 of 55 checks passed

liangel-02 pushed a commit that referenced this pull request Aug 25, 2025

simplify Float8Linear (#2594)

a1b9032

Update [ghstack-poisoned]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simplify Float8Linear#2594

simplify Float8Linear#2594
vkuzo merged 1 commit into
mainfrom
gh/vkuzo/94/head

vkuzo commented Jul 24, 2025

Uh oh!

vkuzo commented Jul 24, 2025 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Jul 24, 2025 •

edited

Loading

Uh oh!

danielvegamyhre Jul 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

vkuzo commented Jul 24, 2025

Uh oh!

vkuzo commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2594

✅ No Failures

Uh oh!

danielvegamyhre Jul 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vkuzo commented Jul 24, 2025 •

edited

Loading

pytorch-bot Bot commented Jul 24, 2025 •

edited

Loading