Support FP8 grouped GEMM with rowwise scailing by jiawenliu64 · Pull Request #3560 · pytorch/FBGEMM

jiawenliu64 · 2025-01-10T18:15:03Z

Summary: This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Differential Revision: D67806685

facebook-github-bot · 2025-01-10T18:15:21Z

This pull request was exported from Phabricator. Differential Revision: D67806685

netlify · 2025-01-10T18:15:21Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`0392abe`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/679fb330704fc60008cb5dda
😎 Deploy Preview	https://deploy-preview-3560--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Summary: X-link: facebookresearch/FBGEMM#646 This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance Reviewed By: jwfromm Differential Revision: D67806685

facebook-github-bot · 2025-01-10T18:25:24Z

This pull request was exported from Phabricator. Differential Revision: D67806685

Summary: Pull Request resolved: pytorch#3560 X-link: facebookresearch/FBGEMM#646 This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance Differential Revision: D67806685 Reviewed By: jwfromm

Summary: X-link: facebookresearch/FBGEMM#646 This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance Reviewed By: jwfromm Differential Revision: D67806685

facebook-github-bot · 2025-02-02T18:03:32Z

This pull request was exported from Phabricator. Differential Revision: D67806685

Summary: Pull Request resolved: pytorch#3560 X-link: facebookresearch/FBGEMM#646 This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance Differential Revision: D67806685 Reviewed By: jwfromm

facebook-github-bot · 2025-02-02T22:43:30Z

This pull request has been merged in a5a72c3.

Summary: Pull Request resolved: pytorch#3560 X-link: https://github.com/facebookresearch/FBGEMM/pull/646 This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance Reviewed By: jwfromm Differential Revision: D67806685 fbshipit-source-id: 631136c1d119f0869ab3a6e3c0c5299f83039ffc

facebook-github-bot added the cla signed label Jan 10, 2025

facebook-github-bot added the fb-exported label Jan 10, 2025

jiawenliu64 force-pushed the export-D67806685 branch from fc5fd15 to 17b656e Compare January 10, 2025 18:25

jiawenliu64 mentioned this pull request Jan 10, 2025

[EVT] Add support for Row/Col broadcast PtrArray NVIDIA/cutlass#2033

Merged

jiawenliu64 force-pushed the export-D67806685 branch from 17b656e to 0392abe Compare February 2, 2025 18:02

facebook-github-bot closed this in a5a72c3 Feb 2, 2025

facebook-github-bot added the Merged label Feb 2, 2025

q10 added feature:fp8 category:new labels Feb 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support FP8 grouped GEMM with rowwise scailing#3560

Support FP8 grouped GEMM with rowwise scailing#3560
jiawenliu64 wants to merge 1 commit intopytorch:mainfrom
jiawenliu64:export-D67806685

jiawenliu64 commented Jan 10, 2025

Uh oh!

facebook-github-bot commented Jan 10, 2025

Uh oh!

netlify bot commented Jan 10, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jan 10, 2025

Uh oh!

facebook-github-bot commented Feb 2, 2025

Uh oh!

facebook-github-bot commented Feb 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jiawenliu64 commented Jan 10, 2025

Uh oh!

facebook-github-bot commented Jan 10, 2025

Uh oh!

netlify bot commented Jan 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Uh oh!

facebook-github-bot commented Jan 10, 2025

Uh oh!

facebook-github-bot commented Feb 2, 2025

Uh oh!

facebook-github-bot commented Feb 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

netlify bot commented Jan 10, 2025 •

edited

Loading