Skip to content

Support FP8 grouped GEMM with rowwise scailing#3560

Closed
jiawenliu64 wants to merge 1 commit intopytorch:mainfrom
jiawenliu64:export-D67806685
Closed

Support FP8 grouped GEMM with rowwise scailing#3560
jiawenliu64 wants to merge 1 commit intopytorch:mainfrom
jiawenliu64:export-D67806685

Conversation

@jiawenliu64
Copy link
Member

Summary: This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Differential Revision: D67806685

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D67806685

@netlify
Copy link

netlify bot commented Jan 10, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 0392abe
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/679fb330704fc60008cb5dda
😎 Deploy Preview https://deploy-preview-3560--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Jan 10, 2025
Summary:

X-link: facebookresearch/FBGEMM#646

This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Reviewed By: jwfromm

Differential Revision: D67806685
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D67806685

jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Jan 23, 2025
Summary:
Pull Request resolved: pytorch#3560

X-link: facebookresearch/FBGEMM#646

This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Differential Revision: D67806685

Reviewed By: jwfromm
jwfromm pushed a commit to jwfromm/FBGEMM that referenced this pull request Jan 26, 2025
Summary:
Pull Request resolved: pytorch#3560

X-link: facebookresearch/FBGEMM#646

This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Differential Revision: D67806685

Reviewed By: jwfromm
Summary:

X-link: facebookresearch/FBGEMM#646

This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Reviewed By: jwfromm

Differential Revision: D67806685
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D67806685

jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Feb 2, 2025
Summary:
Pull Request resolved: pytorch#3560

X-link: facebookresearch/FBGEMM#646

This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Differential Revision: D67806685

Reviewed By: jwfromm
@facebook-github-bot
Copy link
Contributor

This pull request has been merged in a5a72c3.

avbokovoy pushed a commit to ROCm/FBGEMM that referenced this pull request Feb 14, 2025
Summary:
Pull Request resolved: pytorch#3560

X-link: https://github.com/facebookresearch/FBGEMM/pull/646

This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Reviewed By: jwfromm

Differential Revision: D67806685

fbshipit-source-id: 631136c1d119f0869ab3a6e3c0c5299f83039ffc
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants