Tune best performance for FSDP on GB300 for LLAMA31 405B #2118
Open
Labels: area:perf, feature, performance, performance/optimize, tracking
We need to tune LLAMA3 405B FP8-CS performance on GB300. Some of the optimizations to enable:
cc: @erhoo82