[determinism] [feature] DSV4 Determinism Kernel Level Optimization #3538
Open
Labels
Determinism: To track the bugs/issues in deterministic training in Megatron-Bridge.
area:perf: Performance optimizations and benchmarking
feature: New capabilities, enhancements, or enablement work
tracking: Tracking issue for an ongoing project with smaller steps
User problem
Per the DeepSeek-V4 paper, track, implement, and benchmark the features for optimized determinism:
Desired outcome
DeepSeek-V4 training is deterministic with all of the optimized deterministic kernels in place.
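One possible way to check this outcome is to compare per-step losses from two runs bit for bit. The sketch below is only an illustration under stated assumptions: `first_divergence`, `float_bits`, and the loss values are hypothetical and not part of Megatron-Bridge. The third step uses the fact that floating-point addition is not associative, which is a common reason why reductions executed in a different order yield different bits.

```python
import struct


def float_bits(x: float) -> bytes:
    """Return the IEEE-754 float64 bit pattern of x as 8 bytes."""
    return struct.pack("<d", x)


def first_divergence(run_a, run_b):
    """Return the first step where two loss traces differ bitwise, else None."""
    if len(run_a) != len(run_b):
        raise ValueError("loss traces must have the same length")
    for step, (a, b) in enumerate(zip(run_a, run_b)):
        if float_bits(a) != float_bits(b):
            return step
    return None


# Hypothetical per-step losses from two runs with the same seed and config.
# Step 2 sums the same three terms in a different order, standing in for a
# kernel whose reduction order is not fixed.
run_1 = [2.5, 2.25, (0.1 + 0.2) + 0.3]
run_2 = [2.5, 2.25, 0.1 + (0.2 + 0.3)]

print(first_divergence(run_1, run_1))  # identical traces
print(first_divergence(run_1, run_2))  # traces that differ at the last step
```

A bitwise comparison is stricter than a tolerance check such as `math.isclose`, which would not distinguish run-to-run reproducibility from results that are merely close.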
Alternatives considered
No response
Affected area
area:model
Urgency / use case
Blocking current work
Extra context
No response