You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What performance optimizations were added in CUDA 12.9?
I run test_fp8.py in the H20-96G using CUDA 12.8 and CUDA 12.9, the performance was consistent.
DeepGemm commit f85ec64
cuda 12.8:
What performance optimizations were added in CUDA 12.9?
I run test_fp8.py in the H20-96G using CUDA 12.8 and CUDA 12.9, the performance was consistent.
DeepGemm commit f85ec64
cuda 12.8:
cuda12.9: