Use versioned flavor of get driver entrypoint function#1835
Use versioned flavor of get driver entrypoint function#1835ptrendx merged 6 commits intoNVIDIA:mainfrom
Conversation
Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
|
/te-ci |
it was added Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
|
I verified that it works! |
Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
for more information, see https://pre-commit.ci
|
@timmoon10 sorry for the churn, I was looking at how others dealt with this issue and found this issue from cutlass: NVIDIA/cutlass#2079 - since we, just like them, link against libcudart.so.12, the check for CUDA 12.5 during the compilation is not enough and we actually need to dynamically load the symbols. Fortunately, since we already link against libcudart, we don't need to try to find the lib by name (so at least there is that). @flx42 Could you verify this new version? |
|
/te-ci |
Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
Still works fine! |
|
/te-ci |
Description
Fixes the issue with cuStreamGetCtx pointing to cuStreamCtx_v2 in the CUDA 13 drivers.
Summary (mostly) by copilot:
This pull request updates the
transformer_engine/common/util/cuda_driver.cppandtransformer_engine/common/util/cuda_driver.hfiles to enhance compatibility with different CUDA versions. The changes introduce a mechanism to query driver entry points based on the CUDA version, improving flexibility in handling CUDA driver symbols.Enhancements for CUDA version compatibility:
get_symbolfunction intransformer_engine/common/util/cuda_driver.cpp: Refactored the function to support querying driver entry points using either a versioned or non-versioned mechanism. The function now accepts acuda_versionparameter and dynamically resolves the appropriate entry point function (cudaGetDriverEntryPointorcudaGetDriverEntryPointByVersion).get_symbolfunction declaration intransformer_engine/common/util/cuda_driver.h: Added an optionalcuda_versionparameter with a default value of12010(our oldest supported version) to allow backward compatibility while enabling version-specific queries.Type of change
Changes
Please list the changes introduced in this PR:
Checklist: