Support compilation via nvcc in RawKernel #1928

@sjperkins

Currently, RawKernel uses NVRTC to compile device code into cubins. This works well for sources that contain only device code.

There are C++ CUDA libraries such as cub and trove that provide device-, block- and warp-wide primitives (scan, transpose, reduce, etc.) that are useful for CUDA programmers in general.

Unfortunately, it is not currently possible to use these libraries with NVRTC, as it does not automatically include system headers. For example, see:

There's an issue requesting NVRTC compatibility from cub (https://github.com/NVlabs/cub/issues/131) because of these problems, but in the meantime it would be useful to work around this by allowing RawKernel to compile code via nvcc (perhaps using the functionality in install/*.py).
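
To make the request a bit more concrete, here is a rough sketch of what an nvcc-based compile step might look like. This is not existing CuPy code: `build_nvcc_command` and `compile_with_nvcc` are hypothetical helper names, `sm_70` is only a placeholder architecture, and the include directories are whatever the user supplies. It only assumes the standard nvcc options `--cubin`, `-arch`, `-I` and `-o`.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def build_nvcc_command(source_path, output_path, arch="sm_70", include_dirs=()):
    """Return the nvcc argument list that compiles one .cu file to a cubin.

    Hypothetical helper for illustration; not part of CuPy.
    """
    cmd = ["nvcc", "--cubin", f"-arch={arch}", "-o", str(output_path)]
    # nvcc resolves system headers on its own; extra header locations
    # (for example a cub or trove checkout) are passed with -I.
    for directory in include_dirs:
        cmd.append(f"-I{directory}")
    cmd.append(str(source_path))
    return cmd


def compile_with_nvcc(source, arch="sm_70", include_dirs=()):
    """Compile CUDA source text with nvcc and return the cubin as bytes.

    Hypothetical helper for illustration; requires nvcc on PATH.
    """
    if shutil.which("nvcc") is None:
        raise RuntimeError("nvcc was not found on PATH")
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "kernel.cu"
        out = Path(tmp) / "kernel.cubin"
        src.write_text(source)
        # check=True raises CalledProcessError if compilation fails.
        subprocess.run(build_nvcc_command(src, out, arch, include_dirs), check=True)
        return out.read_bytes()
```

The returned bytes would still have to be loaded as a module (for instance through the driver API's `cuModuleLoadData`), and kernels would need `extern "C"` linkage to be looked up by their plain names. How RawKernel would expose any of this is left open here.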
