make: add --device-debug to NVCC debug flags #7542
JohannesGaessler merged 1 commit into ggml-org:master
This improves the compile time, but it generates the following warnings: Not sure what the implications are of overriding the
Branch updated from f735014 to 88e405e.
This needs to be optional. It only helps when looking for bugs in kernels that are slow to compile; otherwise these files do not need to be recompiled when making changes to other parts of the code. However, it makes the CUDA backend so slow that debug builds become unusable for quick iteration in other areas.
Branch updated from 88e405e to e0b2a40.
I thought I had measured a much smaller performance regression when I tested it, but I must have done something wrong, because upon renewed testing I'm getting very similar results. I added a new flag
This PR adds the --device-debug flag to the NVCC compile flags if LLAMA_DEBUG is set. This adds device debugging information but also turns off device code optimization, so the compilation is faster.
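As a rough illustration of the change described above, the conditional could look something like the following GNU make fragment. This is only a sketch: NVCCFLAGS and the show-nvcc-flags target are placeholder names chosen for this example and are not taken from the project's Makefile. Only LLAMA_DEBUG and --device-debug come from the PR description.

```makefile
# Sketch only. NVCCFLAGS is a placeholder variable name for this example;
# the project's Makefile may use a different variable.

# ifdef is true when LLAMA_DEBUG has a non-empty value,
# e.g. when invoked as `make LLAMA_DEBUG=1`.
ifdef LLAMA_DEBUG
    # --device-debug (short form: -G) makes nvcc emit debug information
    # for device code and disables device code optimization.
    NVCCFLAGS += --device-debug
endif

# Hypothetical helper target to inspect the resulting flags.
# The recipe line below must start with a tab character.
.PHONY: show-nvcc-flags
show-nvcc-flags:
	@echo "NVCCFLAGS = $(NVCCFLAGS)"
```

Saved as, say, sketch.mk, the fragment can be checked with `make -f sketch.mk show-nvcc-flags LLAMA_DEBUG=1` and again without LLAMA_DEBUG to compare the flags. Because unoptimized device code runs much slower, a separate opt-in variable for this flag is one way to address the review feedback above.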