cuda: remove linking to cublasLt#14790
Conversation
|
I am pretty sure that cublas depends on cublasLt, and building without it with the |
Let me revert the change to |
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
28d7b3c to
02373a3
Compare
JohannesGaessler
left a comment
There was a problem hiding this comment.
From the CUDA documentation:
The cuBLASLt library is a new lightweight library dedicated to GEneral Matrix-to-matrix Multiply (GEMM) operations with a new flexible API.
We don't use cuBLASLt directly, but I guess they may be mapping the old cuBLAS API to cuBLASLt?
In my understanding (though I’m not certain), cuBLAS might have a mechanism to automatically dispatch or fall back to cuBLASLt depending on the data types, memory layouts, or hardware. |
* origin/master: (49 commits) ci : correct label refactor->refactoring (ggml-org#14832) CUDA: fix quantized KV cache + multiple sequences (ggml-org#14822) tests : add non-cont K,V FA tests memory : handle saving/loading null layers in recurrent memory (ggml-org#14675) ggml: fix loongarch quantize_row_q8_1 error (ggml-org#14827) CANN: weight format to NZ for Ascend310P3 (ggml-org#14407) CUDA: add fused rms norm (ggml-org#14800) ggml : model card yaml tab->2xspace (ggml-org#14819) vulkan: fix rms_norm_mul to handle broadcasting dim0 (ggml-org#14817) llama : add model type detection for rwkv7 7B&14B (ggml-org#14816) imatrix: add option to display importance score statistics for a given imatrix file (ggml-org#12718) Mtmd: add a way to select device for vision encoder (ggml-org#14236) cuda : implement bf16 cpy ops and enable bf16 cont (ggml-org#14763) opencl: remove unreachable `return` (ggml-org#14806) server : allow setting `--reverse-prompt` arg (ggml-org#14799) cuda: remove linking to cublasLt (ggml-org#14790) opencl: fix `im2col` when `KW!=KH` (ggml-org#14803) opencl: add conv2d kernel (ggml-org#14403) sycl: Fix im2col (ggml-org#14797) kleidiai: add support for get_rows (ggml-org#14676) ...
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Make sure to read the contributing guidelines before submitting a PR
The new MUSA SDK now includes
mublasLt(the equivalent ofcublasLt). However, I found thatllama.cppdoesn't use any of thecublasLt*APIs. Therefore, this PR removescublasLtfrom the build link dependencies.