CUDA: use registers instead of smem in topk-moe #16647
JohannesGaessler merged 1 commit into ggml-org:master
Uses the same technique as the Vulkan PR ggml-org#16641. Neat trick!
I am not able to restart the CI tests. The nvidia-cuda test is failing, and I'd rather not push another commit just to trigger the CI again. @ggerganov is this a permission issue, and can it be changed?
If you first click on a job and then on one of the buttons, it is possible to re-run it manually. Since permissions were recently tightened, I don't know whether you can do that yourself, though (I just did).
Yes, I was able to do that earlier as well.
@am17an It seems that approving workflows requires write access, so I just loosened the approval requirement from "all external contributors" to "first-time contributors".
Your workflows should now always run without additional approval. If this becomes too heavy for the CI, or we think it could become a security concern, we might switch back to manual workflow approval for all PRs.
Thanks. It seems the "collaborator" role is not a well-thought-out design by GitHub; my intuition is that a collaborator should be able to run workflows on their own PRs (at the least).