vulkan : support conv-2d with large output size by Acly · Pull Request #17685 · ggml-org/llama.cpp

Acly · 2025-12-02T10:01:48Z

The main goal is to support convolutions with a large output spatial size. Currently the number of NPQ = N*OH*OW blocks is limited by maxComputeWorkGroupCount[1], which is typically 65535 (2^16 - 1). Depending on the block size, this means ~2M / 8M / 16M elements. 2M elements is not a lot and is exceeded by e.g. a 1536x1536 image.
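The arithmetic behind those figures can be checked with a short sketch. It assumes a per-dimension limit of 65535 (the minimum the Vulkan spec guarantees) and NPQ block sizes of 32, 128 and 256, which are inferred from the ~2M / 8M / 16M figures rather than taken from the diff. All names here are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Assumed limit: the Vulkan spec guarantees at least 65535 workgroups per
// dispatch dimension, and many devices report exactly this for y and z.
constexpr uint64_t kMaxGroupsY = 65535;

// Largest NPQ = N*OH*OW that fits when each workgroup covers `block_npq`
// output elements and all NPQ blocks are dispatched along y.
constexpr uint64_t max_npq_single_dim(uint64_t block_npq) {
    return kMaxGroupsY * block_npq;
}

// Number of NPQ blocks needed to cover `npq` elements (ceiling division).
constexpr uint64_t npq_blocks(uint64_t npq, uint64_t block_npq) {
    return (npq + block_npq - 1) / block_npq;
}

// True when the blocks no longer fit into a single dispatch dimension.
inline bool exceeds_single_dim(uint64_t npq, uint64_t block_npq) {
    assert(block_npq > 0);
    return npq_blocks(npq, block_npq) > kMaxGroupsY;
}

// Assumed block sizes 32 / 128 / 256 give the ~2M / 8M / 16M limits.
static_assert(max_npq_single_dim(32)  == 2097120,  "~2M elements");
static_assert(max_npq_single_dim(128) == 8388480,  "~8M elements");
static_assert(max_npq_single_dim(256) == 16776960, "~16M elements");

// N=1, OH=OW=1536: 2359296 elements need 73728 blocks of 32, over the limit.
static_assert(npq_blocks(1536ull * 1536ull, 32) == 73728, "block count");
```

With the smallest assumed block size, `exceeds_single_dim(1536 * 1536, 32)` is true, which matches the 1536x1536 example above.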

Selecting a larger NPQ block size pushes the limit a bit, but even 16M elements doesn't feel comfortable, so I split the workgroups between y & z.
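As a rough illustration of the y/z split, the sketch below spreads a linear block count over two dispatch dimensions and recovers the linear index from a workgroup ID. This is not the code from this PR: `GroupSplit`, `split_blocks` and `linear_block` are hypothetical names, and the balancing strategy is only one reasonable choice.

```cpp
#include <cassert>
#include <cstdint>

struct GroupSplit {
    uint32_t y;  // workgroups along y
    uint32_t z;  // workgroups along z
};

// Spread `total` NPQ blocks over y and z so neither exceeds `max_per_dim`.
inline GroupSplit split_blocks(uint64_t total, uint32_t max_per_dim) {
    assert(total > 0 && max_per_dim > 0);
    // Fewest z slices such that each slice fits into y.
    const uint64_t z = (total + max_per_dim - 1) / max_per_dim;
    assert(z <= max_per_dim);  // otherwise too large even for y * z
    // Balance y across the slices to keep padding workgroups to a minimum.
    const uint64_t y = (total + z - 1) / z;
    return { static_cast<uint32_t>(y), static_cast<uint32_t>(z) };
}

// What a shader invocation would derive from its workgroup ID: the linear
// NPQ block index. Indices >= total are padding and should exit early.
inline uint64_t linear_block(uint32_t wg_y, uint32_t wg_z, const GroupSplit & s) {
    return static_cast<uint64_t>(wg_z) * s.y + wg_y;
}
```

For the 73728 blocks of the 1536x1536 case this yields y = 36864 and z = 2, so every workgroup maps to a valid block. In general y * z can slightly exceed the block count, which is why the padding check is needed.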

I reorganized the code a bit so that pipeline selection no longer depends on computing the workgroup count, which makes it easier to calculate the split without being affected by the block shape.

Also cleaned up some obsolete stuff:

- removed the separate conv2d_transpose_push_constants; it is the same as regular conv2d now
- removed push constants that were changed to spec constants
- removed checks for conv-transpose push constant size

No changes to actual logic apart from the y/z split. Performance looks unchanged.

jeffbolznv

LGTM. I also tested for perf/correctness and it was fine.

0cc4m

Thank you, LGTM

vulkan : support conv-2d with large output size

839d174

Acly requested review from 0cc4m and ggerganov as code owners December 2, 2025 10:01

github-actions bot added the testing, Vulkan, and ggml labels Dec 2, 2025

jeffbolznv approved these changes Dec 2, 2025

0cc4m approved these changes Dec 5, 2025

0cc4m merged commit e15cd06 into ggml-org:master Dec 5, 2025
69 of 74 checks passed

JayZenith pushed a commit to JayZenith/llama.cpp that referenced this pull request Dec 7, 2025

vulkan : support conv-2d with large output size (ggml-org#17685)

47b44fc

gabe-l-hart mentioned this pull request Dec 10, 2025

feat: llama.cpp bump (17f7f4) for SSM performance improvements ollama/ollama#13408

Merged

0Marble pushed a commit to 0Marble/llama.cpp that referenced this pull request Dec 18, 2025

vulkan : support conv-2d with large output size (ggml-org#17685)

59ba6ef

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026

vulkan : support conv-2d with large output size (ggml-org#17685)

01fe915

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

vulkan : support conv-2d with large output size (#17685)

565e4fd

0cc4m merged 1 commit into ggml-org:master from Acly:vulkan-conv2d-workgroup-split
