Right now I have this hacked into koboldcpp.py behind an environment variable, and I don't see any reason why it's available for CUDA but not for Vulkan.