Misc. bug: SYCL: BF16 falling to CPU #20478

@WizardlyBump17

Name and Version

root@dcd9fcec754d:/app# ./llama-cli --version
load_backend: loaded SYCL backend from /app/libggml-sycl.so
load_backend: loaded CPU backend from /app/libggml-cpu-haswell.so
version: 8248 (5f4cdac38)
built with IntelLLVM 2025.2.1 for Linux x86_64

Operating systems

Linux

Which llama.cpp modules do you know to be affected?

No response

Command line

Problem description & steps to reproduce

With the SYCL backend, llama.cpp runs BF16 models on the CPU about 99% of the time. F16 models run fine on the GPU.

First Bad Commit

No response

Relevant log output

Image
