Misc. bug: SYCL: BF16 falling to CPU #20478

### Name and Version

```
root@dcd9fcec754d:/app# ./llama-cli --version
load_backend: loaded SYCL backend from /app/libggml-sycl.so
load_backend: loaded CPU backend from /app/libggml-cpu-haswell.so
version: 8248 (5f4cdac38)
built with IntelLLVM 2025.2.1 for Linux x86_64
```

### Operating systems

Linux

### Which llama.cpp modules do you know to be affected?

_No response_

### Command line

```shell

```

### Problem description & steps to reproduce

With the SYCL backend, llama.cpp uses the CPU 99% of the time when running BF16 models. F16 models work fine on the GPU.

### First Bad Commit

_No response_

### Relevant log output

<img width="2559" height="1079" alt="Image" src="https://github.com/user-attachments/assets/2f4fdab1-5455-4d4c-9949-88127048c329" />

