Eval bug: Context-shift with gemma 4

### Name and Version

llama-server: b8648

### Operating systems

Linux

### GGML backends

CUDA

### Hardware

RTX 5090 32GB vram

### Models

Gemma4-31B-it-Q5-K-M

### Problem description & steps to reproduce

Context shift isn't working when using Gemma-4. kv-quantization with context-shift isn't working aswell. :\
using latest build of llamacpp rn.

### First Bad Commit

_No response_

### Relevant log output

`slot update_slots: id  3 | task 102 | forcing full prompt re-processing due to lack of cache data (likely due to SWA or hybrid/recurrent memory, see https://github.com/ggml-org/llama.cpp/pull/13194#issuecomment-2868343055)`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eval bug: Context-shift with gemma 4 #21379

Name and Version

Operating systems

GGML backends

Hardware

Models

Problem description & steps to reproduce

First Bad Commit

Relevant log output

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Eval bug: Context-shift with gemma 4 #21379

Description

Name and Version

Operating systems

GGML backends

Hardware

Models

Problem description & steps to reproduce

First Bad Commit

Relevant log output

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions