Name and Version
Windows server 2019 machine, pure CPU, offline.
llama version: b8468 (version after b8467, which solved grammar problems)
llama command by PowerShell:
.\llama-server.exe -t 24 --mlock --no-mmap -ngl 0 -m NVIDIA-Nemotron-3-Super-120B-A12B-GGUF\NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q6_K_XL-00001-of-00004.gguf --alias "NVIDIA-Nemotron-120B" --jinja --offline -fa on --prio 3 --min_p 0.05 --temp 0.6 --top-p 0.9 --top_k 30 --ctx-size 1048576 --no-context-shift --host 0.0.0.0 --port 8080 --threads-batch 24 --threads-http 6 --timeout 36000
after called by OpenClaw, many grammar errors displayed in PowerShell windows:
tool-sessions-send-arg-timeoutSeconds ::= "<parameter=" "timeoutSeconds" ">\n" "" number space "\n" tool-sessions-spawn ::= "<function=" "sessions_spawn" ">\n" space tool-sessions-spawn-arg-task (space (tool-sessions-spawn-arg-label | tool-sessions-spawn-arg-runtime | tool-sessions-spawn-arg-agentId | tool-sessions-spawn-arg-resumeSessionId | tool-sessions-spawn-arg-model | tool-sessions-spawn-arg-thinking | tool-sessions-spawn-arg-cwd | tool-sessions-spawn-arg-runTimeoutSeconds | tool-sessions-spawn-arg-timeoutSeconds | tool-sessions-spawn-arg-thread | tool-sessions-spawn-arg-mode | tool-sessions-spawn-arg-cleanup | tool-sessions-spawn-arg-sandbox | tool-sessions-spawn-arg-streamTo | tool-sessions-spawn-arg-attachments | tool-sessions-spawn-arg-attachAs)){0,16} space "\n" tool-sessions-spawn-arg-agentId ::= "<parameter=" "agentId" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-attachAs ::= "<parameter=" "attachAs" ">\n" "" tool-sessions-spawn-arg-attachAs-schema space "\n" tool-sessions-spawn-arg-attachAs-schema ::= "{" space (tool-sessions-spawn-arg-attachAs-schema-mountPath-kv )? "}" space tool-sessions-spawn-arg-attachAs-schema-mountPath-kv ::= ""mountPath"" space ":" space string tool-sessions-spawn-arg-attachments ::= "<parameter=" "attachments" ">\n" "" tool-sessions-spawn-arg-attachments-schema space "\n" tool-sessions-spawn-arg-attachments-schema ::= "[" space (tool-sessions-spawn-arg-attachments-schema-item ("," space tool-sessions-spawn-arg-attachments-schema-item){0,49})? "]" space tool-sessions-spawn-arg-attachments-schema-item ::= "{" space tool-sessions-spawn-arg-attachments-schema-item-name-kv "," space tool-sessions-spawn-arg-attachments-schema-item-content-kv ( "," space ( tool-sessions-spawn-arg-attachments-schema-item-encoding-kv tool-sessions-spawn-arg-attachments-schema-item-encoding-rest | tool-sessions-spawn-arg-attachments-schema-item-mimeType-kv ) )? "}" space tool-sessions-spawn-arg-attachments-schema-item-content-kv ::= ""content"" space ":" space string tool-sessions-spawn-arg-attachments-schema-item-encoding ::= (""utf8"" | ""base64"") space tool-sessions-spawn-arg-attachments-schema-item-encoding-kv ::= ""encoding"" space ":" space tool-sessions-spawn-arg-attachments-schema-item-encoding tool-sessions-spawn-arg-attachments-schema-item-encoding-rest ::= ( "," space tool-sessions-spawn-arg-attachments-schema-item-mimeType-kv )? tool-sessions-spawn-arg-attachments-schema-item-mimeType-kv ::= ""mimeType"" space ":" space string tool-sessions-spawn-arg-attachments-schema-item-name-kv ::= ""name"" space ":" space string tool-sessions-spawn-arg-cleanup ::= "<parameter=" "cleanup" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-cwd ::= "<parameter=" "cwd" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-label ::= "<parameter=" "label" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-mode ::= "<parameter=" "mode" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-model ::= "<parameter=" "model" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-resumeSessionId ::= "<parameter=" "resumeSessionId" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-runTimeoutSeconds ::= "<parameter=" "runTimeoutSeconds" ">\n" "" number space "\n" tool-sessions-spawn-arg-runtime ::= "<parameter=" "runtime" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-sandbox ::= "<parameter=" "sandbox" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-streamTo ::= "<parameter=" "streamTo" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-task ::= "<parameter=" "task" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-thinking ::= "<parameter=" "thinking" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-thread ::= "<parameter=" "thread" ">\n" "" boolean space "\n" tool-sessions-spawn-arg-timeoutSeconds ::= "<parameter=" "timeoutSeconds" ">\n" "" number space "\n" tool-sessions-yield ::= "<function=" "sessions_yield" ">\n" space (space (tool-sessions-yield-arg-message))? space "\n" tool-sessions-yield-arg-message ::= "<parameter=" "message" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-subagents ::= "<function=" "subagents" ">\n" space (space (tool-subagents-arg-action | tool-subagents-arg-target | tool-subagents-arg-message | tool-subagents-arg-recentMinutes)){0,4} space "\n" tool-subagents-arg-action ::= "<parameter=" "action" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-subagents-arg-message ::= "<parameter=" "message" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-subagents-arg-recentMinutes ::= "<parameter=" "recentMinutes" ">\n" "" number space "\n" tool-subagents-arg-target ::= "<parameter=" "target" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-tts ::= "<function=" "tts" ">\n" space tool-tts-arg-text (space (tool-tts-arg-channel))? space "\n" tool-tts-arg-channel ::= "<parameter=" "channel" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-tts-arg-text ::= "<parameter=" "text" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-fetch ::= "<function=" "web_fetch" ">\n" space tool-web-fetch-arg-url (space (tool-web-fetch-arg-extractMode | tool-web-fetch-arg-maxChars)){0,2} space "\n" tool-web-fetch-arg-extractMode ::= "<parameter=" "extractMode" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-fetch-arg-maxChars ::= "<parameter=" "maxChars" ">\n" "" number space "\n" tool-web-fetch-arg-url ::= "<parameter=" "url" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search ::= "<function=" "web_search" ">\n" space tool-web-search-arg-query (space (tool-web-search-arg-count | tool-web-search-arg-country | tool-web-search-arg-language | tool-web-search-arg-freshness | tool-web-search-arg-date-after | tool-web-search-arg-date-before | tool-web-search-arg-search-lang | tool-web-search-arg-ui-lang)){0,8} space "\n" tool-web-search-arg-count ::= "<parameter=" "count" ">\n" "" number space "\n" tool-web-search-arg-country ::= "<parameter=" "country" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-date-after ::= "<parameter=" "date_after" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-date-before ::= "<parameter=" "date_before" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-freshness ::= "<parameter=" "freshness" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-language ::= "<parameter=" "language" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-query ::= "<parameter=" "query" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-search-lang ::= "<parameter=" "search_lang" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-ui-lang ::= "<parameter=" "ui_lang" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-write ::= "<function=" "write" ">\n" space tool-write-arg-content (space (tool-write-arg-path | tool-write-arg-file-path)){0,2} space "\n" tool-write-arg-content ::= "<parameter=" "content" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-write-arg-file-path ::= "<parameter=" "file_path" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-write-arg-path ::= "<parameter=" "path" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" value ::= object | array | string | number | boolean | null failed to parse grammar �[0mslot launch_slot_: id 3 | task -1 | sampler chain: logits -> ?penalties -> ?dry -> ?top-n-sigma -> top-k -> ?typical -> top-p -> min-p -> ?xtc -> temp-ext -> dist slot launch_slot_: id 3 | task 19 | processing task, is_child = 0 slot update_slots: id 3 | task 19 | new prompt, n_ctx_slot = 1048576, n_keep = 0, task.n_tokens = 22602 slot update_slots: id 3 | task 19 | n_tokens = 14336, memory_seq_rm [14336, end) slot update_slots: id 3 | task 19 | prompt processing progress, n_tokens = 16384, batch.n_tokens = 2048, progress = 0.724892 slot update_slots: id 3 | task 19 | n_tokens = 16384, memory_seq_rm [16384, end) slot update_slots: id 3 | task 19 | 8192 tokens since last checkpoint at 8192, creating new checkpoint during processing at position 18432 slot update_slots: id 3 | task 19 | prompt processing progress, n_tokens = 18432, batch.n_tokens = 2048, progress = 0.815503 slot update_slots: id 3 | task 19 | created context checkpoint 2 of 32 (pos_min = 16383, pos_max = 16383, n_tokens = 16384, size = 164.688 MiB)
Operating systems
Windows
Which llama.cpp modules do you know to be affected?
No response
Command line
.\llama-server.exe -t 24 --mlock --no-mmap -ngl 0 -m NVIDIA-Nemotron-3-Super-120B-A12B-GGUF\NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q6_K_XL-00001-of-00004.gguf --alias "NVIDIA-Nemotron-120B" --jinja --offline -fa on --prio 3 --min_p 0.05 --temp 0.6 --top-p 0.9 --top_k 30 --ctx-size 1048576 --no-context-shift --host 0.0.0.0 --port 8080 --threads-batch 24 --threads-http 6 --timeout 36000
Problem description & steps to reproduce
Windows server 2019 machine, pure CPU, offline.
llama version: b8468 (version after b8467, which solved grammar problems)
llama command by PowerShell:
.\llama-server.exe -t 24 --mlock --no-mmap -ngl 0 -m NVIDIA-Nemotron-3-Super-120B-A12B-GGUF\NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q6_K_XL-00001-of-00004.gguf --alias "NVIDIA-Nemotron-120B" --jinja --offline -fa on --prio 3 --min_p 0.05 --temp 0.6 --top-p 0.9 --top_k 30 --ctx-size 1048576 --no-context-shift --host 0.0.0.0 --port 8080 --threads-batch 24 --threads-http 6 --timeout 36000
after called by OpenClaw (use OpenClaw to analyze many pdf document), many grammar errors displayed in PowerShell windows.
First Bad Commit
No response
Relevant log output
Logs
Name and Version
Windows server 2019 machine, pure CPU, offline.
llama version: b8468 (version after b8467, which solved grammar problems)
llama command by PowerShell:
.\llama-server.exe -t 24 --mlock --no-mmap -ngl 0 -m NVIDIA-Nemotron-3-Super-120B-A12B-GGUF\NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q6_K_XL-00001-of-00004.gguf --alias "NVIDIA-Nemotron-120B" --jinja --offline -fa on --prio 3 --min_p 0.05 --temp 0.6 --top-p 0.9 --top_k 30 --ctx-size 1048576 --no-context-shift --host 0.0.0.0 --port 8080 --threads-batch 24 --threads-http 6 --timeout 36000
after called by OpenClaw, many grammar errors displayed in PowerShell windows:
tool-sessions-send-arg-timeoutSeconds ::= "<parameter=" "timeoutSeconds" ">\n" "" number space "\n" tool-sessions-spawn ::= "<function=" "sessions_spawn" ">\n" space tool-sessions-spawn-arg-task (space (tool-sessions-spawn-arg-label | tool-sessions-spawn-arg-runtime | tool-sessions-spawn-arg-agentId | tool-sessions-spawn-arg-resumeSessionId | tool-sessions-spawn-arg-model | tool-sessions-spawn-arg-thinking | tool-sessions-spawn-arg-cwd | tool-sessions-spawn-arg-runTimeoutSeconds | tool-sessions-spawn-arg-timeoutSeconds | tool-sessions-spawn-arg-thread | tool-sessions-spawn-arg-mode | tool-sessions-spawn-arg-cleanup | tool-sessions-spawn-arg-sandbox | tool-sessions-spawn-arg-streamTo | tool-sessions-spawn-arg-attachments | tool-sessions-spawn-arg-attachAs)){0,16} space "\n" tool-sessions-spawn-arg-agentId ::= "<parameter=" "agentId" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-attachAs ::= "<parameter=" "attachAs" ">\n" "" tool-sessions-spawn-arg-attachAs-schema space "\n" tool-sessions-spawn-arg-attachAs-schema ::= "{" space (tool-sessions-spawn-arg-attachAs-schema-mountPath-kv )? "}" space tool-sessions-spawn-arg-attachAs-schema-mountPath-kv ::= ""mountPath"" space ":" space string tool-sessions-spawn-arg-attachments ::= "<parameter=" "attachments" ">\n" "" tool-sessions-spawn-arg-attachments-schema space "\n" tool-sessions-spawn-arg-attachments-schema ::= "[" space (tool-sessions-spawn-arg-attachments-schema-item ("," space tool-sessions-spawn-arg-attachments-schema-item){0,49})? "]" space tool-sessions-spawn-arg-attachments-schema-item ::= "{" space tool-sessions-spawn-arg-attachments-schema-item-name-kv "," space tool-sessions-spawn-arg-attachments-schema-item-content-kv ( "," space ( tool-sessions-spawn-arg-attachments-schema-item-encoding-kv tool-sessions-spawn-arg-attachments-schema-item-encoding-rest | tool-sessions-spawn-arg-attachments-schema-item-mimeType-kv ) )? "}" space tool-sessions-spawn-arg-attachments-schema-item-content-kv ::= ""content"" space ":" space string tool-sessions-spawn-arg-attachments-schema-item-encoding ::= (""utf8"" | ""base64"") space tool-sessions-spawn-arg-attachments-schema-item-encoding-kv ::= ""encoding"" space ":" space tool-sessions-spawn-arg-attachments-schema-item-encoding tool-sessions-spawn-arg-attachments-schema-item-encoding-rest ::= ( "," space tool-sessions-spawn-arg-attachments-schema-item-mimeType-kv )? tool-sessions-spawn-arg-attachments-schema-item-mimeType-kv ::= ""mimeType"" space ":" space string tool-sessions-spawn-arg-attachments-schema-item-name-kv ::= ""name"" space ":" space string tool-sessions-spawn-arg-cleanup ::= "<parameter=" "cleanup" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-cwd ::= "<parameter=" "cwd" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-label ::= "<parameter=" "label" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-mode ::= "<parameter=" "mode" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-model ::= "<parameter=" "model" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-resumeSessionId ::= "<parameter=" "resumeSessionId" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-runTimeoutSeconds ::= "<parameter=" "runTimeoutSeconds" ">\n" "" number space "\n" tool-sessions-spawn-arg-runtime ::= "<parameter=" "runtime" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-sandbox ::= "<parameter=" "sandbox" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-streamTo ::= "<parameter=" "streamTo" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-task ::= "<parameter=" "task" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-thinking ::= "<parameter=" "thinking" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-sessions-spawn-arg-thread ::= "<parameter=" "thread" ">\n" "" boolean space "\n" tool-sessions-spawn-arg-timeoutSeconds ::= "<parameter=" "timeoutSeconds" ">\n" "" number space "\n" tool-sessions-yield ::= "<function=" "sessions_yield" ">\n" space (space (tool-sessions-yield-arg-message))? space "\n" tool-sessions-yield-arg-message ::= "<parameter=" "message" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-subagents ::= "<function=" "subagents" ">\n" space (space (tool-subagents-arg-action | tool-subagents-arg-target | tool-subagents-arg-message | tool-subagents-arg-recentMinutes)){0,4} space "\n" tool-subagents-arg-action ::= "<parameter=" "action" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-subagents-arg-message ::= "<parameter=" "message" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-subagents-arg-recentMinutes ::= "<parameter=" "recentMinutes" ">\n" "" number space "\n" tool-subagents-arg-target ::= "<parameter=" "target" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-tts ::= "<function=" "tts" ">\n" space tool-tts-arg-text (space (tool-tts-arg-channel))? space "\n" tool-tts-arg-channel ::= "<parameter=" "channel" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-tts-arg-text ::= "<parameter=" "text" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-fetch ::= "<function=" "web_fetch" ">\n" space tool-web-fetch-arg-url (space (tool-web-fetch-arg-extractMode | tool-web-fetch-arg-maxChars)){0,2} space "\n" tool-web-fetch-arg-extractMode ::= "<parameter=" "extractMode" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-fetch-arg-maxChars ::= "<parameter=" "maxChars" ">\n" "" number space "\n" tool-web-fetch-arg-url ::= "<parameter=" "url" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search ::= "<function=" "web_search" ">\n" space tool-web-search-arg-query (space (tool-web-search-arg-count | tool-web-search-arg-country | tool-web-search-arg-language | tool-web-search-arg-freshness | tool-web-search-arg-date-after | tool-web-search-arg-date-before | tool-web-search-arg-search-lang | tool-web-search-arg-ui-lang)){0,8} space "\n" tool-web-search-arg-count ::= "<parameter=" "count" ">\n" "" number space "\n" tool-web-search-arg-country ::= "<parameter=" "country" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-date-after ::= "<parameter=" "date_after" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-date-before ::= "<parameter=" "date_before" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-freshness ::= "<parameter=" "freshness" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-language ::= "<parameter=" "language" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-query ::= "<parameter=" "query" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-search-lang ::= "<parameter=" "search_lang" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-web-search-arg-ui-lang ::= "<parameter=" "ui_lang" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-write ::= "<function=" "write" ">\n" space tool-write-arg-content (space (tool-write-arg-path | tool-write-arg-file-path)){0,2} space "\n" tool-write-arg-content ::= "<parameter=" "content" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-write-arg-file-path ::= "<parameter=" "file_path" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" tool-write-arg-path ::= "<parameter=" "path" ">\n" "" ([^<] | "<" [^/] | "</" [^p] | "</p" [^a] | "</pa" [^r] | "</par" [^a] | "</para" [^m] | "</param" [^e] | "</parame" [^t] | "</paramet" [^e] | "</paramete" [^r] | "</parameter" [^>] | "" [^\n])* "\n" value ::= object | array | string | number | boolean | null failed to parse grammar �[0mslot launch_slot_: id 3 | task -1 | sampler chain: logits -> ?penalties -> ?dry -> ?top-n-sigma -> top-k -> ?typical -> top-p -> min-p -> ?xtc -> temp-ext -> dist slot launch_slot_: id 3 | task 19 | processing task, is_child = 0 slot update_slots: id 3 | task 19 | new prompt, n_ctx_slot = 1048576, n_keep = 0, task.n_tokens = 22602 slot update_slots: id 3 | task 19 | n_tokens = 14336, memory_seq_rm [14336, end) slot update_slots: id 3 | task 19 | prompt processing progress, n_tokens = 16384, batch.n_tokens = 2048, progress = 0.724892 slot update_slots: id 3 | task 19 | n_tokens = 16384, memory_seq_rm [16384, end) slot update_slots: id 3 | task 19 | 8192 tokens since last checkpoint at 8192, creating new checkpoint during processing at position 18432 slot update_slots: id 3 | task 19 | prompt processing progress, n_tokens = 18432, batch.n_tokens = 2048, progress = 0.815503 slot update_slots: id 3 | task 19 | created context checkpoint 2 of 32 (pos_min = 16383, pos_max = 16383, n_tokens = 16384, size = 164.688 MiB)
Operating systems
Windows
Which llama.cpp modules do you know to be affected?
No response
Command line
Problem description & steps to reproduce
Windows server 2019 machine, pure CPU, offline.
llama version: b8468 (version after b8467, which solved grammar problems)
llama command by PowerShell:
.\llama-server.exe -t 24 --mlock --no-mmap -ngl 0 -m NVIDIA-Nemotron-3-Super-120B-A12B-GGUF\NVIDIA-Nemotron-3-Super-120B-A12B-UD-Q6_K_XL-00001-of-00004.gguf --alias "NVIDIA-Nemotron-120B" --jinja --offline -fa on --prio 3 --min_p 0.05 --temp 0.6 --top-p 0.9 --top_k 30 --ctx-size 1048576 --no-context-shift --host 0.0.0.0 --port 8080 --threads-batch 24 --threads-http 6 --timeout 36000
after called by OpenClaw (use OpenClaw to analyze many pdf document), many grammar errors displayed in PowerShell windows.
First Bad Commit
No response
Relevant log output
Logs