Skip to content

common: fix --fit verbosity with --verbosity 4#23282

Merged
JohannesGaessler merged 1 commit into
ggml-org:masterfrom
JohannesGaessler:common-fix-fit-verbosity
May 19, 2026
Merged

common: fix --fit verbosity with --verbosity 4#23282
JohannesGaessler merged 1 commit into
ggml-org:masterfrom
JohannesGaessler:common-fix-fit-verbosity

Conversation

@JohannesGaessler

@JohannesGaessler JohannesGaessler commented May 18, 2026

Copy link
Copy Markdown
Contributor

Back when I wrote the --fit code I determined the verbosity of the creation of virtual models/contexts vs. a hard-coded value of 4. This is now incorrect since 4 is now LOG_LEVEL_TRACE so the console output becomes a lot more verbose than intended. This PR replaces 4 with LOG_LEVEL_DEBUG which is currently 5. With LOG_LEVEL_TRACE the fitting code now only prints information about the memory in a compact way.

Also while looking at the console output at different log levels I noticed that the print informing the user of the warmup run is LOG_LEVEL_WARN which seems incorrect to me; I changed it to LOG_LEVEL_INFO.

Requirements

@JohannesGaessler JohannesGaessler requested a review from a team as a code owner May 18, 2026 14:22
@JohannesGaessler JohannesGaessler merged commit 7256fce into ggml-org:master May 19, 2026
46 of 49 checks passed
fhnmor21 pushed a commit to fhnmor21/llama-cpp-turboquant that referenced this pull request May 19, 2026
dbrain pushed a commit to dbrain/hbd-llama-cpp-turboquant that referenced this pull request May 21, 2026
baramofme pushed a commit to baramofme/llama-cpp-turboquant that referenced this pull request May 23, 2026
srossitto79 pushed a commit to srossitto79/llama.cpp that referenced this pull request May 23, 2026
fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026
turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026
Jcfunk added a commit to Jcfunk/llama.cpp that referenced this pull request Jun 11, 2026
* upstream/HEAD: (25 commits)
  metal : optimize pad + cpy (ggml-org#23354)
  snapdragon: update toolchain to v0.6 (ggml-org#23369)
  ggml-cuda: tune RDNA3 Q6_K MMVQ nwarps (ggml-org#23349)
  opencl: add MoE support for q4_k, q5_k, q6_k on Adreno (ggml-org#23303)
  hexagon: add MROPE and IMROPE support in HTP rope op (ggml-org#23317)
  refactor: Chat Screen UI rendering (ggml-org#23333)
  github: mention --log-file in issue templates (ggml-org#23277)
  common: fix --help for --verbosity (ggml-org#23278)
  common: fix --fit verbosity with --verbosity 4 (ggml-org#23282)
  convert : update mtp related help (ggml-org#23334)
  hexagon: enable support for NORM op (ggml-org#23319)
  model : clarify MTP layer comment in qwen35.cpp [no ci] (ggml-org#23338)
  llama : MTP clean-up (ggml-org#23269)
  ui: Bump packages + address build warnings (ggml-org#23300)
  ci : install libssl-dev (ggml-org#23325)
  ci : install server kleidiai runner dependencies (ggml-org#23259)
  server-context: guarantee there is at least 1 token to decode (ggml-org#23280)
  server : print graphs reused in slot timings (ggml-org#23279)
  save-load-state : refactor tests and improve readability (ggml-org#23196)
  llama-eval : add per-task summary stats (ggml-org#23151)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants