ClickHouse
diff --git a/.claude/CLAUDE.md b/.claude/CLAUDE.md
Lines changed: 69 additions & 7 deletions
diff --git a/.claude/settings.json b/.claude/settings.json
Lines changed: 14 additions & 0 deletions
@@ -1,17 +1,16 @@
 When working with a branch, do not use rebase or amend - add new commits instead.
 
+Do not commit to the master branch. Create a new branch for every task.
+
 When writing text such as documentation, comments, or commit messages, wrap literal names from ClickHouse SQL language, classes and functions, or literal excerpts from log messages inside inline code blocks, such as: `MergeTree`.
 
 When writing text such as documentation, comments, or commit messages, write names of functions and methods as `f` instead of `f()` - we prefer it for mathematical purity when it refers to a function itself rather than its application.
 
 When mentioning logical errors, say "exception" instead of "crash", because they don't crash the server in the release build.
 
-Links to ClickHouse CI, such as `https://s3.amazonaws.com/clickhouse-test-reports/json.html?...` should be interpreted with a headless browser, e.g., Playwright, because they contain JavaScript. Use the tool at `.claude/tools/fetch_ci_report.js`:
+Links to ClickHouse CI, such as `https://s3.amazonaws.com/clickhouse-test-reports/json.html?...` should be analyzed using the tool at `.claude/tools/fetch_ci_report.js`, which directly fetches the underlying JSON data without requiring a browser:
 
 ```bash
-# Install playwright if needed (one-time setup)
-cd /tmp && npm install playwright && npx playwright install chromium
-
 # Fetch and analyze CI report
 node /path/to/ClickHouse/.claude/tools/fetch_ci_report.js "<ci-url>" [options]
 
@@ -21,6 +20,7 @@ node /path/to/ClickHouse/.claude/tools/fetch_ci_report.js "<ci-url>" [options]
 #   --all            Show all test results
 #   --links          Show artifact links (logs.tar.gz, etc.)
 #   --download-logs  Download logs.tar.gz to /tmp/ci_logs.tar.gz
+#   --credentials <user,password>  HTTP Basic Auth for private repositories
 
 # Examples:
 node .claude/tools/fetch_ci_report.js "https://s3.amazonaws.com/..." --failed --links
@@ -33,6 +33,62 @@ tar -xzf /tmp/ci_logs.tar.gz ci/tmp/pytest_parallel.jsonl
 grep "test_name" ci/tmp/pytest_parallel.jsonl | python3 -c "import sys,json; [print(json.loads(l).get('longrepr','')) for l in sys.stdin if 'failed' in l]"
 ```
 
+To compile and run C++ code snippets against the ClickHouse codebase without modifying any source files, use the tool at `.claude/tools/cppexpr.sh`. This is a wrapper around `utils/c++expr` that auto-detects build directories and handles working directory setup. When asked about the size, layout, or alignment of ClickHouse data structures, or asked to compare performance of code snippets, use this tool to get a definitive answer instead of guessing.
+
+```bash
+# Query the size of a ClickHouse data structure
+.claude/tools/cppexpr.sh -i Core/Block.h 'OUT(sizeof(DB::Block))'
+
+# Query multiple expressions at once
+.claude/tools/cppexpr.sh -i Core/Field.h 'OUT(sizeof(DB::Field)) OUT(sizeof(DB::Array))'
+
+# Use global code for helper functions or custom types
+.claude/tools/cppexpr.sh -g 'struct Foo { int a; double b; };' 'OUT(sizeof(Foo)) OUT(alignof(Foo))'
+
+# Benchmark a code snippet (100000 iterations, 5 tests)
+.claude/tools/cppexpr.sh -i Common/Stopwatch.h -b 100000 'Stopwatch sw;'
+
+# Standalone mode (no ClickHouse headers, just standard C++)
+.claude/tools/cppexpr.sh --plain 'OUT(sizeof(std::string))'
+```
+
+Key options: `-i HEADER` to include headers, `-g 'CODE'` for global-scope code, `-b STEPS` for benchmarking, `-l LIB` to link extra libraries, `--plain` for standalone compilation without ClickHouse. The `OUT(expr)` macro prints `expr -> value`.
+
+When asked to analyze assembly, inspect generated code, find register spills, check branch density, compare codegen between builds, or investigate optimization opportunities in compiled functions, use the tool at `.claude/tools/analyze-assembly.py`. It disassembles functions from a compiled binary, builds a CFG, computes metrics (spill/branch/call density), and reports findings. Use it instead of manually running `llvm-objdump` or `llvm-nm`.
+
+```bash
+# Basic analysis of a function
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>"
+
+# Search for overloaded/templated functions by regex
+python3 .claude/tools/analyze-assembly.py <binary> "insertRangeFrom" --search
+
+# Pick a specific overload from ambiguous results
+python3 .claude/tools/analyze-assembly.py <binary> "insertRangeFrom" --search --select 3
+
+# JSON output for structured analysis
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --format json
+
+# Source-interleaved disassembly (needs debug info)
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --source
+
+# Microarchitectural analysis of loop bodies (--mcpu is required)
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --mca --mcpu=znver3
+
+# Profile-weighted analysis (re-ranks findings by runtime impact)
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --perf-map tmp/perf.map.jsonl
+
+# Compare codegen between two builds
+python3 .claude/tools/analyze-assembly.py --before <old_binary> --after <new_binary> "<function_name>"
+
+# Verbose mode to see tool commands
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" -v
+```
+
+Key options: `--search` for regex matching, `--fuzzy` for substring matching, `--select N` to pick from ambiguous results, `--all` to analyze all matches, `--context N` to show surrounding symbols, `--max-instructions N` to control output size, `--mca --mcpu=<model>` for llvm-mca throughput analysis, `--perf-map <file>` for runtime-weighted scoring, `--before`/`--after` for diff mode. The tool caches symbol tables by build-id for fast repeated queries.
+
+**IMPORTANT**: `--select N` does NOT imply `--search`. When using a regex pattern with `--select`, you MUST also pass `--search`, e.g. `--search --select 1`. Without `--search`, the pattern is treated as a literal exact match and will fail.
+
 You can build multiple versions of ClickHouse inside `build_*` directories, such as `build`, `build_debug`, `build_asan`, etc.
 
 You can run integration tests as in `tests/integration/README.md` using: `python -m ci.praktika run "integration" --test <selectors>` invoked from the repository root.
@@ -49,17 +105,23 @@ When writing messages, say ASan, not ASAN, and similar (because there are two wo
 
 When checking the CI status, pay attention to the comment from robot with the links first. Look at the Praktika reports first. The logs of GitHub actions usually contain less info.
 
-Do not use `-j` argument with ninja - let it decide automatically. 
+Do not pass the `-j` argument to ninja and do not use `nproc` - let ninja decide the parallelism automatically.
 
 If I provided a URL with the CI report, logs, or examples, include it in the commit message.
 
 When creating a pull request, append Changelog category and Changelog entry according to this template: `.github/PULL_REQUEST_TEMPLATE.md`. The "Bug Fix" category should be used only for real bug fixes, while for fixing CI reports you can use the "CI Fix or improvement" category. Include the URL to CI report I provided if any. If the PR is about a CI failure, search for the corresponding open issues and provide a link in the PR description.
 
-ARM machines in CI are not slow. They are similar to x86 in performance. 
+ARM machines in CI are not slow. They are similar to x86 in performance.
+
+Use the `tmp` subdirectory in the current directory for temporary files (logs, downloads, scripts, etc.); do not use `/tmp`. Create the directory if needed.
+
+When there are crucial findings (when I corrected your behavior, or when you found a crucial insight about the code), append them to `.claude/learnings.md`, but be concise. You can commit the changes in `learnings.md` along with the other changes. Read this file at start.
 
 Always load and apply the following skills:
 
-- .claude/skills/install-skills
 - .claude/skills/build
 - .claude/skills/test
 - .claude/skills/fix-sync
+- .claude/skills/alloc-profile
+- .claude/skills/bisect
+- .claude/skills/create-worktree
@@ -0,0 +1,14 @@
+{
+  "permissions": {
+    "allow": [
+      "Bash(gh pr view:*)",
+      "Bash(gh issue view:*)",
+      "Bash(gh pr list:*)",
+      "Bash(gh issue list:*)",
+      "Bash(gh pr checks:*)",
+      "Bash(gh pr diff:*)",
+      "Bash(gh search:*)",
+      "WebFetch(domain:github.com)"
+    ]
+  }
+}
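
The allow-list in the `settings.json` hunk above can be read as a set of command-prefix rules. The following is a minimal sketch of that reading, not the actual permission engine: it assumes that `Bash(<prefix>:*)` permits any shell command that equals `<prefix>` or continues it after a space, which this diff does not confirm. The helper names `bash_prefixes` and `is_bash_allowed` are hypothetical and exist only for illustration.

```python
import re

# Allow-list copied verbatim from the settings.json hunk above.
ALLOW = [
    "Bash(gh pr view:*)",
    "Bash(gh issue view:*)",
    "Bash(gh pr list:*)",
    "Bash(gh issue list:*)",
    "Bash(gh pr checks:*)",
    "Bash(gh pr diff:*)",
    "Bash(gh search:*)",
    "WebFetch(domain:github.com)",
]


def bash_prefixes(rules):
    """Extract the command prefix from every rule shaped like Bash(<prefix>:*)."""
    prefixes = []
    for rule in rules:
        match = re.fullmatch(r"Bash\((.+):\*\)", rule)
        if match:
            prefixes.append(match.group(1))
    return prefixes


def is_bash_allowed(command, rules=ALLOW):
    """ASSUMED semantics: allowed if the command is a prefix or extends it after a space."""
    return any(
        command == prefix or command.startswith(prefix + " ")
        for prefix in bash_prefixes(rules)
    )


if __name__ == "__main__":
    for cmd in ("gh pr view 12345", "gh pr merge 12345", "gh search issues flaky"):
        print(cmd, "->", is_bash_allowed(cmd))
```

Under this reading, every rule in the list covers a read-only `gh` subcommand, so a mutating command such as `gh pr merge` would still require explicit approval.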