Performance Review

> _Migrated from [spboyer/waza#445](https://github.com/spboyer/waza/issues/445)_
> _Last validated: March 3, 2026 — 25 still present, 2 partial, 1 fixed (#12). See comments for details._

## Go Performance Audit — Dual-Model Expert Review

**Audited by:** Turk (Go Performance Specialist)
**Models used:** GPT-5.3-Codex (28 findings) + Claude Opus 4.6 (23 findings)
**Codebase:** 239 Go files, ~53K LOC

---

## Summary

The audit produced 30 unique findings across 8 categories: 19 were reported by both models (highest confidence), 7 are unique to Codex, and 4 are unique to Opus. The 3 severity disagreements are resolved below.

---

## Agreement — Both Models Found These (highest confidence)

| # | Severity | File | Issue |
|---|----------|------|-------|
| 1 | 🔴 P0 | `orchestration/runner.go:658-671` | O(N²) stop-on-error scan — re-scans all outcomes each iteration |
| 2 | 🔴 P0 | `orchestration/runner.go:1167+` | Graders recreated per run × test × grader (+ config reload) |
| 3 | 🔴 P0 | `orchestration/runner.go:1081-1140` | Fixture files re-read per run (redundant I/O) |
| 4 | 🟡 P1 | `cmd/waza/cmd_run.go:607` | `context.Background()` — no signal cancellation, Ctrl+C broken |
| 5 | 🟡 P1 | `graders/inline_script_grader.go:142-160` | Temp script file created/deleted per `Grade()` call |
| 6 | 🟡 P1 | `graders/program_grader.go:47-55` | `.waza.yaml` loaded per grader construction |
| 7 | 🟡 P1 | `webapi/store.go:258-299` | Summaries recomputed per request (no caching) |
| 8 | 🟡 P1 | `webapi/store.go:47-128` | Full reload under write lock / race on first load |
| 9 | 🟡 P1 | `cache/cache.go:110-153` | Global mutex held during disk I/O |
| 10 | 🟡 P1 | `tokens/bpe/tokenizer.go:201-332` | Regex + slice churn in hot encode paths |
| 11 | 🟡 P1 | `tokens/bpe/tokenizer.go:605` | Decode buffer starts at zero capacity |
| 12 | ✅ Fixed | `cmd/waza/dev/links.go:336-351` | ~~HTTP response body not drained — breaks keep-alive~~ (Fixed: body now properly closed) |
| 13 | 🟡 P1 | `cmd/waza/dev/links.go:191` | `goldmark.New()` recreated per file |
| 14 | 🟡 P1 | `jsonrpc/handlers.go:260-281` | `h.runs` map never pruned — memory leak |
| 15 | 🟢 P2 | `execution/session_events_collector.go:54-118` | Shared state mutation without synchronization |
| 16 | 🟢 P2 | `trigger/runner.go:56-64` | Goroutines don't check `ctx.Done()` before semaphore |
| 17 | 🟢 P2 | `spinner/spinner.go:19-30` | `time.After` in loop leaks timers |
| 18 | 🟢 P2 | `checks/token_limits.go:73-79` | Double string allocation on file content |
| 19 | 🟢 P2 | `jsonrpc/transport.go:44-67` | Marshal + append newline per message |

## Unique to GPT-5.3-Codex

| # | Severity | File | Issue |
|---|----------|------|-------|
| 20 | 🟡 P1 | `webserver/server.go:46-50` | Missing ReadTimeout, WriteTimeout, IdleTimeout |
| 21 | 🟡 P1 | `execution/copilot.go:266-283` | Temp workspaces retained until engine shutdown (disk/inode buildup) |
| 22 | 🟡 P1 | `orchestration/runner.go:174-178` | Shutdown uses possibly-cancelled context |
| 23 | 🟡 P1 | `tokens/bpe/tokenizer.go:475-589` | `EncodeTrimPrefix` growing progress map |
| 24 | 🟢 P2 | `cmd/waza/cmd_run.go:994-1000` | Full-buffer marshal for large results (use streaming) |
| 25 | 🟢 P2 | `cmd/waza/dev/links.go:55-63` | Default transport not tuned for batch URL checks |
| 26 | 🟢 P2 | `jsonrpc/handlers.go:266-267` | Detached context from request lifecycle |

## Unique to Claude Opus 4.6

| # | Severity | File | Issue |
|---|----------|------|-------|
| 27 | 🟢 P2 | `orchestration/runner.go:533-547` | Template context copies `spec.Inputs` map per CSV row |
| 28 | 🟢 P2 | `cache/cache.go:64-88` | Three separate `json.Marshal` calls for cache key |
| 29 | 🟢 P2 | `webapi/handlers.go:119` | `json.Encoder.Encode()` error silently ignored |
| 30 | 🟢 P2 | `execution/copilot.go:186` | `Shutdown` accepts ctx but ignores it — `RemoveAll` could hang |

## Severity Disagreements (Resolved)

| File | Issue | Codex Says | Opus Says | Resolution |
|------|-------|-----------|----------|------------|
| `orchestration/runner.go:658` | O(N²) scan | 🟢 P2 | 🔴 P0 | **P0** — compounds with run count |
| `jsonrpc/handlers.go:260` | Runs map never pruned | 🔴 P0 | 🟢 P2 | **P1** — real leak, but MCP server is short-lived |
| `orchestration/runner.go:1081` | Fixture re-read per run | 🟢 P2 | 🔴 P0 | **P0** — multiplicative with runs × tests |
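
To illustrate why the scan in the first row compounds with run count, and what the flag-based alternative looks like, here is a minimal sketch. The `outcome` type and `runUntilFailure` function are hypothetical stand-ins, not the runner's actual types.

```go
package main

import "fmt"

// outcome is a hypothetical stand-in for a per-test result.
type outcome struct {
	name   string
	passed bool
}

// runUntilFailure executes tests in order and, when stopOnError is set,
// stops after the first failure. The failure is recorded in one flag as
// each outcome is produced, so earlier outcomes are never re-scanned.
func runUntilFailure(tests []string, run func(string) bool, stopOnError bool) []outcome {
	outcomes := make([]outcome, 0, len(tests))
	anyFailed := false
	for _, t := range tests {
		if stopOnError && anyFailed {
			break
		}
		ok := run(t)
		outcomes = append(outcomes, outcome{name: t, passed: ok})
		if !ok {
			anyFailed = true
		}
	}
	return outcomes
}

func main() {
	tests := []string{"a", "b", "c", "d"}
	failB := func(name string) bool { return name != "b" }
	got := runUntilFailure(tests, failB, true)
	fmt.Println("executed:", len(got))
}
```

Each iteration does constant work for the stop check, whereas re-scanning all prior outcomes costs work proportional to the number already collected.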

---

## 🏆 Prioritized Action List

### P0 — Fix Now (biggest impact)

1. **Cache graders per spec** — create once, reuse across runs. Eliminates #2 and #6 together.
2. **Cache fixture file content** — read once at test start, reuse across runs. Fixes #3.
3. **Replace O(N²) scan with boolean flag** — track `anyFailed` once. Fixes #1.
4. **Wire `signal.NotifyContext`** — one-line fix in `cmd_run.go`. Fixes #4.

### P1 — Fix Soon

5. **Cache `RunSummary` in store** — compute on load, invalidate on reload. Fixes #7.
6. **Reuse inline script grader temp files** — materialize once per grader instance. Fixes #5.
7. **RWMutex + narrow lock scope in cache** — I/O outside critical section. Fixes #9.
8. **Pre-allocate tokenizer slices** — `make([]int, 0, len/4)` heuristic. Fixes #10, #11.
9. ~~**Drain HTTP response bodies**~~ — ✅ Fixed. #12 resolved.
10. **Add server timeouts** — ReadTimeout, WriteTimeout, IdleTimeout. Fixes #20.
11. **Prune JSON-RPC runs map** — delete from `h.runs` alongside `h.cancelFuncs`. Fixes #14.

### P2 — When Convenient

12. Reuse goldmark parser (#13)
13. Compact JSON for cache/subprocess (#19)
14. Remaining items (#15-18, #22-30)

---

> This audit was generated by running two independent model passes and synthesizing the overlap. Full decision records are in `.ai-team/decisions.md`.

