Part of #80171 and follow-up to #80323.
The Phase 2 runtime tool fixture set exists, but it is currently an inventory/reporting lane rather than a hard default-tool parity gate.
Audit snapshot from #80323:
- `qa/scenarios/runtime/tools/` contains 20 runtime tool fixtures.
- Every fixture carries `knownBroken` metadata because the harness surfaced real Pi/Codex/default-surface drift tracked by #80319 (QA tool-defaults suite conflates Codex-native tools with OpenClaw dynamic tool parity), #80320 ([QA harness] mock Pi provider id and apply_patch failure fixture distort parity), and #80321 ([QA harness] direct message fixture targets messaging-profile tool from coding profile).
- [...] `agentic` parity pack, not the full 20-tool fixture set.

Why this matters:
If Codex becomes the default OpenAI runtime, a first-hour gate should prove the default tool surface is both visible and callable under Pi and Codex. Keeping every fixture as known-broken preserves evidence, but it does not yet protect maintainers from regressing default tool availability once the known drifts are fixed.
Acceptance sketch:
- [...] `knownBroken` metadata from fixed tools and fail the report on any required-tool call/result drift.
- [...] `--runtime-pair pi,codex`.
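
To make the "fail the report on any required-tool call/result drift" criterion concrete, here is a minimal sketch of what a hard gate could check. Every name in it (`ToolFixtureResult`, `evaluateParityGate`, the field names) is hypothetical and not taken from the existing QA harness. It assumes each fixture run yields one record per tool and runtime, and that fixtures still marked `knownBroken` are reported but not enforced.

```typescript
// Hypothetical shapes for illustration only; not the harness's real types.
type Runtime = "pi" | "codex";

interface ToolFixtureResult {
  tool: string; // tool name, e.g. "apply_patch"
  runtime: Runtime;
  visible: boolean; // the tool was advertised on the default surface
  called: boolean; // the tool call was actually issued
  resultOk: boolean; // the tool result matched the fixture expectation
  knownBroken: boolean; // the fixture still carries knownBroken metadata
}

interface GateReport {
  passed: boolean;
  failures: string[];
}

// Fails the gate when any required tool is missing, invisible, uncalled,
// or returns a drifting result under any runtime in the pair.
function evaluateParityGate(
  requiredTools: string[],
  runtimes: Runtime[],
  results: ToolFixtureResult[],
): GateReport {
  const failures: string[] = [];
  for (const tool of requiredTools) {
    for (const runtime of runtimes) {
      const r = results.find((x) => x.tool === tool && x.runtime === runtime);
      if (r === undefined) {
        failures.push(`${tool}/${runtime}: no fixture result`);
        continue;
      }
      // Assumption: knownBroken fixtures stay in the inventory lane
      // and do not fail the gate until the metadata is removed.
      if (r.knownBroken) {
        continue;
      }
      if (!r.visible) {
        failures.push(`${tool}/${runtime}: tool not visible`);
      } else if (!r.called) {
        failures.push(`${tool}/${runtime}: tool call drift`);
      } else if (!r.resultOk) {
        failures.push(`${tool}/${runtime}: tool result drift`);
      }
    }
  }
  return { passed: failures.length === 0, failures };
}
```

Under this sketch, removing `knownBroken` from a fixed tool is what moves it from reporting into enforcement, so a later regression for that tool under either runtime would make `passed` false.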