Fix #667: SMT translator coverage for FloatLit / IndexExpr / ArrayLit by aallan · Pull Request #670 · aallan/vera

aallan · 2026-05-13T14:55:21Z

Summary

Extends vera/smt.py::translate_expr to handle three previously-unsupported Expr subclasses in contract predicates — FloatLit, IndexExpr, ArrayLit. Pre-fix all three returned None from the SMT translator, dropping every affected predicate to Tier 3 (runtime check). Post-fix they translate to Z3 expressions and the predicates verify at Tier 1.

The #667 issue body claimed the contract grammar would reject these shapes. Reality-check showed the parser and type checker already accept them — only the SMT translator was missing the cases. No grammar / checker changes needed.

FloatLit → z3.RealVal(value) (Float64 → Z3 Real sort).
IndexExpr → uninterpreted index_<sort>(arr, i) function; sound for relational facts via function congruence.
ArrayLit → fresh Array_<elt> constant with length(lit) == N and per-element index(lit, i) == elt_i axioms.

Also introduces Array-sort infrastructure in the SMT layer so Array<T> params get a proper Array_<T> uninterpreted sort instead of falling back to declare_int.

Latent issue surfaced

Closing the gap exposed two contracts the pre-fix verifier was vacuously verifying (the untranslatable literal masked the body's semantics, so the postcondition's negation was unsatisfiable in the resulting incomplete model). Both honestly relaxed with explanatory comments:

examples/json.vera::main: ensures(@Int.result == 0) → ensures(true) — the static return value depends on the specific JSON literal parsing successfully; no helper carries a postcondition strong enough to let the verifier conclude result == 0.
tests/conformance/ch06_quantifiers.vera::main and ::test_has_zero: both ensures(@Bool.result == true) → ensures(true) — all_positive / has_zero carry ensures(true) so the static verifier can't conclude the postcondition from the specific array literals.

Tests

5 new tests in tests/test_verifier_coverage.py::TestSmtCoverage667 pinning that the new predicates verify at Tier 1 (assertion: tier1_verified >= N — Tier 3 is also error-free, so "no errors" alone would be too weak).
4 pre-existing edge-case tests swapped their FloatLit sentinel (now Handled) for UnitLit (still intentionally-unsupported).
test_overall_tier_counts updated: 252/26/278 → 253/25/278 (net +1 T1, -1 T3 — one previously-T3 example contract now verifies T1).

Spec

spec/06-contracts.md §6.3.1 "Allowed in All Contracts" now lists Float64 literals, array index expressions, array literals, and String literals (the last was pre-existing but undocumented). Each new entry cross-links to #667.

Test plan

pytest tests/ -q → 3,854 passed, 14 skipped, 14 deselected
mypy vera/ → clean
python scripts/check_walker_coverage.py → clean
python scripts/check_doc_counts.py → consistent (3,882 tests, 32 files)
python scripts/check_spec_examples.py → 74 verify OK, 1 allowlisted, 0 failed
python scripts/check_version_sync.py → consistent across 6 files
Doc sweep across SKILL/CLAUDE/AGENTS/README/vera/README/spec/docs/examples → no stale "FloatLit/IndexExpr/ArrayLit fall to Tier 3" claims
ROADMAP cleanup: Expand smt.translate_expr coverage: FloatLit, ArrayLit, IndexExpr #667 removed from Stabilisation tier (was Order 5); Agent-integration tier renumbered 5-7

Closes #667

Summary by CodeRabbit

New Features
- SMT translator now supports float literals, array index expressions and array literals in contracts, enabling typed-array reasoning and verification.
Bug Fixes
- Relaxed over-strong example/conformance postconditions to address a soundness-related issue.
Tests
- Added regression suite and updated Tier 1 tests to cover the new SMT constructs and adjusted sentinel behaviour.
Documentation
- Changelog, README, ROADMAP, TESTING and spec updated; project version bumped to v0.0.153.

The #667 issue body claimed FloatLit / IndexExpr / ArrayLit in contract predicates were blocked by the contract grammar. Reality-check against `vera check` showed the parser and type checker already accept all three — only the SMT translator was missing the cases. Pre-fix all three returned `None` from `vera/smt.py::translate_expr`, dropping every affected predicate to Tier 3 (runtime check). What ships: **1. SMT translation cases (`vera/smt.py::translate_expr`)** - `FloatLit` → `z3.RealVal(value)`. Float64 already maps to Z3's `Real` sort (sound for relational properties; not a full IEEE-754 model — real arithmetic is decidable in Z3, FP isn't). - `IndexExpr` → uninterpreted `index_<sort>(arr, i)` function call. Sound (function congruence: two references to `arr[i]` with the same `i` produce the same value) but partial — the verifier can't reason about element structure beyond what explicit predicates assert. Quantified contracts ("for all valid i, arr[i] > 0") remain Tier 2 / Tier 3 territory. - `ArrayLit` → fresh `Array_<elt>` constant with `length(lit) == N` and per-element `index(lit, i) == elt_i` axioms asserted to the solver. Element types that can't be sorted (e.g. function-typed) fail cleanly via None. **2. Array-sort infrastructure (`vera/smt.py`)** Added `_get_array_sort`, `_get_index_fn`, `declare_array_var`, and a sibling `_index_fns` dict. `Array<T>` parameters are now declared as constants of uninterpreted `Array_<T>` sorts; pre-fix they fell through to `declare_int(z3_name)` because `Array` isn't in the SMT layer's `_adt_registry`, making `Array<Int>` slots numerically-typed in Z3. **3. Verifier routing (`vera/verifier.py`)** Added `_is_array_type` (peels RefinementType, checks `AdtType("Array", ...)`) and `_declare_array_var` (resolves element type → Z3 sort, delegates to `smt.declare_array_var`). Routed `Array<T>` params and return types through the new path in `_verify_decl`. **4. WALKER_COVERAGE checklist update (`vera/smt.py`)** Moved FloatLit / IndexExpr / ArrayLit from the "Latent gap" section to "Handled" with rationale strings. Cited #427 on the still-unsupported `ForallExpr` / `ExistsExpr` entries. **5. Honest contract relaxations** Closing the gap exposed two contracts the pre-fix verifier was vacuously verifying — the untranslatable literal masked the body's semantics, so the postcondition's negation was unsatisfiable in the resulting incomplete model. Two contracts relaxed with explanatory comments: - `examples/json.vera::main`: `ensures(@Int.result == 0)` → `ensures(true)`. The static return value depends on the specific hardcoded JSON literal parsing successfully; none of `parse_current_temp` / `parse_average_temp` / `round1` carries a postcondition strong enough to let the verifier conclude `@Int.result == 0` statically. - `tests/conformance/ch06_quantifiers.vera::main` and `::test_has_zero`: both `ensures(@Bool.result == true)` → `ensures(true)`. Helpers `all_positive` / `has_zero` carry `ensures(true)`; the static verifier can't conclude the postcondition from the specific array literals. **6. New tests (`tests/test_verifier_coverage.py::TestSmtCoverage667`)** 5 tests pinning that FloatLit / IndexExpr / ArrayLit contracts verify at Tier 1, asserting `tier1_verified >= N` so a regression dropping back to Tier 3 fails (Tier 3 is also error-free). **7. Pre-existing edge-case test updates** Four tests used FloatLit as a sentinel for "untranslatable expr" — swapped to UnitLit, which is still intentionally- unsupported in the SMT translator (predicates are Bool, not Unit). `test_overall_tier_counts` updated: 252/26/278 → 253/25/278 (net +1 T1, -1 T3, total unchanged — one previously-T3 example contract using a float-comparison predicate now verifies T1). **8. Spec update (`spec/06-contracts.md`)** §6.3.1 "Allowed in All Contracts" now lists Float64 literals, array index expressions, array literals, and String literals (the last was pre-existing but undocumented). Each new entry cross-links to #667. **9. ROADMAP cleanup** Removed #667 from the Stabilisation tier (was Order 5). Agent-integration tier renumbered 5-7 (was 6-8). Release prep: - Version bumped to v0.0.153 across `__init__.py`, `pyproject.toml`, `uv.lock`, `docs/index.html`, `README.md`. - CHANGELOG.md v0.0.153 entry with Added + Fixed + Tests sections. - HISTORY.md row added (one-line template-compliant). - TESTING.md totals: 3,877 → 3,882 tests; per-file counts updated for `test_verifier.py` and `test_verifier_coverage.py`. Validation: - `pytest tests/ -q` → 3,854 passed, 14 skipped, 14 deselected - `mypy vera/` → clean - `python scripts/check_walker_coverage.py` → clean - `python scripts/check_doc_counts.py` → consistent - `python scripts/check_spec_examples.py` → 74 verify OK, 1 allowlisted, 0 failed - Doc sweep across SKILL/CLAUDE/AGENTS/README/vera/README/ spec/docs/examples → no stale "FloatLit/IndexExpr/ArrayLit fall to Tier 3" claims Closes #667 Co-Authored-By: Claude <noreply@anthropic.invalid>

coderabbitai · 2026-05-13T14:55:35Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds SMT translation for FloatLit, IndexExpr and ArrayLit with typed-array Z3 sorts; verifier declares Array SMT variables; tests, spec, docs and release metadata updated and project version bumped to v0.0.153.

Changes

Issue #667 SMT Translator and Verifier Array/Float Support

Layer / File(s)	Summary
SMT translator: Float, Index, Array translation `vera/smt.py`	`SmtContext` adds FLOAT64 import, typed-array caches, `_get_array_sort`/`_get_index_fn`/`declare_array_var`, and `translate_expr` branches to handle `FloatLit` → `z3.RealVal`, `IndexExpr` → `index_<sort>(coll,i)`, and `ArrayLit` → fresh typed array constant with `length`/`index` axioms; call/return handling special-cases `Array<T>` results.
Verifier: Array recognition and declaration `vera/verifier.py`	Verifier imports `z3`, recognises `Array<T>` via `_is_array_type`, adds `_declare_array_var` to map element types to Z3 sorts and delegate to `smt.declare_array_var`, and special-cases parameter/return Z3 variable declarations for `Array<T>` with fallback to integer when mapping fails.
Tests: Tier‑1 coverage and sentinel updates `tests/test_verifier_coverage.py`, `tests/test_verifier.py`	Unsupported-expression tests now use `UnitLit`. New `TestSmtCoverage667` asserts Tier‑1 verification for `FloatLit`, `IndexExpr`, and `ArrayLit` (including element access) and adds negative E500 regressions; related binary/unary/if edge tests and overall tier-count assertions updated.
Specification, scripts, docs and release notes `spec/06-contracts.md`, `scripts/check_spec_examples.py`, `CHANGELOG.md`, `HISTORY.md`, `README.md`, `ROADMAP.md`, `TESTING.md`	Spec §6 documents Float64 literals, index expressions and array literals with uninterpreted encoding; spec allowlist line updated; changelog and docs updated and Unreleased compare link advanced to v0.0.153.
Version bump and packaging metadata `pyproject.toml`, `vera/__init__.py`	Project and package version updated to `0.0.153`.

Sequence Diagram

sequenceDiagram
  participant ContractVerifier
  participant SmtContext
  participant Z3 as Z3_Solver

  ContractVerifier->>SmtContext: request translation (FloatLit / IndexExpr / ArrayLit)
  SmtContext->>Z3: produce RealVal / declare Array_<elt> / call index_<sort>(arr,i)
  SmtContext->>Z3: assert length/index axioms (for ArrayLit)
  Z3-->>SmtContext: z3.ExprRef or confirmation
  SmtContext-->>ContractVerifier: translated expression or None

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Suggested labels

compiler, tests, spec, ci, docs

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly and clearly summarizes the main change: extending SMT translator coverage for three previously unsupported expression types (FloatLit, IndexExpr, ArrayLit) that are central to issue `#667`.
Linked Issues check	✅ Passed	The PR fully addresses issue `#667`'s three core objectives: FloatLit translates to z3.RealVal, IndexExpr to uninterpreted index_ functions, and ArrayLit to typed array constants with axioms. Array-sort infrastructure and SMT coverage are complete.
Out of Scope Changes check	✅ Passed	All changes directly support the three translation cases (`#667`): version bumps, documentation updates (CHANGELOG, HISTORY, README, ROADMAP, TESTING, SPEC), test infrastructure (including 12 new tests in TestSmtCoverage667), and SMT/verifier implementation files. No extraneous changes detected.
Docstring Coverage	✅ Passed	Docstring coverage is 97.44% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix-667-smt-coverage

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codecov · 2026-05-13T14:56:23Z

Codecov Report

❌ Patch coverage is 82.05128% with 21 lines in your changes missing coverage. Please review.
✅ Project coverage is 91.02%. Comparing base (8c79f22) to head (2dda05a).

Files with missing lines	Patch %	Lines
vera/smt.py	78.88%	19 Missing ⚠️
vera/verifier.py	92.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #670      +/-   ##
==========================================
- Coverage   91.08%   91.02%   -0.07%     
==========================================
  Files          60       60              
  Lines       23283    23398     +115     
  Branches      259      259              
==========================================
+ Hits        21207    21297      +90     
- Misses       2069     2094      +25     
  Partials        7        7

Flag	Coverage Δ
javascript	`57.36% <ø> (ø)`
python	`94.77% <82.05%> (-0.10%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Sweep flagged that `test_overall_tier_counts`'s first-line summary still said "252 T1 / 26 T3 / 278 total (current)" while the assertion below already enforces 253/25/278. The trajectory log in the docstring is correct; only the header line drifted. One-line fix. Not a behaviour change — only a comment. Refs #667 #670 Co-Authored-By: Claude <noreply@anthropic.invalid>

coderabbitai

Actionable comments posted: 3

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@spec/06-contracts.md`:
- Around line 137-144: The spec currently lists array index expressions
(`@Array<T>.0[i]`, "Array index expressions") as Tier 1 in the feature list but
still marks array element access as Tier 2 in Section 6.3.2; reconcile by making
a single authoritative statement: update Section 6.3.2 to reflect that array
indexing is Tier 1 (or else change the feature list to Tier 2) so the tiers
match the compiler behavior, and ensure the wording references the same symbol
(`index_<T>(arr, i)` / `@Array<T>.0[i]`) and notes the same semantics
(uninterpreted `index_<T>` with soundness constraints) to keep both places
consistent.

In `@vera/smt.py`:
- Around line 665-696: The reverse lookup in _get_element_sort_for_array fails
for ADT-backed arrays because array sorts like "Array_List_Int" don't map to ADT
keys stored under _adt_sort_key (e.g. "List<Int>"); fix by maintaining an
explicit mapping (e.g. self._array_element_sorts) from the array sort
object/string to its element SortRef whenever you create or register an
array/index function (where _index_fns is populated), and then have
_get_element_sort_for_array first consult that mapping (falling back to the
existing name-pattern checks and _z3_sorts lookup). Update the code paths that
construct array sorts or register index functions to populate
self._array_element_sorts[array_sort] = element_sort so ADT element sorts are
recovered correctly.

In `@vera/verifier.py`:
- Around line 466-474: The call-site allocates fresh call-result variables as
Int regardless of return type, so callers cannot reason about helper functions
returning Array<T>/String/Float64; update SmtContext._translate_call_with_info()
to mirror the verifier's typed-declaration logic used for parameters: when
computing the fresh result var (instead of always smt.declare_int), check using
_is_array_type, _is_string_type, _is_float64_type, ADT/Bool/Nat branches and
call the same helpers (e.g. _declare_array_var or the corresponding typed
declare routines) to create a properly typed SMT variable for the call result;
apply the same adjustment to the other similar block referenced (the other
branch around the noted lines) so return variables for helper calls receive
typed uninterpreted sorts rather than Int.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 1e673230-989a-41ca-b200-cd43f6305fbf

📥 Commits

Reviewing files that changed from the base of the PR and between 8c79f22 and 657f5d1.

⛔ Files ignored due to path filters (7)

docs/index.html is excluded by !docs/**
docs/index.md is excluded by !docs/**
docs/llms-full.txt is excluded by !docs/**
docs/llms.txt is excluded by !docs/**
examples/json.vera is excluded by !**/*.vera
tests/conformance/ch06_quantifiers.vera is excluded by !**/*.vera
uv.lock is excluded by !**/*.lock, !uv.lock

📒 Files selected for processing (13)

CHANGELOG.md
HISTORY.md
README.md
ROADMAP.md
TESTING.md
pyproject.toml
scripts/check_spec_examples.py
spec/06-contracts.md
tests/test_verifier.py
tests/test_verifier_coverage.py
vera/__init__.py
vera/smt.py
vera/verifier.py

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@vera/smt.py`:
- Around line 665-697: _get_element_sort_for_array currently extracts elt_key
from the Z3 sort string and only calls self._z3_sorts.get(elt_key), which misses
canonical keys like "Array<MdBlock>" and never consults the _index_fns registry
mentioned in the docstring; update _get_element_sort_for_array to: (1) after the
simple name checks, try self._z3_sorts.get(elt_key) as now, (2) if that returns
None, try alternate canonical keys such as f"Array<{elt_key}>" (and any other
canonical formatting your _z3_sorts uses), and (3) if still None, consult
self._index_fns for the array_sort (or its string) to recover the registered
index function and derive the element sort from that registry entry so
_translate_index_expr can recover ADT element sorts instead of returning None.
Ensure you reference _get_element_sort_for_array, _z3_sorts, _index_fns and
_translate_index_expr when making the change.

In `@vera/verifier.py`:
- Around line 466-475: The modular-call return path in
SmtContext._translate_call_with_info currently maps Array<T> results to the old
return-var branch (Nat/Bool/ADT/Int) causing arrays to be declared as Int;
update _translate_call_with_info to detect array return types and route them
through the same array-variable logic used in the function-boundary path: call
self._declare_array_var(smt, z3_name, ret_ty) (or fall back appropriately)
instead of smt.declare_int(z3_name), and ensure the resulting symbol is used
wherever the return value is assembled so expressions like Array<T>[i] remain
typed; mirror the handling used around _is_array_type and _declare_array_var in
the verifier branch to keep ADT/Int/Bool/Nat branches unchanged.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 55e7b406-9bbe-456d-bce2-8936427f61d0

📥 Commits

Reviewing files that changed from the base of the PR and between 8c79f22 and 6d64178.

⛔ Files ignored due to path filters (7)

docs/index.html is excluded by !docs/**
docs/index.md is excluded by !docs/**
docs/llms-full.txt is excluded by !docs/**
docs/llms.txt is excluded by !docs/**
examples/json.vera is excluded by !**/*.vera
tests/conformance/ch06_quantifiers.vera is excluded by !**/*.vera
uv.lock is excluded by !**/*.lock, !uv.lock

📒 Files selected for processing (13)

CHANGELOG.md
HISTORY.md
README.md
ROADMAP.md
TESTING.md
pyproject.toml
scripts/check_spec_examples.py
spec/06-contracts.md
tests/test_verifier.py
tests/test_verifier_coverage.py
vera/__init__.py
vera/smt.py
vera/verifier.py

The PR review surfaced three issues that #670's initial commits missed: **CR-X1** (`spec/06-contracts.md:157`) — §6.3.2 still listed "Array element access" as Tier 2, contradicting the §6.3.1 addition in `657f5d1`. Removed the bullet and added a clarifier paragraph noting that array element access + array literals are Tier 1 (with the uninterpreted-function encoding from §6.3.1), not Tier 2; Tier 2 (#427) is reserved for predicates that need user-provided lemmas. **CR-X2/X4** (`vera/smt.py::_get_element_sort_for_array`) — the reverse-lookup parsed the Z3 sort name string and looked up in `_z3_sorts` under the unqualified element key. Worked for nullary ADTs by coincidence, but generic ADTs like `List<Int>` have canonical key `"List<Int>"` (from `_adt_sort_key`) while the Z3 sort name is `"List_Int"` (post `<`/`>` strip in `_get_or_create_adt_sort`). The round-trip failed, returning None and dropping the predicate to Tier 3. Fix: added an explicit `_array_element_sorts: dict[str, z3.SortRef]` dict on `SmtContext`, populated whenever `_get_array_sort` creates a new `Array_<T>` sort. `_get_element_sort_for_array` now does a 3-tier lookup: direct map → primitive pattern match → `_z3_sorts` fallback with canonical-key variants. Storing the association at creation time sidesteps the fragile string round-trip. **CR-X3/X5** (`vera/smt.py::_translate_call_with_info`) — the call-result fresh-var declaration handled only NAT / BOOL / AdtType, falling back to `declare_int` for String / Float64 / Array<T>. Callers couldn't reason about helper return values of those types in postconditions because the result var had the wrong Z3 sort. Fix: added explicit branches for STRING, FLOAT64, and `AdtType ("Array", ...)` mirroring the parameter-declaration dispatch in `vera/verifier.py::_verify_decl`. String returns now get `declare_string`, Float64 returns get `declare_float64`, Array returns get `declare_array_var` (with element-sort resolution via `_vera_type_to_z3_sort`). Imported `FLOAT64` and `STRING` constants from `vera.types`. Added 4 new tests in `tests/test_verifier_coverage.py::TestSmtCoverage667`: - `test_adt_element_array_indexing_verifies_tier1` — pins the ADT-element lookup via the direct map (CR-X2/X4). - `test_string_return_type_typed_at_call_site` — pins String return type propagation through `_translate_call_with_info` (CR-X3/X5). - `test_float64_return_type_typed_at_call_site` — same for Float64. - `test_array_return_type_typed_at_call_site` — same for Array<T>, with `result[i]` in the callee postcondition verifying. Doc counts auto-updated: test count 3,882 → 3,886 (+4), test_verifier_coverage.py 84/1,382 → 88/1,483 (tests/lines). Validation: - `pytest tests/ -q` → 3,858 passed, 14 skipped, 14 deselected - `mypy vera/` → clean - `python scripts/check_walker_coverage.py` → clean - `python scripts/check_doc_counts.py` → consistent - All 9 `TestSmtCoverage667` tests pass (5 originals + 4 new) Refs #667 #670 Co-Authored-By: Claude <noreply@anthropic.invalid>

aallan · 2026-05-13T15:35:14Z

pr-review-toolkit audit outcome

Ran 4 specialised review agents on commit 66a9939. One critical finding corrects a factual error in this PR's own narrative; one important docstring drift flagged; one important test count in CHANGELOG off-by-four; plus suggestions and defensive-hardening recommendations.

Critical (1 — factual narrative error)

The "vacuously verified at Tier 1" framing in this PR is wrong. Comment-analyzer verified empirically against pre-fix main: when the body translation returned None (because of FloatLit / ArrayLit blackouts), vera/verifier.py:564 dropped the contract to Tier 3 with an E522 warning ("Cannot statically verify postcondition…"), not Tier 1. There was no soundness gap — the verifier was already honest about its inability to translate the body. The relaxation to ensures(true) moves the contract from T3 (with warning) to T1 (trivially verified), which explains the 252/26 → 253/25 shift via the relaxation alone, not via the SMT widening.

Files carrying the wrong framing:

examples/json.vera::main comment
tests/conformance/ch06_quantifiers.vera::main/::test_has_zero comment
CHANGELOG.md v0.0.153 "Fixed" section
tests/test_verifier.py::test_overall_tier_counts docstring
The issue comment I posted on Expand smt.translate_expr coverage: FloatLit, ArrayLit, IndexExpr #667
This PR's body

Important (3 — to fix)

CHANGELOG.md says "5 new tests in TestSmtCoverage667" — actually 9 (5 from initial commit + 4 from the call-result-typing follow-up).
_translate_array_lit docstring claims mismatched element sorts "fail the translation" (return None). Actually they raise z3.Z3Exception: sort mismatch uncaught. Either fix the docstring or add a defensive if elt.sort() != element_sort: return None check. Going with the defensive check — cheaper than fixing the docstring while leaving the actual behavior fragile.
No negative tests for deliberately-false contracts on the new constructs. A regression that reverts FloatLit/IndexExpr/ArrayLit to return None would silently pass — exactly the bug class this PR is meant to close. Adding 2-3 negative tests.

Important (defensive, kept as-is with rationale)

or self.declare_int(...) fallback pattern in vera/verifier.py (2 sites) — silent-failure-hunter flagged as "latent under current surface grammar but mirrors the just-fixed bug." Today unreachable; keeping as-is for now with a follow-up issue to either harden or remove once the surface grammar relaxes. Adding # pragma: no cover clarifies the intent.

Suggestions (skipped with brief reasons)

import z3 ordering in verifier.py — PEP 8 nit; consistent with project's lighter-touch import style.
smt._vera_type_to_z3_sort encapsulation break — defensible; _v_t_to_z3_sort is module-internal helper, and the verifier is in the same layer. Promoting it would require renaming downstream.
_get_element_sort_for_array tier 3 documented as "three candidates" but code has 2 — fixed inline; minor wording.
Replace sort_name.startswith("Array_") with explicit set — agent suggested for defence; current check is correct given _get_array_sort's exclusive ownership of Array_* keys.

Strengths confirmed

Test infrastructure correctly distinguishes "no error" from "Tier 1 verified" via tier1_verified >= N assertions.
WALKER_COVERAGE checklist accurately moves FloatLit/IndexExpr/ArrayLit to "Handled" with no false-positive claims (the kind of audit failure Walker-completeness audit: every isinstance(expr, ast.X) dispatch chain enumerates all Expr subclasses #597's script was built to catch).
_is_array_type / _declare_array_var docstrings match implementation.
_get_array_sort's claim that _array_element_sorts is populated at sort-creation time is correct — the only mutation site is _get_array_sort itself.
Spec §6.3.1 entries accurately describe the implementation.
Contract relaxations are conservative (strictly weaker postconditions); the issue is solely the rationale comment text, not the relaxations themselves.

Recommended action plan

Fix the narrative error across 5 files + the issue comment + this PR body.
Bump CHANGELOG test count 5 → 9.
Add defensive sort-mismatch check in _translate_array_lit.
Add 2-3 negative tests for deliberately-false contracts.

Working through these now; will follow up with the fix commit.

🤖 Generated with Claude Code

coderabbitai

Actionable comments posted: 2

♻️ Duplicate comments (1)

spec/06-contracts.md (1)
158-158: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Align the Tier summary table with this Tier 1 array clarification.

Line [158] states array index/literal predicates are Tier 1, but §6.8 still lists arrays under Tier 2 scope. Please make the table consistent with this section to avoid contradictory spec behaviour.

As per coding guidelines, "spec/**/*.md: Language specification. Review for factual accuracy and broken code examples."
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@spec/06-contracts.md` at line 158, The Tier summary table is inconsistent
with §6.3.1: array element access (`@Array<T>.0[i]`) and array literals (`[a, b,
c]`) are described as Tier 1 in the prose but still appear under Tier 2 in the
summary table (and §6.8); update the Tier summary table (and any Tier listing in
§6.8) to classify arrays and array literals/access as Tier 1 to match §6.3.1, or
alternatively adjust §6.3.1 wording to match the table—ensure the table entry
and the prose about arrays (references: §6.3.1, §6.8, and the array examples)
are made consistent.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@tests/test_verifier_coverage.py`:
- Around line 1236-1307: The tests test_string_return_type_typed_at_call_site,
test_float64_return_type_typed_at_call_site, and
test_array_return_type_typed_at_call_site currently only assert there are no
errors; strengthen each by also asserting there are no warnings (so Tier 3
fallback can't pass silently). After computing result = _verify(...), filter
result.diagnostics for severity == "warning" (like how errors are filtered) and
assert that list is empty (or alternatively assert the diagnostics contain
expected Tier 1 predicate counts), using the same local variable names (result,
errors) to locate and modify each test.

In `@vera/smt.py`:
- Around line 1079-1089: The code currently coerces unresolved Array<T> return
types to an Int by calling declare_int when element_sort is None; change this
behavior to return None instead of creating an int so callers won't get the
wrong SMT sort. In the branch that checks isinstance(base_ret, AdtType) and
base_ret.name == "Array", remove or replace the declare_int(fresh) fallback so
that ret_var becomes None when element_sort is None, and ensure any code using
ret_var (call sites of this method) already handles a None (Tier 3) result
safely. Keep references to element_sort, declare_array_var, declare_int and the
Array branch to locate the change.

---

Duplicate comments:
In `@spec/06-contracts.md`:
- Line 158: The Tier summary table is inconsistent with §6.3.1: array element
access (`@Array<T>.0[i]`) and array literals (`[a, b, c]`) are described as Tier
1 in the prose but still appear under Tier 2 in the summary table (and §6.8);
update the Tier summary table (and any Tier listing in §6.8) to classify arrays
and array literals/access as Tier 1 to match §6.3.1, or alternatively adjust
§6.3.1 wording to match the table—ensure the table entry and the prose about
arrays (references: §6.3.1, §6.8, and the array examples) are made consistent.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 38aef081-e2eb-4648-85da-e3a801e51650

📥 Commits

Reviewing files that changed from the base of the PR and between 6d64178 and 66a9939.

📒 Files selected for processing (7)

README.md
ROADMAP.md
TESTING.md
scripts/check_spec_examples.py
spec/06-contracts.md
tests/test_verifier_coverage.py
vera/smt.py

The pr-review-toolkit audit on `66a9939` surfaced one critical factual narrative error and three important findings. This commit addresses all four; suggestions and defensive-hardening items kept as-is with rationale documented on the PR comment. **CRITICAL — narrative correction across 5 surfaces** Comment-analyzer empirically verified that the pre-#667 verifier did NOT vacuously verify the affected contracts at Tier 1. When `body_expr` translated to None (because of FloatLit / ArrayLit blackouts), `vera/verifier.py:564` dropped the contract to **Tier 3 with an E522 warning** ("Cannot statically verify postcondition…"), not Tier 1. There was no soundness gap — the verifier was already honest about its inability to translate the body. My narrative across 5 files was wrong: - `examples/json.vera::main` comment - `tests/conformance/ch06_quantifiers.vera::main` / `::test_has_zero` comment - `CHANGELOG.md` v0.0.153 "Fixed" section - `tests/test_verifier.py::test_overall_tier_counts` docstring - The issue comment on #667 (will post a correcting comment) Corrected framing across all 5: the pre-fix contracts were classified T3-with-warning; post-fix the verifier reaches the contradiction and would report E500 instead; the contracts were honestly relaxed to `ensures(true)` which trivially verifies T1. The 252/26 → 253/25 shift comes entirely from the relaxation of `examples/json.vera::main` (T3-warning → T1-trivial), not from any SMT-widening-driven movement. **IMPORTANT 1 — CHANGELOG test count drift** CHANGELOG said "5 new tests" in `TestSmtCoverage667`; actually 9 (5 from initial commit + 4 from call-result-typing follow-up). Updated to "9 new tests in two clusters" with the cluster breakdown. **IMPORTANT 2 — `_translate_array_lit` defensive sort-mismatch check** Comment-analyzer flagged that the docstring claims mismatched sorts "fail the translation" (return None), but the code has no sort guard — Z3 raises `Z3Exception: sort mismatch` on the `index_fn(lit, i) == elt` axiom. Type checker rejects heterogeneous arrays upstream so this is latent, but I'm going with the defensive fix rather than the docstring-only fix: added `if any(e.sort() != element_sort for e in element_z3s[1:]): return None` before the axiom assertions. Cheaper than fixing the docstring while leaving the actual behaviour fragile. **IMPORTANT 3 — `_get_element_sort_for_array` docstring** Docstring described "three candidates" in tier 3 but the code constructs two. Rewrote the docstring to match: two candidates (raw stripped key + angle-bracketed generic form). **Three new negative / boundary tests** Pr-test-analyzer flagged that all 9 existing #667 tests are positive cases — a regression that reverts FloatLit / IndexExpr / ArrayLit to `return None` would silently pass. Added 3 new tests in `TestSmtCoverage667`: - `test_floatlit_deliberately_false_contract_reports_e500` — `ensures(@Float64.result == 0.0)` over a body returning 1.5 must report E500, not Tier 3 with warning. - `test_arraylit_element_axiom_deliberately_false_reports_e500` — `ensures([1, 2, 3][0] == 999)` must report E500 (pins the per-element axiom is actually asserted to the solver, not just declared). - `test_indexexpr_congruence_in_contract_verifies_tier1` — `@arr.0[i] == @arr.0[i]` must verify trivially via Z3 function congruence (pins the `index_<sort>` is keyed by sort, not per call-site, so two references to the same arr[i] are recognised as identical). **Skipped suggestions (with rationale on PR)** - `import z3` ordering nit in `verifier.py` (PEP 8 nit; project style is lighter-touch). - `smt._vera_type_to_z3_sort` encapsulation suggestion (the verifier is in the same architectural layer; promoting the helper is renaming churn for marginal value). - `or self.declare_int(...)` fallback hardening (silent-failure- hunter flagged as "latent under current surface grammar"; current is defensible until grammar relaxes — would file as a future-hardening issue if seen again). - Replace `startswith("Array_")` with explicit sort set (current check is correct given `_get_array_sort`'s exclusive ownership of `Array_*` keys). Doc counts auto-updated: test count 3,886 → 3,889 (+3 negative tests), test_verifier_coverage.py 88/1,483 → 91/1,557. Validation: - `pytest tests/ -q` → 3,861 passed, 14 skipped, 14 deselected - `mypy vera/` → clean - `python scripts/check_doc_counts.py` → consistent - 12/12 TestSmtCoverage667 tests pass (9 originals + 3 new) Refs #667 #670 Co-Authored-By: Claude <noreply@anthropic.invalid>

aallan · 2026-05-13T15:43:54Z

pr-review fixes landed in `ee81c04`

Following up on the audit summary — all four critical-and-important findings addressed:

Critical — narrative correction

Corrected the "vacuously verified at Tier 1" framing across 5 surfaces. The empirically-correct story:

Pre-Expand smt.translate_expr coverage: FloatLit, ArrayLit, IndexExpr #667: body translation failed (FloatLit / ArrayLit returned None) → vera/verifier.py:564 dropped the contract to Tier 3 with an E522 warning, not Tier 1. The verifier was already honest about its inability to translate the body. No soundness gap.
Post-Expand smt.translate_expr coverage: FloatLit, ArrayLit, IndexExpr #667: body translates fully; verifier reaches the contradiction; would report E500. Contract honestly relaxed to ensures(true) to match what's statically provable from the helpers' ensures(true) clauses. The 252/26 → 253/25 tier-count shift comes entirely from the relaxation (T3-warning → T1-trivial), not from SMT-widening movement.

Files corrected: examples/json.vera, tests/conformance/ch06_quantifiers.vera, CHANGELOG.md v0.0.153, tests/test_verifier.py::test_overall_tier_counts docstring. The original issue comment on #667 will be corrected via a follow-up reply on the issue.

Important — three additional fixes

CHANGELOG test count drift: "5 new tests" → "9 new tests in two clusters" with cluster breakdown.
_translate_array_lit defensive sort-mismatch check: added if any(e.sort() != element_sort for e in element_z3s[1:]): return None before the per-element axiom assertions. The docstring claimed mismatched sorts return None; without the check, Z3 raises an uncaught Z3Exception. Type checker rejects heterogeneous arrays upstream (latent under current grammar), but the defensive guard is correct-by-construction.
_get_element_sort_for_array docstring: said "three candidates" in tier 3 but code has two. Rewrote to match.

Negative tests added

TestSmtCoverage667 gained 3 new negative / boundary tests:

Test	What it pins
`test_floatlit_deliberately_false_contract_reports_e500`	`ensures(@Float64.result == 0.0)` over body returning 1.5 reports E500 — regression that reverts FloatLit→None would silently pass without this
`test_arraylit_element_axiom_deliberately_false_reports_e500`	`ensures([1, 2, 3][0] == 999)` reports E500 — pins that per-element axioms are actually asserted to the solver
`test_indexexpr_congruence_in_contract_verifies_tier1`	`@arr.0[i] == @arr.0[i]` verifies trivially — pins that `index_<sort>` is keyed by sort, not per call-site

Skipped suggestions (with rationale)

import z3 ordering in verifier.py — PEP 8 nit; project style is lighter-touch.
smt._vera_type_to_z3_sort encapsulation — verifier is in the same architectural layer; promoting the helper is renaming churn for marginal value.
or self.declare_int(...) fallback hardening — silent-failure-hunter flagged as "latent under current surface grammar"; defensible until grammar relaxes. If the surface admits Array<FnType<...>> or similar in the future, follow-up issue to harden the fallback into an explicit InternalError assertion.
Replace startswith("Array_") with explicit sort set — current check is correct given _get_array_sort's exclusive ownership of Array_* keys.

Validation

pytest tests/ -q → 3,861 passed, 14 skipped, 14 deselected (+3 from negative tests)
mypy vera/ → clean
12/12 TestSmtCoverage667 tests pass (9 originals + 3 new)
python scripts/check_doc_counts.py → consistent (3,889 tests)

🤖 Generated with Claude Code

Three new + two duplicates. Skipping the duplicates with rationale; fixing all three new findings + the test-shape issue the warning-strict assertion surfaced. **CR-Y3** (`tests/test_verifier_coverage.py:1236-1307`) — the call-site typing tests asserted no errors but not no warnings. Tier 3 fallback emits a warning (E522 / E523), not an error, so the test could pass even if the call-result typing wasn't being respected. Adding `warnings == []` assertions immediately surfaced a second issue: the original tests used `(())` (UnitLit) as the call argument. UnitLit returns None from `translate_expr`, so the whole call's argument-translation short-circuits at `_translate_call_with_info:1037` (`if z3_arg is None: return None`), the call returns None, the body returns None, and the verifier falls to Tier 3 with E522. The call-result-typing code path was never exercised. Restructured the 3 tests with `@Int.0` slot-ref args (which translate cleanly to Z3 Int vars). Now the calls actually succeed, the call-result typing is actually exercised, and the warning-strict assertion catches any future regression that drops back to Tier 3. **CR-Y4** (`vera/smt.py::_translate_call_with_info` Array branch) — the fallback `ret_var = self.declare_int(fresh)` when `element_sort is None` is the same silent-failure pattern silent-failure-hunter flagged earlier in this PR. Replaced with `return None`, signalling Tier 3 cleanly to the caller. The caller's postcondition translation then sees None, classifies as Tier 3, emits an E522 warning — visible failure mode rather than silent type-erasure to Int. **CR-Y5** (`spec/06-contracts.md:342`) — the Tier summary table at §6.8 still listed "arrays" as a Tier 2 concern, contradicting §6.3.1's Tier 1 classification. Updated both rows of the table: - Tier 1 scope expanded to mention real/float (Z3 Real sort), strings, array literals + indexing via uninterpreted `index_<T>` functions (citing #667). - Tier 2 scope narrowed to "quantifiers, lemma/assert hints" (the actual remaining Tier 2 surface, removing the now- Tier-1 arrays). **Skipped duplicates**: - `_get_element_sort_for_array` (`vera/smt.py:665-697`) — same finding as earlier CR; addressed in `66a9939` via `_array_element_sorts` direct map + 3-tier lookup. - `_translate_call_with_info` Array branch routing (`vera/verifier.py:466-475`) — same finding as earlier CR; addressed in `66a9939`. (Note: CR pointed at verifier.py but the actual code path is in smt.py.) Doc counts auto-updated: test_verifier_coverage.py 1,557 → 1,590 lines (3 tests restructured with longer fixtures + warning-strict assertions). Validation: - `pytest tests/ -q` → 3,861 passed, 14 skipped, 14 deselected - `mypy vera/` → clean - 12/12 `TestSmtCoverage667` tests pass with warning-strict assertions Refs #667 #670 Co-Authored-By: Claude <noreply@anthropic.invalid>

coderabbitai

♻️ Duplicate comments (2)

vera/smt.py (1)

1091-1101: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Avoid unsound Int fallback for unresolved Array<T> call results.

At Line 1101, coercing an unresolved Array<T> return to Int can make caller-side predicates type-check against the wrong SMT sort and produce misleading proofs. Return None here so the path cleanly falls back to Tier 3 instead.

Suggested fix

         elif isinstance(base_ret, AdtType) and base_ret.name == "Array":
             # Array<T> return type — declare with a proper Array
             # sort so `result[i]` predicates on the call site can
             # reason about the result via `index_<T>`.
             element_sort: z3.SortRef | None = None
             if base_ret.type_args:
                 element_sort = self._vera_type_to_z3_sort(base_ret.type_args[0])
-            if element_sort is not None:
-                ret_var = self.declare_array_var(fresh, element_sort)
-            else:
-                ret_var = self.declare_int(fresh)
+            if element_sort is None:
+                return None
+            ret_var = self.declare_array_var(fresh, element_sort)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@vera/smt.py` around lines 1091 - 1101, The code falls back to declaring an
Int for unresolved Array<T> return types (in the branch handling
isinstance(base_ret, AdtType) and base_ret.name == "Array"), which is unsound;
change the else branch so it does not call declare_int but instead leaves the
return as None (e.g., set ret_var = None or return None) so the caller-side
logic will drop to Tier 3; update any local usage of ret_var in this scope to
handle None accordingly. Use the symbols base_ret, AdtType,
_vera_type_to_z3_sort, declare_array_var, declare_int, and ret_var to locate the
change and ensure callers that expect a possible None are preserved.

tests/test_verifier_coverage.py (1)

1236-1261: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Strengthen call-site typing regressions to fail on Tier 3 fallback.

These three tests only assert “no errors”, so they can still pass when verification downgrades to Tier 3 warnings. Please also assert no warnings (or explicit Tier 1 counts) so the typed-call-site guarantee is actually pinned.

Suggested hardening

         errors = [d for d in result.diagnostics if d.severity == "error"]
         assert errors == [], f"Expected no errors, got: {errors}"
+        warnings = [d for d in result.diagnostics if d.severity == "warning"]
+        assert warnings == [], f"Expected no warnings, got: {warnings}"

As per coding guidelines, "tests/**/*.py: DO flag tests that don't assert anything meaningful."

Also applies to: 1262-1283, 1358-1381

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_verifier_coverage.py` around lines 1236 - 1261, The test
test_string_return_type_typed_at_call_site currently only asserts no errors via
result.diagnostics and can hide Tier 3 fallback warnings; update the test to
also assert that there are no warnings (e.g., build a warnings list from
result.diagnostics where d.severity == "warning" and assert it is empty) or
explicitly assert the expected Tier 1 count for the call-site typing (inspect
diagnostics for the tier field or expected predicate diagnostic and assert its
tier == 1). Apply the same change pattern to the other tests noted (the blocks
around lines 1262-1283 and 1358-1381) so each test checks both errors and
warnings (or explicit Tier 1 counts) using the same result.diagnostics / errors
variable logic.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Duplicate comments:
In `@tests/test_verifier_coverage.py`:
- Around line 1236-1261: The test test_string_return_type_typed_at_call_site
currently only asserts no errors via result.diagnostics and can hide Tier 3
fallback warnings; update the test to also assert that there are no warnings
(e.g., build a warnings list from result.diagnostics where d.severity ==
"warning" and assert it is empty) or explicitly assert the expected Tier 1 count
for the call-site typing (inspect diagnostics for the tier field or expected
predicate diagnostic and assert its tier == 1). Apply the same change pattern to
the other tests noted (the blocks around lines 1262-1283 and 1358-1381) so each
test checks both errors and warnings (or explicit Tier 1 counts) using the same
result.diagnostics / errors variable logic.

In `@vera/smt.py`:
- Around line 1091-1101: The code falls back to declaring an Int for unresolved
Array<T> return types (in the branch handling isinstance(base_ret, AdtType) and
base_ret.name == "Array"), which is unsound; change the else branch so it does
not call declare_int but instead leaves the return as None (e.g., set ret_var =
None or return None) so the caller-side logic will drop to Tier 3; update any
local usage of ret_var in this scope to handle None accordingly. Use the symbols
base_ret, AdtType, _vera_type_to_z3_sort, declare_array_var, declare_int, and
ret_var to locate the change and ensure callers that expect a possible None are
preserved.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 914b4c01-ec1b-4f4d-bfbe-5f686d84fafc

📥 Commits

Reviewing files that changed from the base of the PR and between 66a9939 and ee81c04.

⛔ Files ignored due to path filters (2)

examples/json.vera is excluded by !**/*.vera
tests/conformance/ch06_quantifiers.vera is excluded by !**/*.vera

📒 Files selected for processing (7)

CHANGELOG.md
README.md
ROADMAP.md
TESTING.md
tests/test_verifier.py
tests/test_verifier_coverage.py
vera/smt.py

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@spec/06-contracts.md`:
- Line 341: The table row text currently reads "Decidable fragment: QF_LIA,
real/float (Z3 `Real` sort), strings..." which is inconsistent because QF_LIA
excludes reals; update the logic label to a correct SMT logic that includes
linear reals (replace "QF_LIA" with "QF_LRA" if only linear reals are supported,
or "QF_LIRA" / "QF_LIA+LRA (mixed int/real)" if mixed int+real is supported) by
editing the row that contains the exact string "Decidable fragment: QF_LIA,
real/float (Z3 `Real` sort), strings (Z3 `String` sort), bool, length, array
literals and indexing via uninterpreted `index_<T>` functions" and make the
chosen replacement so the logic label matches the stated real/float support.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 0cc98c31-7031-4cca-9a75-5e0eb86e5b03

📥 Commits

Reviewing files that changed from the base of the PR and between ee81c04 and 676cc6d.

📒 Files selected for processing (4)

TESTING.md
spec/06-contracts.md
tests/test_verifier_coverage.py
vera/smt.py

Spec §6.8 Tier 1 row claimed "QF_LIA, real/float (Z3 `Real` sort)" — internally inconsistent. QF_LIA is Quantifier-Free **Linear Integer** Arithmetic by SMT-LIB definition; reals are in QF_LRA / QF_LIRA. Vera's verifier uses `z3.IntSort()` AND `z3.RealSort()` (for Float64), so the actual logic is QF_LIRA (mixed integer + real) plus strings + uninterpreted functions / sorts. Rewrote the row to "Z3 decidable fragment: linear integer + real arithmetic (QF_LIRA), bool, strings (Z3 `String` sort), uninterpreted sorts/functions (length, array literals and indexing via `index_<T>` functions — #667)". Drops the contradiction; the QF_LIRA label is the right SMT-LIB name for mixed integer+real LRA. Validation - `python scripts/check_spec_examples.py` -> clean - `grep -n "QF_LIA"` -> 0 matches (no other stale references) Refs #667 #670 Co-Authored-By: Claude <noreply@anthropic.invalid>

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@spec/06-contracts.md`:
- Line 341: The SMT logic label "QF_LIRA" in the table row that mentions strings
and uninterpreted sorts/functions is incorrect; update the label to accurately
reflect the supported theories (for example: replace "QF_LIRA" with a broader
description such as "quantifier-free mixed arithmetic, strings, and
uninterpreted sorts/functions (Z3)" or use a correct combined logic like
"QF_UFLIRA" or another appropriate SMT-LIB name); ensure the change appears in
the same table row that currently lists "Z3 decidable fragment: linear integer +
real arithmetic (QF_LIRA), bool, strings (Z3 `String` sort), uninterpreted
sorts/functions" so the logic label and the listed theories are consistent.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: ebadcb35-aa60-4baa-92b5-67e052329310

📥 Commits

Reviewing files that changed from the base of the PR and between 676cc6d and 127cdf8.

📒 Files selected for processing (1)

spec/06-contracts.md

The QF_LIRA label from `127cdf8` was still misleading — QF_LIRA is mixed Linear Integer + Real Arithmetic but doesn't include strings or uninterpreted functions / sorts. CR's escalation of the previous round's nit catches that no single SMT-LIB logic name covers Vera's full theory portfolio. Rewrote the row to: 1. Drop the contradiction-inducing single label. 2. Describe the theories prose-style ("linear integer + real arithmetic, bool, strings, uninterpreted sorts/functions"). 3. Add a clarifier sentence noting that QF_UFLIRA is the closest standard logic (LIA + LRA + UF) and that strings are a Z3-specific extension to the SMT-LIB standard. The clarifier acknowledges the formal-name limitation rather than picking a wrong-but-shorter label. Future spec edits that add a theory (e.g. fixed-precision bit vectors) just extend the prose list without invalidating a formal-logic claim. Validation - `python scripts/check_spec_examples.py` -> clean - `grep -n "QF_LIA\|QF_LIRA\|QF_UFLIRA\|QF_LRA"` -> single occurrence (the new QF_UFLIRA clarifier reference) Refs #667 #670 Co-Authored-By: Claude <noreply@anthropic.invalid>

aallan mentioned this pull request May 13, 2026

Expand smt.translate_expr coverage: FloatLit, ArrayLit, IndexExpr #667

Closed

coderabbitai Bot reviewed May 13, 2026

View reviewed changes

Comment thread spec/06-contracts.md

Comment thread vera/smt.py Outdated

Comment thread vera/verifier.py

coderabbitai Bot reviewed May 13, 2026

View reviewed changes

Comment thread vera/smt.py

Comment thread vera/verifier.py

coderabbitai Bot reviewed May 13, 2026

View reviewed changes

Comment thread tests/test_verifier_coverage.py

Comment thread vera/smt.py Outdated

coderabbitai Bot reviewed May 13, 2026

View reviewed changes

Comment thread spec/06-contracts.md Outdated

coderabbitai Bot reviewed May 13, 2026

View reviewed changes

Comment thread spec/06-contracts.md Outdated

aallan merged commit 50f74ba into main May 13, 2026
23 checks passed

aallan deleted the fix-667-smt-coverage branch May 13, 2026 16:45

coderabbitai Bot mentioned this pull request Jun 10, 2026

feat: reify proof obligations + warm Z3 session (#222 Phase A) #716

Merged

Conversation

aallan commented May 13, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Latent issue surfaced

Tests

Spec

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Suggested labels

Uh oh!

codecov Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

aallan commented May 13, 2026

pr-review-toolkit audit outcome

Critical (1 — factual narrative error)

Important (3 — to fix)

Important (defensive, kept as-is with rationale)

Suggestions (skipped with brief reasons)

Strengths confirmed

Recommended action plan

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

aallan commented May 13, 2026

pr-review fixes landed in ee81c04

Critical — narrative correction

Important — three additional fixes

Negative tests added

Skipped suggestions (with rationale)

Validation

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

aallan commented May 13, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 13, 2026 •

edited

Loading

codecov Bot commented May 13, 2026 •

edited

Loading

pr-review fixes landed in `ee81c04`