Commit 62b5285
Address CodeRabbit review on PR #70
- AILANG_RESULTS.md: reconcile conflicting LLM-eval status
(line 7 said wired, line 110 said not-wired); annotate Haiku
100% as post-fix run, Kimi K2.5 stays 97%, both date-stamped.
- vera_bench/prompts.py: catch subprocess.TimeoutExpired on
`ailang prompt` and surface as RuntimeError instead of letting
the exception escape unstructured.
- vera_bench/runner.py: treat ALL non-zero `ailang check` exits
as failures (except the explicit missing-main allowance);
previously untagged compile errors could be misclassified as
check_pass=True. Removes now-dead _is_ailang_compile_error
helper.
Three other CodeRabbit comments are intentional non-fixes:
- VB_T2_009/T2_010/T4_009 empty `main = ()`: these problems
have `test_cases: []`, so the baseline runner correctly uses
check-only mode (baseline_runner.py:593). No-op main is right.
- VB_T1_007 safe_modulo / VB_T4_010 div_natural defensive
programming: none of the published test_cases exercise b=0
or b<0, so the baselines match spec output. Hardening could
ship as a follow-up.
- VB_T5_009 state_max: the spec description mentions
State<Int>/handler, but test_cases verify only the observable
behavior (n -> n). Baseline matches tests; a state-handler
implementation is a follow-up.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>1 parent 204f93f commit 62b5285
3 files changed
Lines changed: 15 additions & 27 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
16 | | - | |
17 | | - | |
| 16 | + | |
| 17 | + | |
18 | 18 | | |
19 | | - | |
| 19 | + | |
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
| 39 | + | |
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
| |||
107 | 107 | | |
108 | 108 | | |
109 | 109 | | |
110 | | - | |
| 110 | + | |
111 | 111 | | |
112 | 112 | | |
113 | 113 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
273 | 273 | | |
274 | 274 | | |
275 | 275 | | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
276 | 281 | | |
277 | 282 | | |
278 | 283 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
546 | 546 | | |
547 | 547 | | |
548 | 548 | | |
549 | | - | |
550 | | - | |
551 | | - | |
552 | | - | |
553 | | - | |
554 | | - | |
555 | | - | |
556 | | - | |
557 | | - | |
558 | | - | |
559 | | - | |
560 | | - | |
561 | | - | |
562 | | - | |
563 | | - | |
564 | | - | |
565 | | - | |
566 | | - | |
567 | 549 | | |
568 | 550 | | |
569 | 551 | | |
| |||
727 | 709 | | |
728 | 710 | | |
729 | 711 | | |
730 | | - | |
731 | | - | |
732 | | - | |
| 712 | + | |
| 713 | + | |
| 714 | + | |
| 715 | + | |
733 | 716 | | |
734 | 717 | | |
735 | | - | |
| 718 | + | |
736 | 719 | | |
737 | 720 | | |
738 | 721 | | |
| |||
0 commit comments