proofs: sorry reduction pass 5 — systematic plan and fixes by Th0rgal · Pull Request #1670 · lfglabs-dev/verity

Th0rgal · 2026-03-26T08:10:31Z

Summary

Comprehensive analysis of all 44 active sorries and 76 TYPESIG_SORRY blocks
Identifies 3 root causes blocking progress and defines a phased execution plan
See docs/SORRY_REDUCTION_PLAN.md for the full analysis

Root Causes Identified

1. `scopeNamesPresent` gap (~30 sorries)

collectStmtNames includes storage field names in scope, but scopeNamesPresent requires all scope names to be local variable bindings. Storage fields aren't bindings.
Fix: Remove field name from collectStmtNames for storage-mutating statements.

2. `ExprCompileCore` missing constructors (1 sorry + downstream)

ExprCompileCore lacks constructors for 20+ operators (sdiv, smod, bitAnd, etc.) that pass the surface check. The theorem claiming the implication is genuinely false.
Fix: Add constructors to ExprCompileCore or tighten the surface check.

3. TYPESIG_SORRY `= 0` on `Option Nat` (76 commented-out blocks)

Migration from Nat to Option Nat left broken type signatures using = 0 instead of = some condValue.
Fix: Update type signatures to use split some condValue pattern.

Execution Plan (7 phases)

Phase	What	Sorries resolved	Can parallelize
1	Quick wins (3 requireClauses theorems)	44 → 41	Yes
2	Fix scopeNamesPresent gap	41 → 11	Yes
3	Fix FunctionBody TYPESIG blocks	-10 TYPESIG	Yes
4	Prove downstream cascade	11 → 0	After Phase 2
5	Fix identifier shapes	1 sorry	Yes
6	Expand ExprCompileCore	1 sorry + TYPESIG	Yes
7	Fix remaining TYPESIG blocks	0 TYPESIG	After 2,4,5,6

Test plan

lake build PrintAxioms passes at each commit
Sorry count decreases monotonically
No new sorries introduced in proven code

🤖 Generated with Claude Code

Note

Medium Risk
Touches code generation for Stmt.setStorageAddr and extends core proof lemmas, so mistakes could affect on-chain storage semantics or proof soundness despite added regression tests and documentation.

Overview
Hardens setStorageAddr codegen by masking stored address values with addressMask before sstore, ensuring high bits are cleared; adds a feature test asserting the exact emitted Yul.

Reduces proof friction / expands core coverage by removing storage field names from collectStmtNames (avoids scope contamination), adding helper lemmas about isMapping classification, and extending ExprCompileCore + FunctionBody proofs to cover bitwise ops (and/or/xor/not) and several additional expression forms.

Updates trust/CI plumbing by documenting a new axiom solidityMappingSlot_lt_evmModulus in AXIOMS.md (active axioms now 2) and making the elan installer download more resilient with curl retries.

^{Written by Cursor Bugbot for commit c046d67. This will update automatically on new commits. Configure here.}

Comprehensive analysis of all 44 active sorries and 76 TYPESIG_SORRY blocks across GenericInduction.lean, FunctionBody.lean, Function.lean, and Contract.lean. Identifies three root causes: - scopeNamesPresent gap (collectStmtNames includes storage field names) - ExprCompileCore missing 20+ operator constructors - TYPESIG_SORRY broken Option Nat type signatures Defines 7 execution phases with parallelism opportunities. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 3624be39fc

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T08:14:58Z

docs/SORRY_REDUCTION_PLAN.md

+
+## Current State (post PR #1668)
+
+**Active `by sorry`**: 44 (all in `GenericInduction.lean`)


Reconcile active sorry totals with inventory counts

The headline count (44 active by sorry) conflicts with the inventory below: Category 1 (30) + Category 2 (3) + Category 3 (1) + Category 4 (11) + Category 5 (1) totals 46 active sorries, so the baseline and subsequent phase impact numbers are internally inconsistent. This makes progress tracking and planning error-prone because teams cannot tell which target is authoritative.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-26T08:14:58Z

docs/SORRY_REDUCTION_PLAN.md

+| C | Phase 3 (FunctionBody TYPESIG fixes) | Nothing |
+| D | Phase 6 (ExprCompileCore expansion) | Nothing |
+| E | Phase 5 (identifier shapes) | Nothing |
+| F | Phase 4 (downstream cascade) | Phases 2, 3 |


Include Phase 5 as a prerequisite for Phase 4

The plan states Category 4 is “blocked by Categories 1/3,” but the execution order schedules Phase 4 before Phase 5 and this dependency table lists Stream F as blocked only by Phases 2 and 3. That contradiction makes the sequence infeasible as written, since Category 3 must be resolved before the Category 4 cascade can be completed.

Useful? React with 👍 / 👎.

Move three theorems after their dependencies (stmtListGenericCore_append, stmtListGenericCore_of_stmtListCompileCore) and replace sorry with proofs: - stmtListGenericCore_of_requireClausesOnly: directly via stmtListCompileCore_of_requireLiteralGuardFamilyClauses - stmtListGenericCore_of_requireClausesThenReturnLiteral: via stmtListGenericCore_append with StmtListCompileCore.return_ tail - stmtListGenericCore_of_requireClausesThenLetReturnLocalLiteral: via stmtListGenericCore_append with StmtListCompileCore.letVar + return_ tail Active sorry count: 45 → 42 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0844803d3f

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T08:41:12Z

docs/SORRY_REDUCTION_PLAN.md

+
+Depends on all Tier 3 + Tier 4 theorems. This is the main gateway theorem.
+
+### Category 2: Proofs provable NOW (3 active sorries)


Remove resolved Category 2 sorries from the plan

This section still lists the three requireClauses* theorems as active sorrys, but this commit already proves them in Compiler/Proofs/IRGeneration/GenericInduction.lean (new definitions around lines 11220–11273). Leaving them in Category 2 makes the roadmap and phase impact accounting inaccurate, which can send follow-up work toward already-completed items and distort remaining-effort estimates.

Useful? React with 👍 / 👎.

Storage field names are not local variable bindings, so including them in collectStmtNames caused the scopeNamesPresent gap: stmtNextScope added field names to scope but scopeNamesPresent requires all scope names to be present in runtime.bindings. This removes `field ::` from all storage-mutating cases in collectStmtNames and fixes the affected rcases patterns in stmtListScopeDiscipline_scope_names. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

github-actions · 2026-03-26T08:57:04Z

\n### CI Failure Hints\n\nFailed jobs: `checks`\n\nCopy-paste local triage:\n```bash\nmake check\nlake build\nFOUNDRY_PROFILE=difftest forge test -vv\n```

Adapts the sorry'd proof body for the Option Nat evalExpr return type: - Case-split on evalIRExpr results to extract concrete Nat values - Prove value boundedness via evalExpr_lt_evmModulus_core_of_scope - Show scope inclusion for stmtNextScope using collectStmtNames fix Sorry count: 42 → 41 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 3fbf0af101

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T09:13:11Z

docs/SORRY_REDUCTION_PLAN.md

+
+| Line | Theorem | Statement type |
+|------|---------|---------------|
+| 6299 | `compiledStmtStep_setMappingUint_singleSlot_of_slotSafety_preserves` | setMappingUint |


Drop proved setMappingUint theorem from active-sorry inventory

This row marks compiledStmtStep_setMappingUint_singleSlot_of_slotSafety_preserves as an active sorry, but the same commit supplies a full proof body for that theorem in Compiler/Proofs/IRGeneration/GenericInduction.lean (around lines 6249–6383). Keeping it in the Tier 1 backlog overstates remaining work and skews the phase impact estimates teams would use to plan the next reduction pass.

Useful? React with 👍 / 👎.

Reduce sorry count from 41 to 40 by proving the setMapping preserves theorem using the same Option Nat case-splitting pattern established for setMappingUint. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f9dafe7c63

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T09:19:46Z

scripts/check_lean_hygiene.py


    # Check 3: Fixed sorry baseline after the merged proof-reduction pass.
-    expected_sorry = 46
+    expected_sorry = 42


Update expected sorry baseline to actual count

expected_sorry is set to 42, but this commit leaves only 41 active by sorry occurrences, so python3 scripts/check_lean_hygiene.py fails immediately with “Expected 42 sorry, found 41.” Because this hygiene script is part of the standard verification flow, the mismatch causes deterministic CI/check failures for this revision until the baseline is corrected.

Useful? React with 👍 / 👎.

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

scripts/check_lean_hygiene.py

Uncomment and prove the TYPESIG_SORRY'd CompiledStmtStep wrapper theorems for setMappingUint and setMapping (compileOk via simp + rfl, preserves via the already-proven _preserves theorems). Also prove the StmtListGenericCore cascade theorems stmtListGenericCore_singleton_setMappingUintSingle_of_slotSafety and stmtListGenericCore_singleton_setMappingSingle_of_slotSafety. Sorry count: 41 → 39. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 22bd74fd41

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T09:57:45Z

artifacts/verification_status.json

  "proofs": {
    "axioms": 1,
-    "sorry": 46
+    "sorry": 42


Regenerate verification artifact metrics

This artifact is stale for the commit contents: running python3 scripts/generate_verification_status.py --check fails, and regenerating the file produces proofs.sorry = 39 and theorems.proven = 238 instead of the committed 42/235. Because verification sync checks use this file as the source of truth, the mismatch causes deterministic check failures until the artifact is regenerated from the current tree.

Useful? React with 👍 / 👎.

Uncomment and fix TYPESIG_SORRY'd theorems for mstore_single and tstore_single: preserves, wrapper, and cascade. Key fixes: - Replace non-existent CompiledStmtStep.Preserves with expanded type sig - Fix Option Nat handling (evalExpr returns Option Nat, not Nat) - tstore proof uses runtimeStateMatchesIR_setTransientStorage with value < modulus bound from evalExpr_lt_evmModulus_core_of_scope Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 647fa0dc51

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T10:32:57Z

docs/SORRY_REDUCTION_PLAN.md

+  - `GenericInduction.lean`: 44 blocks
+  - `FunctionBody.lean`: 15 blocks
+  - `Contract.lean`: 15 blocks
+  - `Function.lean`: 2 blocks
+


Correct TYPESIG_SORRY totals in the current-state summary

This summary says there are 76 TYPESIG_SORRY blocks with 44 in GenericInduction.lean, but the current tree no longer matches those numbers (the theorem-block markers now total 70, with GenericInduction.lean at 38 after proofs added in this commit). Leaving stale totals here overstates remaining proof debt and makes the phase sizing/dependency plan unreliable.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-26T10:32:57Z

docs/SORRY_REDUCTION_PLAN.md

+The corresponding `CompiledStmtStep` wrappers and `_singleSlot_preserves` for setStorage,
+setStorageAddr, mstore, tstore. Need type sig fixes + the `scopeNamesPresent` fix.


Drop resolved mstore/tstore items from Tier 1b backlog

Tier 1b still treats mstore/tstore wrappers as unresolved TYPESIG_SORRY work, but this commit already introduces full proofs for compiledStmtStep_mstore_single(_preserves) and compiledStmtStep_tstore_single(_preserves) in GenericInduction.lean. Keeping these items in the active backlog misdirects follow-up work and inflates the Phase 2 dependency chain.

Useful? React with 👍 / 👎.

Bridge lemma proving that when findFieldWithResolvedSlot returns a field with ty = FieldType.address, isMapping returns false (since both functions find the first field by name, and address is not a mapping type). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Add execIRStmt_sstore_of_eval helper: proves sstore with any evalIRExpr- evaluable slot expr produces abstractStoreStorageOrMapping, handling the mappingSlot branch via by_cases on function name - Add execIRStmt_sstore_foldl_mappingSlot: proves sstore with foldl-built mappingSlot chain evaluates correctly, using induction on Forall₂ - Move 4 private helper theorems (compileExprList_core_ok, eval_compileExpr_core_some_of_scope, eval_compileExprList_core_of_scope, evalIRExpr_mappingSlotChain) earlier in file for visibility - Prove compiledStmtStep_setMappingChain_singleSlot_of_slotSafety_preserves with full proof body, wrapper theorem, and cascade - Fix runtimeStateMatchesIR_writeAddressKeyedMapping2Slot: correct storage type from wrong double-abstractStoreMappingEntry to single call with abstractMappingSlot as base - Prove compiledStmtStep_setMapping2_singleSlot_of_slotSafety_preserves with full proof body, wrapper theorem, and cascade Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…32 sorries) Add isMapping_false_of_findFieldWithResolvedSlot_uint256 lemma in Types.lean. Add findFieldWriteSlots_of_findFieldWithResolvedSlot bridge lemma. Uncomment compiledStmtStep_setStorage_singleSlot with hNotMapping parameter and full preserves proof using eval_compileExpr_core_of_scope pattern. Uncomment stmtListGenericCore_singleton_setStorage_singleSlot. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…26 sorries) Move stmtListGenericCore_of_requireClausesThen{SetStorageLiteral, LetSetStorageLocalLiteral, LetAssignSetStorageLocalLiteral, LetAssign{Add,Sub,Mul}SetStorageLocalLiteral} from line ~9563 to after stmtListGenericCore_append (line ~11500) to resolve forward references. Fix proofs: use FunctionBody.ExprCompileCore.{add,sub,mul} (fully qualified), replace `simpa using hmem` with `exact Or.inl hmem` where simp reduces membership in stmtNextScope to a disjunction. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: acc683fa28

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T14:41:55Z

docs/SORRY_REDUCTION_PLAN.md

+
+## Current State
+
+**Active `by sorry`**: 32 (all in `GenericInduction.lean`)


Synchronize plan baseline with enforced sorry count

The plan records 32 active by sorry, but this commit’s own verification sources now enforce/report 26 (scripts/check_lean_hygiene.py sets expected_sorry = 26, and artifacts/verification_status.json reports 26). This stale baseline makes the roadmap’s remaining-work and phase-impact numbers inaccurate, which can mis-prioritize follow-on proof work.

Useful? React with 👍 / 👎.

…3 sorries) Add compileStmt_ok_any_scope and compileStmtList_ok_any_scope lemmas proving that if compilation succeeds with one scope, it succeeds with any scope. Use these to prove three scope-irrelevance sorries in GenericInduction.lean: - compileStmtList_ok_of_stmtListGenericCore - compileStmtList_ok_of_stmtListGenericWithHelpers - compileStmtList_ok_of_stmtListGenericWithHelpersAndHelperIR Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… (23→19 sorries) Key insight: all callers pass scope = inScopeNames via scopeNamesIncluded_refl, so the inScopeNames parameter was unnecessary. Removing it unblocks 4 proofs: - exec_compileStmtList_generic_sizeOf_extraFuel_step: full induction proof over StmtListGenericCore, handling continue/stop/return/revert cases - exec_compileStmtList_generic_with_helpers_sizeOf_extraFuel_step: same pattern for StmtListGenericWithHelpers (helper-aware source semantics) - supported_function_body_correct_from_exact_state_generic: wrapper that converts sizeOf-based fuel to length-based fuel - supported_function_body_correct_from_exact_state_generic_helper_steps_raw: same wrapper for helper-aware variant Added helper: yulStmtList_length_add_sizeOf_le_append to bridge length/sizeOf arithmetic for natural subtraction in fuel calculations. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5430d87371

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T22:59:48Z

Compiler/Proofs/MappingSlot.lean

+axiom solidityMappingSlot_add_wordOffset_lt_evmModulus
+    (baseSlot key wordOffset : Nat) :
+    solidityMappingSlot baseSlot key + wordOffset < Compiler.Constants.evmModulus


Bound solidityMappingSlot_add_wordOffset_lt_evmModulus axiom

This axiom is quantified over all wordOffset : Nat, but that statement is false in general: setting wordOffset = Compiler.Constants.evmModulus requires solidityMappingSlot baseSlot key + evmModulus < evmModulus, which cannot hold for naturals. As written, this introduces an inconsistency into the proof environment (you can derive False), so any theorem depending on this logical context becomes untrustworthy. Add an explicit bound/precondition on wordOffset (or use a bounded offset type) to match the intended struct-layout case.

Useful? React with 👍 / 👎.

Uncomment and prove three theorems in the aliasSlots chain: 1. execIRStmts_let_then_sstore_lit_ident_slots_continue: Replace `storage := by sorry` with actual abstractStoreStorageOrMappingMany expression. 2. compiledStmtStep_setStorage_aliasSlots: Full proof with hNotMapping hypothesis, rcases pattern for evalIRExpr Option Nat, and runtimeStateMatchesIR_writeUintSlots. 3. compiledStmtStep_setStorage_of_validateIdentifierShapes: Proof using by_cases on aliasSlots, dispatching to singleSlot and aliasSlots branches. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c3b2ca5174

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-26T23:23:41Z

Compiler/Proofs/MappingSlot.lean

+axiom solidityMappingSlot_lt_evmModulus (baseSlot key : Nat) :
+    solidityMappingSlot baseSlot key < Compiler.Constants.evmModulus


Document new MappingSlot axioms in AXIOMS.md

Adding this axiom (and solidityMappingSlot_add_wordOffset_lt_evmModulus below) without updating AXIOMS.md breaks the verification pipeline: I checked .github/workflows/verify.yml and it runs scripts/check_axioms.py (line 453), and on this commit that check fails with both new axioms reported as missing plus an active-axiom count mismatch. This causes deterministic CI failure until the axiom registry and trust summary are synchronized with the new declarations.

Useful? React with 👍 / 👎.

Records the addition of keccak axiom (10→6) and aliasSlots chain proof (6→5). Updates remaining sorries analysis with current line numbers and root cause classification. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… sorries) - Add bitAnd/bitOr/bitXor/bitNot constructors to ExprCompileCore inductive - Add eval_compileExpr helpers for all 4 bitwise ops in FunctionBody.lean - Add cases to mutual blocks (eval_compileExpr_core_onExpr, evalExpr_lt_evmModulus_core_onExpr, compileRequireFailCond, eval_compileRequireFailCond) - Add cases to GenericInduction.lean match statements - Tighten SupportedSpec.lean surface checks: sdiv/smod/sgt/slt now return true (unsupported), matching that ExprCompileCore has no constructors for them - Prove exprCompileCore_of_exprTouchesUnsupportedContractSurface_eq_false by structural recursion on Expr, eliminating sorry #1 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 90d2f6b7ce

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-27T04:06:00Z

docs/SORRY_REDUCTION_PLAN.md

+
+## Current State (as of commit 7dee40c3)
+
+**Active `sorry`**: 5 (all in `GenericInduction.lean`)


Sync plan baseline with enforced sorry count

This plan states **Active sorry**: 5, but this same commit enforces/reporting 4 in scripts/check_lean_hygiene.py and artifacts/verification_status.json. Leaving the plan one item high makes the remaining-work and phase-impact estimates internally inconsistent, which can mis-prioritize subsequent sorry-reduction work.

Useful? React with 👍 / 👎.

…led analysis Update the plan after proving sorry #1 (exprCompileCore) in commit 08b717e, bringing the count from 5 to 4. Document three failed attempts at sorry #2 (setMappingPackedWord) with root cause analysis: execIRStmts fuel chaining timeouts and bitwise identity gap. All 4 remaining sorries are architecturally blocked. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… signatures - execIRStmts_compiled_let_core_append_wholeFuel_of_scope: fix evalExpr Option Nat type by adding {valueNat : Nat} + hValueEval hypothesis - execIRStmts_compiled_assign_core_append_wholeFuel_of_scope: same pattern - execIRStmts_compiled_require_core_pass_append_wholeFuel_of_scope: fix by splitting condition into hcondEval + hcondNeZero - execIRStmts_compiled_require_core_fail_append_wholeFuel_of_scope: fix hcondZero to use = some 0 instead of = 0 All four theorems were commented out as TYPESIG_SORRY due to the evalExpr Nat→Option Nat migration. The proof bodies are adapted from the SORRY'D comments with proper monadic lifting handling (rcases on evalIRExpr + simp [Option.bind]). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ith fixed type signatures Fix evalExpr Option Nat type signatures for: - execIRStmts_compiled_let_core_tailExtraFuel_of_scope - execIRStmts_compiled_assign_core_tailExtraFuel_of_scope - execIRStmts_compiled_require_core_pass_tailExtraFuel_of_scope - stmtResultMatchesIRExec_compiled_let_core_tailExtraFuel_of_scope - stmtResultMatchesIRExec_compiled_assign_core_tailExtraFuel_of_scope - stmtResultMatchesIRExec_compiled_require_core_pass_tailExtraFuel_of_scope Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… scopeDiscipline + validateFunctionIdentifierReferences chain Add hNotMapping parameter to match updated compiledStmtStep_setStorage_of_validateIdentifierShapes signature. Prove collectStmtListBindNames/AssignedNames prefix subset lemmas inline for scope name inclusion. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

FunctionBody.lean (5 blocks): - eval_compileRequireFailCond_core: superseded by _onExpr at 3975 - eval_compileRequireFailCond_core_of_scope: superseded by proved version at 5138 - stmtResultMatchesIRExec_compiled_terminal_ite_then: superseded by proved at 10214 - stmtResultMatchesIRExec_compiled_terminal_ite_else: superseded by proved at 10422 - execIRStmts_compiled_return_core_append_wholeFuel: superseded by _of_scope at 10727 GenericInduction.lean (2 blocks): - stmtListGenericCore_of_supportedStmtList_append_of_surface: superseded by inline at 11475 - stmtListGenericCore_of_supportedStmtList_requireClause_of_surface: superseded by inline at 11475 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b459fb3bd0

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-27T06:56:12Z

Compiler/Proofs/IRGeneration/GenericInduction.lean

+      (by rw [hbody]; exact collectStmtListAssignedNames_prefix_subset _ _ _ h))
+  · exact Or.inr h
+
+theorem compiledStmtStep_setStorage_of_validateIdentifierShapes_of_validateFunctionIdentifierReferences


Allowlist or split over-limit setStorage proof

This theorem is newly proved in this commit but now exceeds the hard proof-length gate: running python3 scripts/check_proof_length.py --format=text on be900a7 reports compiledStmtStep_setStorage_of_validateIdentifierShapes_of_validateFunctionIdentifierReferences at 186 lines and exits non-zero. Since the verify workflow runs this checker, the revision fails CI deterministically until this proof is decomposed or added to scripts/check_proof_length.py's allowlist with justification.

Useful? React with 👍 / 👎.

…Writes - Add stmtListTouchesUnsupportedContractSurfaceExceptMappingWrites_append lemma - Move and prove stmtListGenericCore_of_supportedStmtList_append_of_surface_exceptMappingWrites - Move and prove stmtListGenericCore_of_supportedStmtList_requireClause_of_surface_exceptMappingWrites - Both moved after stmtListGenericCore_append to resolve forward reference Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

#4 - Uncomment TYPESIG_SORRY block at line 7789: prove compileOk for compiledStmtStep_setMappingPackedWord_singleSlot_of_slotSafety using simp lemmas for Bool.not_true, bne_self_eq_false, and monadic unfolding - Eliminate sorry #4: prove stmtListGenericCore_singleton_setMappingPackedWordSingle_of_slotSafety by chaining through the newly-uncommented theorem - Sorry count: 4 → 3 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Record sorry #4 elimination (setMappingPackedWord singleton) - Record TYPESIG_SORRY reduction progress - Update remaining sorry line numbers and analysis - Add TYPESIG_SORRY status table Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* docs: register 2 new mapping-slot axioms in AXIOMS.md and sync all docs The codex/reduce-sorries-pass-5 branch added two axioms in Compiler/Proofs/MappingSlot.lean (solidityMappingSlot_lt_evmModulus and solidityMappingSlot_add_wordOffset_lt_evmModulus) without updating AXIOMS.md, causing CI check_axioms.py to fail. - Add axiom entries #2 and #3 to AXIOMS.md - Update active axiom count from 1 to 3 - Update DOCUMENTED_AXIOMS in check_axioms.py - Sync all docs (README, TRUST_ASSUMPTIONS, VERIFICATION_STATUS, llms.txt) - Update check_layer2_boundary_sync.py expected/forbidden snippets - Fix test_check_layer2_boundary_sync.py assertions Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: allowlist long setStorage proof in check_proof_length.py Add compiledStmtStep_setStorage_of_validateIdentifierShapes_of_validateFunctionIdentifierReferences to the proof length allowlist — it's a mechanical bridge proof (186 lines) that threads identifier validation through the compiled storage-write step. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d0178c10ad

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-27T10:34:19Z

Compiler/CompilationModel/StorageWrites.lean

+        let storedValueExpr :=
+          if requireAddressField then
+            YulExpr.call "and" [valueExpr, YulExpr.hex Compiler.Constants.addressMask]


Apply address masking in packed setStorageAddr paths

When requireAddressField is true, this change introduces storedValueExpr masked by addressMask, but only the non-packed branches use it; the packed branches still pass raw valueExpr to compilePackedStorageWrite/compileCompatPackedStorageWrites. Since packed bits are allowed on non-mapping fields, Stmt.setStorageAddr can still persist high bits in packed address fields (notably if packed width is >160), which makes packed and non-packed address writes semantically inconsistent. Thread the masked expression through packed branches too.

Useful? React with 👍 / 👎.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4545f84b14

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-27T14:08:36Z

Compiler/Proofs/IRGeneration/GenericInduction.lean

+  rw [hdyn']
+  simp [SourceSemantics.writeAddressKeyedMappingWordSlots, mappingWordTargetSlot,
+    SourceSemantics.wordNormalize, Verity.Core.Uint256.val_ofNat]
+  exact hvalue

 private theorem encodeStorageAt_writeAddressKeyedMappingPackedWordSlots_singleton_eq_written


Keep packed-word singleton proof under length gate

This theorem grew to 54 lines in b5b2300, which breaches the 50-line hard limit enforced by scripts/check_proof_length.py; running python3 scripts/check_proof_length.py --format=text on this commit reports it as a violation and exits non-zero, while the parent commit passes. Because .github/workflows/verify.yml runs this checker (Generate proof length report), this change causes deterministic CI failure until the proof is decomposed or allowlisted.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-27T14:08:36Z

Compiler/Proofs/IRGeneration/GenericInduction.lean

-  have hprefix : "__compat_value".startsWith "__" := by
+  exact hscopeReserved.1
+
+private theorem compatScratch_startsWith_reserved


Split or allowlist compatScratch_startsWith_reserved

This newly added proof is 108 lines, above the 50-line hard cap in scripts/check_proof_length.py; running the checker on b5b2300 reports this theorem as failing and returns non-zero. Since the verify workflow invokes that checker (.github/workflows/verify.yml, proof-length step), this introduces a hard CI regression until the theorem is split or explicitly allowlisted with justification.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-27T14:08:36Z

Compiler/Proofs/IRGeneration/GenericInduction.lean

+            left
+            decide
+
+private theorem compatScratch_not_internalImmutable


Split or allowlist compatScratch_not_internalImmutable

This new theorem is 116 lines and exceeds the proof-length hard limit (50) enforced by scripts/check_proof_length.py; on commit b5b2300, the checker flags it and fails the run. Because CI executes that same check in .github/workflows/verify.yml, this change makes verification fail deterministically until the proof is decomposed or added to the allowlist.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-27T14:08:36Z

Compiler/Proofs/IRGeneration/GenericInduction.lean

+      exact ⟨runtimeStateMatchesIR_writeUintSlot hruntime hresolvedSlot hnotAddr hnotDyn hvalueLt,
+        hexact', hbounded, hscope'⟩
+
+private theorem compiledStmtStep_setStorageAddr_singleSlot_preserves


Split or allowlist compiledStmtStep_setStorageAddr proof

This proof is now 135 lines, far above the 50-line hard threshold enforced by scripts/check_proof_length.py; executing the checker on b5b2300 reports this theorem as a violation and exits non-zero. Since the verify pipeline runs this checker (.github/workflows/verify.yml), the commit cannot pass CI until this theorem is broken up or explicitly allowlisted.

Useful? React with 👍 / 👎.

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 2 commits March 26, 2026 09:36

chore: auto-refresh derived artifacts

0844803

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 2 commits March 26, 2026 10:08

chore: auto-refresh derived artifacts

3fbf0af

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

prove compiledStmtStep_setMapping_singleSlot_of_slotSafety_preserves

f9dafe7

Reduce sorry count from 41 to 40 by proving the setMapping preserves theorem using the same Option Nat case-splitting pattern established for setMappingUint. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

cursor bot reviewed Mar 26, 2026

View reviewed changes

scripts/check_lean_hygiene.py Outdated Show resolved Hide resolved

claude and others added 2 commits March 26, 2026 10:52

chore: auto-refresh derived artifacts

0c4d840

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 2 commits March 26, 2026 11:24

chore: auto-refresh derived artifacts

647fa0d

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 3 commits March 26, 2026 11:41

Th0rgal force-pushed the codex/reduce-sorries-pass-5 branch from 7245a48 to a4d79b2 Compare March 26, 2026 14:21

github-actions bot and others added 4 commits March 26, 2026 14:21

chore: auto-refresh derived artifacts

48aac16

docs: update sorry reduction plan with pass 5 progress (44→32)

7bda5f9

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: auto-refresh derived artifacts

acc683f

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 2 commits March 26, 2026 16:53

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 2 commits March 27, 2026 00:18

chore: auto-refresh derived artifacts

c3b2ca5

chatgpt-codex-connector bot reviewed Mar 26, 2026

View reviewed changes

claude and others added 3 commits March 27, 2026 01:10

chore: auto-refresh derived artifacts

90d2f6b

chatgpt-codex-connector bot reviewed Mar 27, 2026

View reviewed changes

claude and others added 7 commits March 27, 2026 06:30

chore: auto-refresh derived artifacts

5dc508c

chore: auto-refresh derived artifacts

b459fb3

chatgpt-codex-connector bot reviewed Mar 27, 2026

View reviewed changes

claude and others added 7 commits March 27, 2026 08:34

chore: auto-refresh derived artifacts

16e3af7

fix mapping-slot axiom sync and address write masking

53f1666

chore: auto-refresh derived artifacts

d0178c1

chatgpt-codex-connector bot reviewed Mar 27, 2026

View reviewed changes

claude and others added 3 commits March 27, 2026 15:01

Close packed mapping proof hole

4164973

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

proofs: remove remaining GenericInduction sorrys

c2f325a

chore: auto-refresh derived artifacts

4545f84

chatgpt-codex-connector bot reviewed Mar 27, 2026

View reviewed changes

claude and others added 2 commits March 27, 2026 16:07

Finish packed mapping preservation proof

c046d67

chore: auto-refresh derived artifacts

fa39679


		## Current State (post PR #1668)

		Active `by sorry`: 44 (all in `GenericInduction.lean`)


		Depends on all Tier 3 + Tier 4 theorems. This is the main gateway theorem.

		### Category 2: Proofs provable NOW (3 active sorries)

		The corresponding `CompiledStmtStep` wrappers and `_singleSlot_preserves` for setStorage,
		setStorageAddr, mstore, tstore. Need type sig fixes + the `scopeNamesPresent` fix.


		## Current State

		Active `by sorry`: 32 (all in `GenericInduction.lean`)

		axiom solidityMappingSlot_lt_evmModulus (baseSlot key : Nat) :
		solidityMappingSlot baseSlot key < Compiler.Constants.evmModulus


		## Current State (as of commit 7dee40c3)

		Active `sorry`: 5 (all in `GenericInduction.lean`)

Conversation

Th0rgal commented Mar 26, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Root Causes Identified

1. scopeNamesPresent gap (~30 sorries)

2. ExprCompileCore missing constructors (1 sorry + downstream)

3. TYPESIG_SORRY = 0 on Option Nat (76 commented-out blocks)

Execution Plan (7 phases)

Test plan

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Th0rgal commented Mar 26, 2026 •

edited by cursor bot

Loading

1. `scopeNamesPresent` gap (~30 sorries)

2. `ExprCompileCore` missing constructors (1 sorry + downstream)

3. TYPESIG_SORRY `= 0` on `Option Nat` (76 commented-out blocks)

github-actions bot commented Mar 26, 2026 •

edited

Loading