RFC: tweak structure instance elaboration to avoid un-needed eta expansion

Currently, we have the following behavior for structure instance elaboration 
```lean 
structure A where 

structure B extends A where 

def a : A := sorry 

def b : B := { a with } 
#print b 
-- def b' : B :=
-- let src := a;
-- { toA := { } } <- There is an extra eta expansion for the `toA` field: 
```

I believe it would be advisable to change this to the following behavior
```lean 
structure A where 

structure B extends A where 

def a : A := sorry 

def b : B := { a with } 
#print b 
-- def b' : B :=
-- let src := a;
-- { toA := src } <- There is no eta expansion for the `toA` field: 
```

The extra eta expansion is due to the following: the function [`mkProjStx?`](https://github.com/leanprover/lean4/blob/d29b8e542236020f195faad5d7a4af566342f9ff/src/Lean/Elab/StructInst.lean#L453) will not apply when the arguments are `` _ `A `toA`` as `` `toA`` is not a field of `` `A.``  

The change in behavior can be achieved by adding two lines to the function `addMissingFields` [here](https://github.com/leanprover/lean4/blob/d29b8e542236020f195faad5d7a4af566342f9ff/src/Lean/Elab/StructInst.lean#L508): 
```lean4
if let some stx := s.source.explicit.find? (·.structName == substructName)|>.map (·.stx) then
  addField (FieldVal.term stx)
```
This accounts for the case where the projection is exactly to the type of an explicit source. 

> User Experience: How does this feature improve the user experience?

There is no extra eta expansion when constructing fields which are also fields of the explicit sources. This change makes the behavior more uniform and therefore reduces mental overhead for the user. 

Furthermore, there is a performance penalty for the extra eta expansion. While small for a single instance, things can snowball when the pattern is nested. The expanded term forces unfolding during unification when otherwise it would not be necessary. 

> Beneficiaries: Which Lean users and projects do benefit most from this feature/change?

The largest benefit is for the libraries with deeply nested type class hierarchies, so Mathlib. 

In leanprover-community/mathlib4#6759 I benchmarked Mathlib with a toolchain containing this change plus some complementary changes directly to Mathlib.  The [changes](http://speed.lean-fro.org/mathlib4/compare/e8adfe39-bc50-4ffb-bcfc-d119b2b5e7cb/to/ace9c3f2-3d04-4f20-811a-6e7ca9be167f) were overall very positive including a 4% reduction to total CPU instructions and dramatic improvements in many files.

Furthermore, at least one example of a serious timeout in algebraic hierarchy presented on Zulip has been completely eliminated. 

> Community Feedback: Have you sought feedback or insights from other Lean users?

This arose from multiple discussions on Zulip about performance problems in Mathlib, especially in the algebraic hierarchy. It bubbled up already in a separate issue #2387. 

> Maintainability: Will this change streamline code maintenance or simplify its structure?

Given the simplicity of the change, I would not expect additional maintenance burden. 

I have a [branch](https://github.com/leanprover/lean4/compare/master...mattrobball:lean4:reduce_etas) on my fork with this change plus a simple test (as above) to document the behavior and am ready to submit a PR. Looking forward to hearing feedback!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: tweak structure instance elaboration to avoid un-needed eta expansion #2451

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

RFC: tweak structure instance elaboration to avoid un-needed eta expansion #2451

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions