Support TCO for functions with tail-recursive inner functions #3958

rhendric · 2020-11-21T10:07:30Z

This commit adds support for optimizing functions that contain local
functions which call the outer function in tail position, as long as
those functions themselves are only called from tail position, either in
the outer function or in other such local functions.

This enables hand-written mutually-tail-recursive function groups to be
optimized, but more critically, it also means that case guards which
desugar to use local functions don't break TCO.

Closes #3957.

garyb

Nice! Touching anything in here is a little tricky, but as far as I can tell it all looks sensible. And there's the tests, of course. 🙂

JordanMartinez · 2020-11-26T19:12:57Z

Are there any tests we should add to this? Or does this seem good to merge?

rhendric · 2020-11-29T22:40:04Z

I've added a small fixup which addresses a test case I hadn't sufficiently thought through originally.

hdgarrood

I'd be a bit more comfortable with this if the tests exercised that the accumulator is being passed around correctly, e.g. by eventually returning a value which depends on the number of recursive calls made. If the TCO variables weren't being assigned properly somehow (if we were somehow immediately reverting to the base case, say), I think these tests might not catch that. However, I'll leave that up to you; I'm happy for you to go ahead and merge this now too.

rhendric · 2020-12-16T04:31:34Z

One of the tests does depend non-trivially on the recursion being correct, but I can take another pass since TCO seems to be an area of special concern. (In particular, I probably should add some cases exercising more complex chains of inner functions.)

rhendric · 2020-12-20T01:41:17Z

So just as a heads-up: more extensive testing has revealed a number of problems with this patch, which are making me reconsider the approach a little bit. I'll go into more detail when I have more of an opinion formed, but for now, this isn't ready to merge.

rhendric · 2020-12-22T01:36:46Z

Okay, I've resolved all the issues I've found. I didn't change anything too dramatically from the previous implementation but I did miss some important details the first time around. Namely:

Arity

The initial implementation only considered functions with arity 1. Whoops! Handling functions with higher arity, both as the outermost tail-recursive function and as inner functions participating in the recursion, required a bit more care in the implementation. The stickiest part was what to do with functions that are only partially applied. In the original TCO, any tail call is necessarily going to have arity matching the function declaration, because it wouldn't type-check otherwise. But allowing other functions to be involved allows for things like:

f x y = if y <= 0 then x else g x (y - 1)
  where
  g x' = f (x' + 2)

where the recursive call to f is in tail position in g, and g is called in tail position in f, but the last argument is missing from the recursive call to f. That means this can't be TCO'd without performing an eta expansion on g. In theory, the optimizer could perform that expansion automatically, but implicit eta expansion seems like the sort of semantics-altering behavior that we try to avoid in the optimizer. So instead, the above example is not optimized.
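For readers following along, here is a rough hand-written JavaScript sketch of the arity point. It is not actual compiler output, and the names `fNaive` and `fLoop` are made up for illustration. `fNaive` is a direct curried translation of the example as written, where `g` returns a partially applied `f`. `fLoop` shows what a loop could look like if `g` were eta-expanded by hand to `g x' y' = f (x' + 2) y'`, so that every call in the cycle is fully applied.

```javascript
// Direct translation of the example as written: g only supplies the
// first argument of f, so the call to f inside g is a partial application.
const fNaive = x => y => {
  const g = x1 => fNaive(x1 + 2); // g x' = f (x' + 2)
  return y <= 0 ? x : g(x)(y - 1);
};

// Hypothetical loop form, assuming g has been eta-expanded by hand to
//   g x' y' = f (x' + 2) y'
// so that both the call to g and the call to f are saturated tail calls.
const fLoop = x0 => y0 => {
  let x = x0;
  let y = y0;
  while (true) {
    if (y <= 0) return x;
    // Tail call g x (y - 1) ...
    const x1 = x;
    const y1 = y - 1;
    // ... whose body is the tail call f (x' + 2) y'.
    x = x1 + 2;
    y = y1;
  }
};

console.log(fNaive(0)(3)); // 6
console.log(fLoop(0)(3));  // 6
```

Both versions agree on small inputs, but only the loop form can run for a large `y` without growing the stack.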

Nested tail recursion

Currently, functions are TCO'd from the inside out, and they all use the same variable name for loop control. This is not an issue when tail-recursive inner functions don't interact with the tail recursion of outer functions; the inner variable shadows the outer one, which results in correct behavior. Allowing tail recursion to cross function boundaries means that this also needs to be reconsidered; as an example:

f x y = g x (y - 1)
  where
  g x' y' =
    if y' <= 0 then x'
    else if y' > 10000 then g (x' + 3) (y' - 1)
    else f (x' + 2) y'

Both f and g can compile to tail-recursive loops. But different branches of g should trigger either the end of g's loop or the end of both f and g's loops, which means that different variables for controlling those loops are needed. Implementing this meant adding some state to the TCO pass and converting the pass to be top-down rather than bottom-up. I don't think this should have any adverse effects on existing code, but my confidence in that assessment isn't especially high.
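As an illustration of why two loop-control variables are needed, here is a hand-written JavaScript sketch of the example above. It is an assumption about the general shape only, not the compiler's real output: the function is written uncurried for brevity, and the names `fNested`, `fDone`, `gDone`, and `result` are hypothetical.

```javascript
// Hand-written sketch of
//   f x y = g x (y - 1)
//     where g x' y' = if y' <= 0 then x'
//                     else if y' > 10000 then g (x' + 3) (y' - 1)
//                     else f (x' + 2) y'
function fNested(x0, y0) {
  let x = x0;
  let y = y0;
  let fDone = false; // controls the outer loop (f)
  let result;
  while (!fDone) {
    // Body of f: tail call g x (y - 1) starts the inner loop.
    let x1 = x;
    let y1 = y - 1;
    let gDone = false; // controls the inner loop (g)
    while (!gDone) {
      if (y1 <= 0) {
        // Base case: a value is returned, so both loops end.
        result = x1;
        gDone = true;
        fDone = true;
      } else if (y1 > 10000) {
        // Tail call to g: stay in the inner loop with new arguments.
        x1 = x1 + 3;
        y1 = y1 - 1;
      } else {
        // Tail call to f: end only g's loop; f's loop continues.
        x = x1 + 2;
        y = y1;
        gDone = true;
      }
    }
  }
  return result;
}

console.log(fNested(0, 3)); // 4
```

With a single shared flag, the branch that calls `f` could not end `g`'s loop without also ending `f`'s, which is the situation the separate variables avoid.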

thomashoneyman

👍 This looks great to me!

hdgarrood · 2021-03-19T14:09:45Z

This looks good to me too. @rhendric would you like to hit the 'merge' button on this one?

rhendric · 2021-03-19T17:06:36Z

GitHub still tells me I'm not authorized to do that, but otherwise yes!

hdgarrood · 2021-03-19T17:42:53Z

haha oops

hdgarrood · 2021-03-19T20:11:17Z

@rhendric try now?

garyb approved these changes Nov 21, 2020

hdgarrood approved these changes Dec 16, 2020

rhendric marked this pull request as draft December 20, 2020 01:36

rhendric marked this pull request as ready for review December 22, 2020 00:52

rhendric force-pushed the rhendric/fix-3957 branch from 6ac3ffb to 14fe1b7 Compare December 22, 2020 01:04

thomashoneyman approved these changes Dec 23, 2020

rhendric force-pushed the rhendric/fix-3957 branch 3 times, most recently from d1f021c to bd23907 Compare February 17, 2021 22:49

rhendric merged commit efbcc47 into purescript:master Mar 19, 2021

rhendric deleted the rhendric/fix-3957 branch March 19, 2021 20:14

kl0tl mentioned this pull request Apr 3, 2021

Remove unused Data.Foldable.foldr import #4042

Merged

JordanMartinez mentioned this pull request Jan 18, 2022

Update version to v0.15.0 working-group-purescript-es/purescript#4

Closed

rhendric mentioned this pull request Apr 14, 2022

Inline tailRec when followed by an appropriate lambda #4289

Open
