Reduce memory consumption when loading large number of commits #2533

stefanhaller · 2023-03-31T12:21:36Z

When pressing > in the commits panel to trigger loading all the remaining commits past the initial 300, memory consumption is a pretty big problem for larger repositories.

The two main causes seem to be

the cell memory from rendering the entire list of commits into the gocui view
the pipe sets when git.log.showGraph is on

This PR addresses only the first of these problems, by not rendering the entire view, but only the visible portion of it. Since we already re-render the visible portion of the view on every layout call, this was relatively easy to do.

Below are some measurements for our repository at work (261.985 commits):

	master	this PR
without Graph	855 MB	360 MB
with Graph	3.1 GB	770 MB

And for the linux kernel repo (1.170.387 commits):

	master	this PR
without Graph	5.8 GB	1.2G
with Graph	Killed by the OS after it reached 86.9 GB	39.9 GB

The measurements were taken after scrolling all the way down in the list of commits. They have to be taken with a grain of salt, as memory consumption fluctuates quite a bit in ways that I find hard to make sense of.

As you can see, there's more work to do to reduce the memory usage for the graph, but for our repo at work this PR makes it usable already, which it wasn't really before.

The commits prefixed with [gocui] are meant to go into a PR on the gocui repo, and will be replaced with a single "bump gocui" commit here.

github-actions · 2023-03-31T12:24:24Z

Uffizzi Ephemeral Environment `deployment-20904`

☁️ https://app.uffizzi.com/github.com/jesseduffield/lazygit/pull/2533

📄 View Application Logs etc.

What is Uffizzi? Learn more!

jesseduffield · 2023-04-01T00:17:05Z

Nice work. I agree with the approach, though there is a problem: our search functionality (invoked by pressing forward-slash) depends on the content of the view. With this PR, we're only be able to search for content in our commits view within the current page of content.

I spent a bit of time trying to improve our search functionality a while back (so that rather than searching the view we're actually filtering down the list) but I didn't get far enough to commit anything and now it's too divergent to resurrect. More recently however, I did successfully get list filtering working in lazydocker, which does not depend on the contents of the view. A couple relevant files:
https://github.com/jesseduffield/lazydocker/blob/master/pkg/gui/panels/side_list_panel.go
https://github.com/jesseduffield/lazydocker/blob/master/pkg/gui/panels/filtered_list.go

So I'm thinking we first implement that filtering functionality, then go ahead with this PR afterwards. FWIW I'd wanna merge my big refactor PR before any filtering functionality is added. What do you think?

stefanhaller · 2023-04-01T08:09:55Z

As for filtering in general: yes, filtering is one the things that I miss the most in lazygit, so it would be very much appreciated; however, I want this only in the branches list (and tags, maybe), but not in the commits list. For the commits list I'd prefer to not filter it down when searching; the reason is that when hopping from one search result to the next it helps me see the context of the commit I arrived at, e.g. see what branch it is a part of, and maybe type up/down arrow a bit to look at the surrounding commits too.

But I suppose searching could still be implemented in terms of the model data rather than the view content. I haven't looked at the searching code yet to get a feeling for how much work that would be; let me know if you think it would make sense for me to look in that direction, and whether that should be done before or after the big refactor.

jesseduffield · 2023-04-02T01:46:35Z

I'm on board with that approach. Feel free to look into how things are currently done (there's code related to searching in vendor/github.com/jesseduffield/gocui/view.go in the searcher struct. But yes, I'd want those changes to wait for the big refactor first

stefanhaller

This is ready for review now!

It sits on top of #3642.

stefanhaller · 2024-06-06T15:49:51Z

pkg/gui/context/base_context.go

+func (self *BaseContext) NeedsRerenderOnHeightChange() bool {
+	return self.needsRerenderOnHeightChange
+}
+


It's a bit unfortunate that this is a separate flag that clients must remember to set when they set renderOnlyVisibleLines on the ListContextTrait. I guess this could have been avoided by putting it on Context instead of IBaseContext, and then implementing it in ListContextTrait to return renderOnlyVisibleLines. I haven't tried this yet, let me know if you think it's worth trying.

That sounds reasonable to me

I tried, but it doesn't seem to be possible with the way ListContextTrait is implemented, since ListContextTrait embeds a context (usually a SimpleContext). It looks like we would first have to do something about the containment hierarchy of contexts, and this seems like too big of a change for this PR. Plus, I wouldn't actually know how. 😅

Yeah the context hierarchy is indeed a pickle. Happy to leave for another PR

stefanhaller · 2024-06-06T15:52:27Z

pkg/gui/controllers/list_controller.go

 	self.context.GetViewTrait().ScrollUp(scrollHeight)
+	if self.context.RenderOnlyVisibleLines() {
+		return self.context.HandleRender()
+	}


It's unfortunate that we need to do this manually here (and in HandleFocus, see previous commit). It would be nice if we had a way to automatically call it whenever the origin changed (similar to what we do for the height in layout), but I haven't found a good way to do that.

stefanhaller · 2024-06-06T15:56:47Z

pkg/integration/tests/ui/accordion.go

+			SelectNextItem().
+			SelectNextItem().
+			SelectNextItem().
+			SelectPreviousItem().


This is really ugly, and I wonder if there's a better way to deal with this. It is also the only test that happens to have this problem right now, but there could easily be others, for example for a t.Views().Commits().Lines() assertion that expects more lines than fit on the screen. This then also depends on the screen resolution, which is even worse.

I was thinking about disabling the renderOnlyVisibleLines optimization when running integration tests, but I wasn't sure that's a good idea either.

How about a new function called NavigateDownToLine() which just calls SelectNextItem() until the selected line matches the matcher?

That's a good suggestion, but it would be even better to add logic to NavigateToLine so that clients don't have to worry about which one to call. And it turns out that NavigateToLine already has this comment about exactly that:

// NOTE: this currently assumes that BufferLines returns all the lines that can be accessed. // If this changes in future, we'll need to update this code to first attempt to find the item // in the current page and failing that, jump to the top of the view and iterate through all of it, // looking for the item.

I implemented this in a0427bb (and dropped the hack commit again), please have a look.

stefanhaller · 2024-06-06T16:49:30Z

Ah damn, this still has bugs. Setting back to draft for now.

stefanhaller · 2024-06-06T19:03:23Z

pkg/gui/context/list_context_trait.go

+		// those views that support it.
+		totalLength := self.list.Len()
+		if self.getNonModelItems != nil {
+			totalLength += len(self.getNonModelItems())


We are calling getNonModelItems twice now, here and inside of renderLines below. I couldn't come up with a good way of avoiding that, but I'm also not sure it's a problem.

stefanhaller · 2024-06-06T19:04:22Z

@jesseduffield The bug is fixed, this is back in review. There are several things here that I'm not very happy with, curious what you think about these.

jesseduffield

Looking good, see comments

jesseduffield · 2024-06-09T12:00:24Z

pkg/gui/context/base_context.go

+func (self *BaseContext) NeedsRerenderOnHeightChange() bool {
+	return self.needsRerenderOnHeightChange
+}
+


That sounds reasonable to me

jesseduffield · 2024-06-09T12:03:37Z

pkg/integration/tests/ui/accordion.go

+			SelectNextItem().
+			SelectNextItem().
+			SelectNextItem().
+			SelectPreviousItem().


How about a new function called NavigateDownToLine() which just calls SelectNextItem() until the selected line matches the matcher?

ListContextTrait.OnSearchSelect was introduced in 138be04, but it was never called. I can only guess that a planned refactoring wasn't finished here.

Searching in the "Divergence from upstream" view would select the wrong lines. The OnSearchSelect function gets passed a view index, and uses it to select a model line. In most views these are the same, but not in the divergence view (because of the Remote/Local section headers).

We do show the graph in the left/right view, as of b767357.

We want to add an additional method to ISearchableContext later in this branch, and this will make sure that we don't forget to implement it in any concrete context.

Just to be really sure that it not only contains the expected status text, but also actually shows it.

…ighting Previously, jumping to the next search result with 'n' would not show the search result in the selected line. This is especially annoying when there is more than one search result on a single line, because pressing 'n' seemingly does nothing. Now we highlight the search result on top of the selection highlight, which solves the problem nicely.

It is unnecessary to call it every time we draw, but we do need to call it every time the view content changes.

We will need it again when searching the model; extracting the change as a separate commit just to make the diffs of the following commits smaller.

When we add support for searching the model, the length of a highlighted search result isn't necessarily the same as the length of the search string. As an example, consider that you are searching for a full commit hash; the view only renders an abbreviated hash, so when the hash is found in the model, we only want to highlight the abbreviated version. Prepare for this by adding an XEnd field to cellPos; when searching the view, this is always XStart + len(searchString). Also, rename cellPos to SearchPosition and uppercase its fields to expose them; we will use the struct in the API to set a model search function in the next commit.

Add a method View.SetModelSearchFunc; if set, it will be used to look for search results instead of searching the view lines. This will be needed when we optimize some list views to render only the visible portion of the view instead of all lines of the list. However, it also has the benefit that it allows searching for the full text of something that is only rendered in an abbreviated form in the view; the most notable example being commit hashes. This is opt-in, so we still support searching the view as before. This is needed for two reasons: - adding a model search func can be a bit of work, so we should only do it when there's a benefit - some views don't even have a model

…ruction There is no need to overwrite things with spaces; since tcell clears the whole surface before drawing, it is enough to just truncate the line at the current write position.

It is similar to OverwriteLines, except that it also clears all lines outside of the area that was rerendered. This is needed for views that only draw the visible portion of their model, to ensure that cells for invisble lines don't accumulate as you scroll.

When refreshViewportOnChange is true, we would refresh the viewport once at the end of FocusLine, and then we would check at the end of AfterLayout if the origin has changed, and refresh again if so. That's unnecessarily complicated, let's just unconditionally refresh at the end of AfterLayout only.

…dently This is a commit only for testing; we'll drop it again before merging. We now have two separate flags, refreshViewportOnChange and renderOnlyVisibleLines, but we only have views that either use them both, or neither. To test whether the logic around them is really independent, change two other views to use them independently: - the local branches view is changed to refresh viewport on change, and in order to make that testable, we change the branch render function so that the selected branch is rendered yellow. This is comparable to the graph node of the selected commit being rendered white. - the remote branches view is changed to render only the visible lines. Conveniently, this is a view that uses filtering and not searching, so we don't have to bother implementing a model search func.

jesseduffield

LGTM

- **PR Description** This makes it possible to search the model data instead of the view when pressing `/`, and uses this for the commits view. This is mainly a preparation for #2533 which requires it, but it is also useful on its own, because it makes it possible to search for full commit hashes. It will highlight the abbreviated hash in that case. - **Please check if the PR fulfills these requirements** * [x] Cheatsheets are up-to-date (run `go generate ./...`) * [x] Code has been formatted (see [here](https://github.com/jesseduffield/lazygit/blob/master/CONTRIBUTING.md#code-formatting)) * [ ] Tests have been added/updated (see [here](https://github.com/jesseduffield/lazygit/blob/master/pkg/integration/README.md) for the integration test guide) * [ ] Text is internationalised (see [here](https://github.com/jesseduffield/lazygit/blob/master/CONTRIBUTING.md#internationalisation)) * [ ] Docs have been updated if necessary * [x] You've read through your own file changes for silly mistakes etc

(Github decided to auto-close #2533, and I don't see any way to reopen it, so opening a new one here. Please see there for discussion and review.) When pressing `>` in the commits panel to trigger loading all the remaining commits past the initial 300, memory consumption is a pretty big problem for larger repositories. The two main causes seem to be 1. the cell memory from rendering the entire list of commits into the gocui view 2. the pipe sets when git.log.showGraph is on This PR addresses only the first of these problems, by not rendering the entire view, but only the visible portion of it. Since we already re-render the visible portion of the view on every layout call, this was relatively easy to do. Below are some measurements for our repository at work (261.985 commits): | | master | this PR | | ------------- | ------ | ------- | | without Graph | 855 MB | 360 MB | | with Graph | 3.1 GB | 770 MB | And for the linux kernel repo (1.170.387 commits): | | master | this PR | | ------------- | ----------------------------------------- | ------- | | without Graph | 5.8 GB | 1.2G | | with Graph | Killed by the OS after it reached 86.9 GB | 39.9 GB | The measurements were taken after scrolling all the way down in the list of commits. They have to be taken with a grain of salt, as memory consumption fluctuates quite a bit in ways that I find hard to make sense of. As you can see, there's more work to do to reduce the memory usage for the graph, but for our repo at work this PR makes it usable already, which it wasn't really before.

- **PR Description** As a followup to #2533, reduce the memory consumption some more by optimizing the storage of the pipe sets used for the commit graph. Some coarse measurements (taken by opening the respective repo, typing `4 >`, and waiting for all commits to be loaded, and then reading the Memory column in Activity Monitor): | | master | this PR | | ------------- | ------- | ------- | | git | 2.5 GB | 1.0 GB | | our work repo | 2.3 GB | 1.3 GB | | linux kernel | 94.8 GB | 38.0 GB | It's still not really usable for the linux kernel, but for all other repos that I come across in my daily use of lazygit, it works quite well now.

mark2185 mentioned this pull request Apr 19, 2023

Reflog very slow on large git repositories #2560

Open

stefanhaller mentioned this pull request Aug 2, 2023

Searching by commit hash does not bring results #2862

Closed

stefanhaller mentioned this pull request Dec 4, 2023

What about adding a nice way to display a git commit graph? #2843

Closed

stefanhaller mentioned this pull request May 16, 2024

crash on searching in commits #3564

Open

stefanhaller force-pushed the optimize-memory-consumption branch from bd22f04 to 719e69c Compare June 6, 2024 15:31

stefanhaller mentioned this pull request Jun 6, 2024

Search the model instead of the view in the commits panel #3642

Merged

6 tasks

stefanhaller changed the base branch from master to search-the-model-instead-of-the-view June 6, 2024 15:40

stefanhaller added the enhancement New feature or request label Jun 6, 2024

stefanhaller marked this pull request as ready for review June 6, 2024 15:43

stefanhaller commented Jun 6, 2024

View reviewed changes

stefanhaller marked this pull request as draft June 6, 2024 16:49

stefanhaller force-pushed the optimize-memory-consumption branch from 719e69c to dc96c70 Compare June 6, 2024 18:58

stefanhaller marked this pull request as ready for review June 6, 2024 19:01

stefanhaller commented Jun 6, 2024

View reviewed changes

stefanhaller force-pushed the search-the-model-instead-of-the-view branch 2 times, most recently from 42901e9 to 9c561b8 Compare June 7, 2024 15:12

stefanhaller force-pushed the optimize-memory-consumption branch from dc96c70 to 5da2942 Compare June 7, 2024 15:12

jesseduffield reviewed Jun 9, 2024

View reviewed changes

stefanhaller added 6 commits June 10, 2024 12:03

Cleanup: reduce some code duplication

ebb79ac

ListContextTrait.OnSearchSelect was introduced in 138be04, but it was never called. I can only guess that a planned refactoring wasn't finished here.

Cleanup: remove outdated comment

30cb6c9

We do show the graph in the left/right view, as of b767357.

Add type assertions for all searchable contexts

a96685e

We want to add an additional method to ISearchableContext later in this branch, and this will make sure that we don't forget to implement it in any concrete context.

Assert that the search status view is visible

479ffe8

Just to be really sure that it not only contains the expected status text, but also actually shows it.

stefanhaller added 15 commits June 10, 2024 12:03

[gocui] Move updateSearchPositions call from draw to writeRunes

dbbd795

It is unnecessary to call it every time we draw, but we do need to call it every time the view content changes.

[gocui] Extract helper function searchPositionsForLine

4dad9a3

We will need it again when searching the model; extracting the change as a separate commit just to make the diffs of the following commits smaller.

Use model searching in commits (and sub-commits) view

d3a11ee

fixup! [gocui] Support searching the model instead of the view

3da6594

fixup! Use model searching in commits (and sub-commits) view

d04cb9a

Log memory usage every 10s

a225a2b

[gocui] Simplify the implementation of the eraseInLineFromCursor inst…

2233fe0

…ruction There is no need to overwrite things with spaces; since tcell clears the whole surface before drawing, it is enough to just truncate the line at the current write position.

[gocui] Fix overwriting the last line of the buffer

59f1351

[gocui] Add method View.SetContentLineCount

55a2cf9

Only render visible portion of the screen for commits view

1e47fd5

Render the view when scrolling with the wheel

3305a4e

stefanhaller force-pushed the search-the-model-instead-of-the-view branch from 9c561b8 to d04cb9a Compare June 10, 2024 10:04

stefanhaller added 2 commits June 10, 2024 13:16

fixup! Only render visible portion of the screen for commits view

a0427bb

stefanhaller force-pushed the optimize-memory-consumption branch from 5da2942 to f1540db Compare June 10, 2024 11:16

jesseduffield approved these changes Jun 23, 2024

View reviewed changes

stefanhaller force-pushed the search-the-model-instead-of-the-view branch from d04cb9a to 6a6316c Compare June 23, 2024 09:44

stefanhaller deleted the branch jesseduffield:search-the-model-instead-of-the-view June 23, 2024 09:48

stefanhaller closed this Jun 23, 2024

This was referenced Jun 23, 2024

Add functions SetContentLineCount and OverwriteLinesAndClearEverythingElse jesseduffield/gocui#53

Merged

Reduce memory consumption when loading large number of commits #3687

Merged

stefanhaller mentioned this pull request Apr 20, 2025

Reduce memory consumption of graph (pipe sets) #4498

Merged

Uh oh!

Reduce memory consumption when loading large number of commits #2533

Reduce memory consumption when loading large number of commits #2533

Uh oh!

Conversation

stefanhaller commented Mar 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uffizzi Ephemeral Environment deployment-20904

Uh oh!

jesseduffield commented Apr 1, 2023

Uh oh!

stefanhaller commented Apr 1, 2023

Uh oh!

jesseduffield commented Apr 2, 2023

Uh oh!

stefanhaller left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stefanhaller commented Jun 6, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stefanhaller commented Jun 6, 2024

Uh oh!

jesseduffield left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jesseduffield left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stefanhaller commented Mar 31, 2023 •

edited

Loading

github-actions bot commented Mar 31, 2023 •

edited

Loading

Uffizzi Ephemeral Environment `deployment-20904`