bindings/go: Add memory usage test for streaming parser by varungandhi-src · Pull Request #181 · scip-code/scip

varungandhi-src · 2023-06-28T05:01:02Z

We rely on low memory usage in Sourcegraph. https://github.com/sourcegraph/sourcegraph/pull/53828

So add a test to avoid regressing memory usage.

Test plan

Added a new test
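
For readers unfamiliar with this kind of test, here is a minimal sketch of one way to guard against memory regressions in Go. It is not the test added in this PR; the helper names `heapInUse` and `peakHeapGrowth`, the 1 MiB chunk size, and the step count are all hypothetical. The idea is to force a garbage collection after each unit of work and record how far the live heap grows above a baseline.

```go
package main

import (
	"fmt"
	"runtime"
)

// heapInUse forces a garbage collection and returns the number of bytes
// of heap objects that are still allocated afterwards.
func heapInUse() uint64 {
	runtime.GC()
	var m runtime.MemStats
	runtime.ReadMemStats(&m)
	return m.HeapAlloc
}

// peakHeapGrowth (hypothetical helper) runs step n times and returns the
// largest growth of the live heap over the starting baseline that was
// observed after any single step.
func peakHeapGrowth(n int, step func(i int)) uint64 {
	base := heapInUse()
	var peak uint64
	for i := 0; i < n; i++ {
		step(i)
		if cur := heapInUse(); cur > base && cur-base > peak {
			peak = cur - base
		}
	}
	return peak
}

func main() {
	const chunk = 1 << 20 // 1 MiB per simulated document (assumed size)

	// Streaming style: each chunk becomes garbage once the step returns.
	streaming := peakHeapGrowth(16, func(int) {
		buf := make([]byte, chunk)
		buf[0] = 1
	})

	// Retaining style: every chunk stays reachable, so the heap keeps growing.
	var kept [][]byte
	retaining := peakHeapGrowth(16, func(int) {
		kept = append(kept, make([]byte, chunk))
	})

	fmt.Println("streaming peak growth (bytes):", streaming)
	fmt.Println("retaining peak growth (bytes):", retaining, "chunks kept:", len(kept))
}
```

A real test would replace the closures with calls into the parser under test and fail if the peak exceeds a chosen budget. Forcing a collection on every step is slow, so this suits a dedicated test rather than production code.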

efritz

Comments are just doc nits

…55160)

This should reduce memory usage significantly, as in the common case (no two documents share the same relative path), we will end up processing one document at a time. I've tested the new code with a 5.7GB Chromium index, and we're able to process it with even 100MB of memory (at the cost of increased GC pressure).

We need to iterate over the index twice, first to get all external symbols, and then to process documents, as document processing requires access to the external symbols list. This means we need the ability to seek to the start again. I've implemented that as follows:

- For small indexes, just read the index into a slice.
- For large indexes, save the compressed index to a temporary file on disk, and rely on the GC and page cache to transparently drop pages earlier in the file when under memory pressure. I figured decompressing is cheap enough that it doesn't make sense to have the extra I/O overhead of reading/writing the uncompressed index.

Other changes:

- Documents are no longer sorted by relative path during iteration. The order of iteration is still deterministic though, as it matches the order of documents in the index.

Questions:

- Should we add instrumentation to see the memory usage at different stages of processing an index?
- How do we add a memory usage test (either here or in the upstream scip bindings)? ([internal Slack discussion](https://sourcegraph.slack.com/archives/C3B3SDBMY/p1687751363185919))

## Test plan

- [x] Update existing tests
- [x] Add low memory usage test -- added upstream. scip-code/scip#181

Backport b98ca76 from #53828

Co-authored-by: Varun Gandhi <varun.gandhi@sourcegraph.com>

varungandhi-src force-pushed the vg/memtest branch 2 times, most recently from 926abaa to c3efd5a Compare June 28, 2023 05:02

varungandhi-src requested a review from efritz June 28, 2023 05:02

varungandhi-src mentioned this pull request Jun 28, 2023

uploads: Use streaming API for ingesting SCIP indexes sourcegraph/sourcegraph-public-snapshot#53828

Merged

2 tasks

varungandhi-src force-pushed the vg/memtest branch from b3a4ed6 to f73a5d5 Compare June 28, 2023 05:10

varungandhi-src changed the title ~~test: Add memory usage test for streaming parser~~ bindings/go: Add memory usage test for streaming parser Jun 28, 2023

test: Add memory usage test for streaming parser

b886c06

varungandhi-src force-pushed the vg/memtest branch from f73a5d5 to b886c06 Compare June 29, 2023 11:05

efritz approved these changes Jun 29, 2023

View reviewed changes

Comment thread bindings/go/scip/memtest/low_mem_test.go

Comment thread bindings/go/scip/memtest/low_mem_test.go

Comment thread bindings/go/scip/memtest/low_mem_test.go

varungandhi-src added 3 commits June 29, 2023 20:34

test: Add stub file with explanation

adff061

test: Add comment about total size

eb3a363

docs: Repeat warning about not adding tests

aadff27

efritz approved these changes Jun 29, 2023

View reviewed changes

varungandhi-src merged commit ca0c0bc into main Jul 3, 2023

varungandhi-src deleted the vg/memtest branch July 3, 2023 02:05

github-actions Bot mentioned this pull request Jul 20, 2023

[Backport 5.1] uploads: Use streaming API for ingesting SCIP indexes sourcegraph/sourcegraph-public-snapshot#55160

Merged

2 tasks

varungandhi-src mentioned this pull request Aug 3, 2023

bindings/go: Add document-granularity streaming Index parser #172

Merged

1 task

varungandhi-src merged 4 commits into main from vg/memtest
