Rewrite batch mode to use temp directory #136
Write batch data to temp directory instead of memory to avoid high RAM usage with large files. Each batch creates its own temp directory, writes blocks directly to disk, and renames all files atomically on commit.
Thank you for submitting this PR!
Getting other community members to review is a great help too on complex PRs (you can ask in the chats/forums). If you are unsure about something, just leave us a comment.
We currently aim to provide initial feedback/triage within two business days. Please keep an eye on any labelling actions, as these indicate the priority and status of your contribution.
- Removed unused tempFileOnce() method
- Replaced unused 'done' variable with '_' in Commit method
Only check temp directory for keys that exist in puts slice
Force-pushed from ddd53d9 to d939e0e
Need to review. Check that there is still the ability to do in-memory batching for smaller batches.
My concern is that this will negatively impact the performance of Put. This will make Commit faster because the temp files are already written, but existing code may expect a Put that is not slowed down by writing to the filesystem.
It may be worth considering having Put write the temp files asynchronously in a separate goroutine. Then Commit would wait for any outstanding Put to finish, handle any errors from async Puts, and then move all the temp files to their final destination.
Triage questions that come up:
Triage notes:
@requilence gentle ping, are you able to answer where this problem/need surfaced? Are you using Kubo or your own implementation? On what hardware are you running?
I do not think we need or want to guarantee this, because the ability to reuse a Batch was not consistent across different datastores, so it should not be done in general.
Address PR review feedback
@lidel Thanks for the review! I've addressed the feedback:
Async Write Operations
We are using flatfs and the IPLD structures in Anytype (http://github.com/anyproto/anytype-heart) to store files on users' devices. If we are not using a batch operation, we can find ourselves in a situation where the user force-closes the app in the middle of adding a large file (e.g., a 4 GB video), which will leave a lot of garbage blocks in flatfs. Therefore, we are using atomic batch operations. At the same time, we are trying to avoid heap allocations and to reuse memory, as this is especially critical on mobile devices.
@lidel, I'm not sure async Put is really a good option for us. We would prefer to do it synchronously; otherwise, it could lead to heap spikes. I'm considering a clean way to make this configurable while maintaining backward compatibility. Perhaps a new OpenWithArgs that takes variadic arguments? It would also make it possible to override the fs.FS implementation with a custom or mocked version.
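The proposed variadic `OpenWithArgs` could follow Go's functional-options pattern. Everything below is a hypothetical sketch: `OpenWithArgs`, `Option`, and `WithAsyncPuts` are illustrative names, not the real flatfs API:

```go
package main

import "fmt"

// config holds hypothetical batch tuning knobs.
type config struct {
	asyncPuts bool
}

// Option mutates the config; callers pass zero or more of these.
type Option func(*config)

// WithAsyncPuts toggles asynchronous batch writes (hypothetical option).
func WithAsyncPuts(enabled bool) Option {
	return func(c *config) { c.asyncPuts = enabled }
}

// OpenWithArgs sketches a variadic constructor that keeps backward
// compatibility: with no options it behaves like the existing Open.
func OpenWithArgs(path string, opts ...Option) *config {
	c := &config{asyncPuts: true} // illustrative default
	for _, o := range opts {
		o(c)
	}
	return c
}

func main() {
	// Memory-constrained callers can opt into synchronous puts.
	ds := OpenWithArgs("/tmp/flatfs", WithAsyncPuts(false))
	fmt.Println(ds.asyncPuts)
}
```

The appeal of this pattern is that new knobs (including an fs.FS override for mocking) can be added later without breaking existing call sites.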
Consider limiting the number of asynchronous write operations. I think that it is reasonable for an async write operation to wait until the previous async write operation has completed. This allows a write function to return immediately, while the work is done during the accumulation of operations in the next batch. Additionally, there could be an option to enable/disable asynchronous puts, although this is probably not necessary if the above limit is implemented.
gammazero
left a comment
See and commit suggestions: Added a semaphore to limit concurrent puts
lidel
left a comment
I tried to document the current state (see godoc drafts in comments below) and one thing that stands out is the O(n) linear search in puts – not sure how big an impact this has, but we already have O(1) for deletes, so maybe do the same for puts? (details inline).
If we do that, the godoc should be updated to reflect it.
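The change this comment asks for (and which the later commit "use map for O(1) lookup of puts in batch" adopts) can be sketched as an index map alongside the slice. `batchOp` and the exact field names are illustrative; the `putSet` name is taken from the PR's later godoc commit:

```go
package main

import "fmt"

// batchOp is an illustrative staged write.
type batchOp struct {
	key   string
	value []byte
}

// batch pairs the ordered puts slice with a map, matching the O(1)
// membership test the batch already uses for deletes.
type batch struct {
	puts   []batchOp      // preserves insertion order for Commit
	putSet map[string]int // key -> index into puts, O(1) lookup
}

func (b *batch) Put(key string, value []byte) {
	if i, ok := b.putSet[key]; ok {
		b.puts[i].value = value // overwrite in place, still O(1)
		return
	}
	b.putSet[key] = len(b.puts)
	b.puts = append(b.puts, batchOp{key: key, value: value})
}

func (b *batch) Has(key string) bool {
	_, ok := b.putSet[key] // no linear scan of b.puts
	return ok
}

func main() {
	b := &batch{putSet: make(map[string]int)}
	b.Put("a", []byte("1"))
	b.Put("a", []byte("2")) // overwrites, does not duplicate
	fmt.Println(b.Has("a"), len(b.puts))
}
```

Keeping both structures costs one map entry per put but turns every Get/Has/GetSize membership check from O(n) into O(1).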
Triage: We will look at this after Kubo v0.39.
Co-authored-by: Marcin Rataj <lidel@lidel.org>
* Rewrite batch mode to use temp directory (#136)
  Write batch data to temp directory instead of memory to avoid high RAM usage with large files. Each batch creates its own temp directory, writes blocks directly to disk, and renames all files atomically on commit.
* Use slice instead of map for batch puts
* Add DiscardableBatch interface with Discard method
* add read operations for batch iface
* fix: remove unused function and variable to fix CI checks
  - Removed unused tempFileOnce() method
  - Replaced unused 'done' variable with '_' in Commit method
* remove some local files
* perf: optimize batch read operations to skip filesystem checks
  Only check temp directory for keys that exist in puts slice
* feat: implement batch Query with temp directory merging
* make batch Put operations async
  Address PR review feedback
* fix Batch init
* Update flatfs.go
* use map for O(1) lookup of puts in batch
* docs: improve godocs for batch operations
  - add godoc for maxConcurrentPuts explaining why 16 was chosen
  - fix incorrect O(n) claims for Get/Has/GetSize (actually O(1) via putSet map)
* handle create file for windows
* Update tests and include batch concurrency test
* test writing concurrent batches
* docs: document first-successful-writer-wins semantics
  - clarify flatfs is for content-addressed storage only and point users to leveldb/pebble for mutable data needing last-writer-wins
* fix: batch cleanup and docs improvements
  - document batch reuse contract per go-datastore spec
  - fix TOCTOU race in temp dir creation using MkdirAll
  - log warnings on temp dir cleanup failures
Problem
When uploading large files through UnixFS or other IPLD structures, the current batch implementation keeps everything in memory and writes blocks individually. If the application crashes mid-process, partial blocks remain in FlatFS, making garbage collection extremely difficult since there's no way to identify which blocks belong to incomplete operations.
Solution
This PR rewrites the batch mode to use a temporary directory approach:
- BatchReader interface implementing Get/Has/GetSize methods. This is essential for IPLD structures, which need to verify block links during construction. Without batch reads, you cannot build complex IPLD structures that reference blocks added earlier in the same batch. This follows standard database transaction semantics, where reads within a transaction see uncommitted writes.
Implementation Details