refactor: rewrite batch mode to use temp directory #142
Conversation
* Rewrite batch mode to use temp directory: write batch data to a temp directory instead of memory to avoid high RAM usage with large files. Each batch creates its own temp directory, writes blocks directly to disk, and renames all files atomically on commit.
* Use slice instead of map for batch puts
* Add DiscardableBatch interface with Discard method
* Add read operations for batch iface
* fix: remove unused function and variable to fix CI checks
  * Removed unused tempFileOnce() method
  * Replaced unused 'done' variable with '_' in Commit method
  * Removed some local files
* perf: optimize batch read operations to skip filesystem checks. Only check the temp directory for keys that exist in the puts slice.
* feat: implement batch Query with temp directory merging
* Make batch Put operations async (address PR review feedback)
* fix Batch init
* Update flatfs.go
Triage: Open PR in kubo and test on kubo staging.
Upgrade go-ds-flatfs to a version that uses temporary files to store items added to batches. See: ipfs/go-ds-flatfs#142
- add godoc for maxConcurrentPuts explaining why 16 was chosen
- fix incorrect O(n) claims for Get/Has/GetSize (actually O(1) via putSet map)
lidel
left a comment
LGTM, as long as we address two things:
- Updated godoc (bd468ee) to reflect the latest version, but unsure about one thing – see inline.
- IIUC two concurrent goroutines calling `os.Create()` and `Write()` on the same file can error or corrupt data – see inline.
Co-authored-by: Marcin Rataj <lidel@lidel.org>
flatfs.go (Outdated)

```go
	return
}

file, err := os.Create(tempFile)
```
Bit worried about this on Windows: I suspect concurrent async writes (multiple ipfs add) may cause file handle issues, things like golang/go#34681 – flatfs uses https://github.com/alexbrainman/goissue34681 for a reason; we likely need a similar wrapper on Windows here (the old code used tempFileOnce from util_windows.go).
@gammazero too late on my end to write code, but we probably need to create a createFile equivalent that does something similar (pseudocode):

util_unix.go:

```go
func createFile(name string) (*os.File, error) {
	return os.Create(name)
}
```

util_windows.go:

```go
func createFile(name string) (*os.File, error) {
	return goissue34681.OpenFile(name, os.O_RDWR|os.O_CREATE|os.O_TRUNC, 0666)
}
```

Then finally here:

```go
file, err := createFile(tempFile)
```

Windows concurrency is janky; unsure if we need a retry wrapper, maybe we can go without it, but in case, it could be something like:

util_windows.go:

```go
func createFile(name string) (*os.File, error) {
	var file *os.File
	var err error
	for i := 0; i < RetryAttempts; i++ {
		file, err = createFileOnce(name)
		if err == nil || !isTooManyFDError(err) {
			break
		}
		time.Sleep(time.Duration(i+1) * RetryDelay)
	}
	return file, err
}
```
Done, without retry wrapper.
Triage:
@lidel Included batch concurrency test
```go
// Skip duplicate keys to prevent concurrent goroutines from writing to the
// same temp file. Without this check, two Put calls with the same key
// would spawn goroutines that race on os.Create/Write, potentially
// corrupting the file contents.
if _, exists := bt.putSet[key]; exists {
	bt.mu.Unlock()
	<-bt.asyncPutGate // Release semaphore slot acquired above
	return nil
```
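The guard above can be illustrated in isolation. Below is a minimal, self-contained sketch (not the PR's actual code: the miniBatch type and its fields are hypothetical stand-ins mirroring the names in the diff, and the "write" is simulated with a counter):

```go
package main

import (
	"fmt"
	"sync"
)

// miniBatch sketches the duplicate-key guard: a putSet map records keys
// already scheduled, and a buffered channel acts as a semaphore bounding
// the number of concurrent writer goroutines.
type miniBatch struct {
	mu           sync.Mutex
	putSet       map[string]struct{}
	asyncPutGate chan struct{}
	wg           sync.WaitGroup
	writes       int // counts actual writes; guarded by mu
}

func newMiniBatch(maxConcurrent int) *miniBatch {
	return &miniBatch{
		putSet:       make(map[string]struct{}),
		asyncPutGate: make(chan struct{}, maxConcurrent),
	}
}

func (b *miniBatch) Put(key string) {
	b.asyncPutGate <- struct{}{} // acquire a semaphore slot
	b.mu.Lock()
	if _, exists := b.putSet[key]; exists {
		// Duplicate key: discard, so two goroutines never race on the
		// same temp file (first-successful-writer-wins).
		b.mu.Unlock()
		<-b.asyncPutGate // release the slot acquired above
		return
	}
	b.putSet[key] = struct{}{}
	b.mu.Unlock()

	b.wg.Add(1)
	go func() {
		defer b.wg.Done()
		defer func() { <-b.asyncPutGate }()
		// Stand-in for writing the temp file to disk.
		b.mu.Lock()
		b.writes++
		b.mu.Unlock()
	}()
}

func main() {
	b := newMiniBatch(16)
	for i := 0; i < 100; i++ {
		b.Put("same-key") // all duplicates after the first
	}
	b.wg.Wait()
	fmt.Println(b.writes) // 1
}
```

Only the first Put for a given key spawns a writer; every later Put with that key drains its semaphore slot and returns without touching the file.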
Hm.. the Query method documentation says:

> Keys Put multiple times appear only once (last write wins)

BUT the duplicate key handling here seems to silently discard updates? This means:

```go
batch.Put(ctx, key, []byte("first"))
batch.Put(ctx, key, []byte("second"))
batch.Commit(ctx) // writes "first", not "second"
```

@gammazero late Friday so maybe I misunderstood the intent here, but would it be more intuitive for the batch to implement "last write wins" semantics instead of "first write wins"?
My main worry is that the current behavior could cause subtle bugs for callers expecting standard transaction semantics where later operations override earlier ones.
The thinking was that duplicate keys should have the same data (assuming the key is a CID), so instead of blocking/overwriting when the key already exists, we discard the write. If "last write wins" is needed within a batch, because a write with the same key could carry different data than a previous write, then we would need to switch to a blocking/overwriting implementation or stop doing asynchronous writes.
Should the batch behavior be configurable? Last-write-wins or discard-duplicate-write?
Thanks, I stepped back and did some research on this.
TL;DR: Seems that I was wrong; this is not a general-purpose datastore. The current behavior (first-successful-writer-wins, discard duplicates) is correct for flatfs's intended use case (limited to CID→block), but we should document it more explicitly (for humans and LLMs).
Why this is fine for flatfs

1. flatfs is explicitly a special-purpose datastore for CID:block pairs only. Kubo's docs already state: "flatfs must only be used as a block store (mounted at /blocks) as it only partially implements the datastore interface".
2. CIDs guarantee value determinism. A CID is a cryptographic hash of the content. If Put(cid, A) and Put(cid, B) are called with the same CID, then A == B by definition (or one is corrupted). First-writer-wins and last-writer-wins produce identical results.
3. Non-batch Put already has this behavior. The existing doWriteOp documents: "we assume that the first succeeding operation on that key was the last one to happen after all successful others".
4. The go-datastore Batch interface doesn't guarantee ordering. The interface docs say batches "do NOT have transactional semantics".
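Point 2 can be made concrete: under content addressing, the key is derived from the value, so two Puts to the same key necessarily carry the same bytes. A sketch using a raw sha256 digest in place of a full CID (a real CID adds multihash/multicodec framing on top of the digest):

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// key derives a content address from the block bytes, standing in for a
// real CID computation.
func key(block []byte) string {
	sum := sha256.Sum256(block)
	return hex.EncodeToString(sum[:])
}

func main() {
	a := []byte("hello block")
	b := []byte("hello block")
	c := []byte("different block")

	// Identical content always yields the identical key. So whenever
	// key(a) == key(b), Put(key(a), a) and Put(key(b), b) store the same
	// bytes, and first-writer-wins and last-writer-wins are equivalent.
	fmt.Println(key(a) == key(b)) // true
	fmt.Println(key(a) == key(c)) // false
}
```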
Next steps
I've pushed 410cfd2 with documentation updates that make the "first-successful-writer-wins" semantics explicit in:
- README.md restrictions section
- Batch type documentation
- Put method documentation
It also notes that users needing last-writer-wins for non-deterministic values should use leveldb or pebble instead.
If we are ok with "first-successful-writer-wins" for flatfs, and it being limited to cid→block use, I think this should be enough. Thoughts @gammazero?
I agree with "first-successful-writer-wins" for this limited purpose datastore, and given rule 2 above, it is the most correct behavior.
- clarify flatfs is for content-addressed storage only and point users to leveldb/pebble for mutable data needing last-writer-wins
- document batch reuse contract per go-datastore spec
- fix TOCTOU race in temp dir creation using MkdirAll
- log warnings on temp dir cleanup failures
lidel
left a comment
LGTM. Previous concerns were addressed:
- docs clarified
- TOCTOU race fixed with MkdirAll
- temp dir cleanup now logs warnings on failure (so users debugging get useful info)
- batch reuse contract documented (#142 (comment))
- did some manual testing and race checks; so far all good
@gammazero if #142 (comment) sounds right, feel free to merge. Up to you if you want to tag a release, or switch kubo to the latest commit from master for now for RC testing, until the final release.
* datastore: upgrade go-ds-flatfs to v0.6.0. See: ipfs/go-ds-flatfs#142
* docs(changelog): add go-ds-flatfs atomic batch writes
  * documents the new flatfs batch implementation that uses atomic operations via a temp directory, preventing orphan blocks on interrupted imports and reducing memory usage
  * includes improved tests, batch cleanup fixes, and docs
* docs(changelog): reframe go-ds-flatfs entry for users; focus on user benefits instead of implementation details
Continued from #136
Credit to @requilence for most of this PR
Problem
When uploading large files through UnixFS or other IPLD structures, the current batch implementation keeps everything in memory and writes blocks individually. If the application crashes mid-process, partial blocks remain in FlatFS, making garbage collection extremely difficult since there's no way to identify which blocks belong to incomplete operations.
Solution
This PR rewrites the batch mode to use a temporary directory approach:
- BatchReader interface implementing Get/Has/GetSize methods. This is essential for IPLD structures, which need to verify block links during construction. Without batch reads, you cannot build complex IPLD structures that reference blocks added earlier in the same batch. This follows standard database transaction semantics where reads within a transaction see uncommitted writes.

Implementation Details