fix: track max sequence globally in compact() to prevent level regression by davidby-influx · Pull Request #27231 · influxdata/influxdb

davidby-influx · 2026-02-18T23:27:38Z

compact() derives the output filename from maxGeneration and
maxSequence+1, but maxSequence was scoped to the max generation only.
When cold or forced compaction groups L1 files (high gen, seq=1) with L4 files
(low gen, seq>=4), the output gets seq=2, regressing L4 data to L2. If large
L4 files are skipped by the cold planner's size check, the regressed file
becomes stranded, requiring repeated cold compaction cycles (12+ hours) to
recover.

The fix tracks the minimum sequence per generation to determine each
generation's true level, then bumps maxSequence so the output level never
regresses. Per-generation max sequence is preserved for collision avoidance,
and min-sequence level detection avoids inflating the output when a generation
has split files from the 2GB size limit.
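The description above can be sketched as a small standalone program. This is an illustration under stated assumptions, not the code in `tsdb/engine/tsm1/compact.go`: the `tsmFile` type and the `nextName` function are hypothetical names, the `%09d-%09d.tsm` filename layout is assumed, and only `minSeqByGen`, `maxSequence`, `maxInputLevel`, and the cap of 4 come from the discussion in this PR.

```go
package main

import "fmt"

// tsmFile is a hypothetical stand-in for a parsed TSM filename.
type tsmFile struct {
	Generation int
	Sequence   int
}

// maxLevel is the highest compaction level; the diff in this PR caps at 4.
const maxLevel = 4

// nextName returns the generation and sequence for the compaction output.
func nextName(files []tsmFile) (gen, seq int) {
	maxGeneration, maxSequence := 0, 0
	minSeqByGen := make(map[int]int)

	for _, f := range files {
		// maxSequence stays scoped to the max generation so the output
		// name cannot collide with an existing file in that generation.
		if f.Generation > maxGeneration {
			maxGeneration, maxSequence = f.Generation, f.Sequence
		} else if f.Generation == maxGeneration && f.Sequence > maxSequence {
			maxSequence = f.Sequence
		}
		// The minimum sequence in a generation is treated as its level;
		// split files carry higher sequences that do not count.
		if cur, ok := minSeqByGen[f.Generation]; !ok || f.Sequence < cur {
			minSeqByGen[f.Generation] = f.Sequence
		}
	}

	// Highest input level across all generations, capped at maxLevel.
	maxInputLevel := 0
	for _, minSeq := range minSeqByGen {
		level := minSeq
		if level > maxLevel {
			level = maxLevel
		}
		if level > maxInputLevel {
			maxInputLevel = level
		}
	}

	// Bump so that maxSequence+1 is never below the highest input level.
	if maxSequence+1 < maxInputLevel {
		maxSequence = maxInputLevel - 1
	}
	return maxGeneration, maxSequence + 1
}

func main() {
	// One L4 file in generation 1 and one L1 file in generation 4.
	gen, seq := nextName([]tsmFile{{1, 4}, {4, 1}})
	fmt.Printf("%09d-%09d.tsm\n", gen, seq) // prints 000000004-000000004.tsm
}
```

Without the final bump, the same input would yield sequence 2, which is the regression described above.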

Test plan

TestCompactor_CompactFull_MixedLevelNoRegression: L4+L1 compaction
produces L4 output, not L2
TestCompactor_CompactFull_SplitFilesNoLevelInflation: split L2
generation compacted with normal L2 produces L3, not inflated to L4

Fixes #27230

…sion

The compaction planner can produce groups mixing L1 and L4 files via cold compaction or the nested-file handling added in d1d68d6. compact() computed the output sequence scoped to the max generation only, so when a newer L1 file (high gen, seq=1) was compacted with older L4 files (low gen, seq≥4), the output was written at level 2, regressing already-optimized data.

Track maxSequence across all input generations so the output level is always at least as high as the highest input level.

This is safe for all existing compaction paths: PlanLevel inputs are same-level (global max == per-generation max), and cold/forced compactions merge into a single generation where the higher sequence is harmless.

Fixes #27230

davidby-influx · 2026-02-19T00:24:23Z

Claude is rethinking its life choices... Stand by.

compact() derives the output filename from maxGeneration and maxSequence+1, but maxSequence was scoped to the max generation only. When cold or forced compaction groups L1 files (high gen, seq=1) with L4 files (low gen, seq>=4), the output gets seq=2, regressing L4 data to L2.

Track the minimum sequence per generation to determine each generation's true level, then bump maxSequence so the output level never regresses below the highest input level. Per-generation max sequence is preserved for collision avoidance, and min-sequence level detection avoids inflating the output when a generation has split files from the 2GB size limit.

Fixes #27230

davidby-influx · 2026-02-19T04:59:49Z

Claude says:

 Here are my re-verified conclusions:

  1. Which paths produce mixed-level groups?

  Only the cold/forced Plan() path (line 493). It collects all non-in-use, non-skipped files regardless of level.

  - PlanLevel(1,2,3): Groups by level. The orphan absorption in groupAdjacentGenerations can mix levels within a group, but PlanLevel filters groups by level() (which returns the max), so orphaned lower-level files get absorbed into higher-level groups that PlanLevel at the lower level never sees. PlanLevel always produces same-level groups. No mixed levels.
  - Hot Plan(): The reverse scan finds the last L4 by generation number. New L1 files always have higher generation numbers than L4 files, so they're excluded. No mixed levels in normal operation.
  - PlanOptimize(): Operates on cold shards after Plan(). Not relevant.

  2. The bug predates both commits. The per-generation maxSequence logic in compact() has been the same since before f0d5c73 and d1d68d6. The cold/forced path has always been able to
  mix levels. Neither commit introduced or extended the bug.

  3. The fix is correct. I traced through seven scenarios:

  ┌────────────────────┬────────────────────────────┬────────────┬────────────┬──────────────┐
  │      Scenario      │           Input            │ Fix output │ Old output │   Correct?   │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ L4+L1 cold         │ gen=1(s4), gen=4(s1)       │ seq=4 (L4) │ seq=2 (L2) │ Fix ✓, Old ✗ │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ All L4 cold        │ gen=1(s4), gen=4(s4)       │ seq=5 (L4) │ seq=5 (L4) │ Both ✓       │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ All L1 PlanLevel   │ gen=1..8(s1)               │ seq=2 (L2) │ seq=2 (L2) │ Both ✓       │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ L2 split PlanLevel │ gen=1(s2,s3), gen=16(s2)   │ seq=3 (L3) │ seq=3 (L3) │ Both ✓       │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ L3+L1 cold         │ gen=1(s3), gen=2(s1)       │ seq=3 (L3) │ seq=2 (L2) │ Fix ✓, Old ✗ │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ L4 split+L1 cold   │ gen=1(s4,s5,s6), gen=4(s1) │ seq=4 (L4) │ seq=2 (L2) │ Fix ✓, Old ✗ │
  ├────────────────────┼────────────────────────────┼────────────┼────────────┼──────────────┤
  │ L2 split+L1 cold   │ gen=1(s2,s3,s4), gen=4(s1) │ seq=2 (L2) │ seq=2 (L2) │ Both ✓       │
  └────────────────────┴────────────────────────────┴────────────┴────────────┴──────────────┘

  The level bump loop's result is order-independent (map iteration) because it only increases maxSequence, never decreases it.

  4. The tests are correct.
  - MixedLevelNoRegression: catches the bug (old code produces seq=2, fix produces seq=4)
  - SplitFilesNoLevelInflation: catches over-correction (global-max fix would produce seq=4, correct fix produces seq=3, old code also produces seq=3)

  5. One prior claim was wrong. I incorrectly attributed hot-path exposure to d1d68d6. The hot path does not produce mixed-level groups. The bug is exclusively in the cold/forced
  compaction path and has been latent since the original implementation.
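The seven scenarios in the table above can be replayed with a short table-driven sketch. It is a simplified model, not the engine code: `file`, `perGenMax`, `oldSeq`, `fixedSeq`, and `scenarios` are hypothetical names, and each scenario is reduced to (generation, sequence) pairs.

```go
package main

import "fmt"

// file is a hypothetical (generation, sequence) pair for one TSM file.
type file struct{ gen, seq int }

// perGenMax returns the max generation and the max sequence within it.
func perGenMax(files []file) (maxGen, maxSeq int) {
	for _, f := range files {
		switch {
		case f.gen > maxGen:
			maxGen, maxSeq = f.gen, f.seq
		case f.gen == maxGen && f.seq > maxSeq:
			maxSeq = f.seq
		}
	}
	return maxGen, maxSeq
}

// oldSeq models the pre-fix behavior: max-generation sequence plus one.
func oldSeq(files []file) int {
	_, maxSeq := perGenMax(files)
	return maxSeq + 1
}

// fixedSeq models the fix: bump to the highest per-generation minimum
// sequence (capped at 4) when the plain result would fall below it.
func fixedSeq(files []file) int {
	_, maxSeq := perGenMax(files)
	minSeqByGen := map[int]int{}
	for _, f := range files {
		if cur, ok := minSeqByGen[f.gen]; !ok || f.seq < cur {
			minSeqByGen[f.gen] = f.seq
		}
	}
	maxInputLevel := 0
	for _, minSeq := range minSeqByGen {
		level := minSeq
		if level > 4 {
			level = 4
		}
		if level > maxInputLevel {
			maxInputLevel = level
		}
	}
	if maxSeq+1 < maxInputLevel {
		maxSeq = maxInputLevel - 1
	}
	return maxSeq + 1
}

// scenarios mirrors the rows of the table, with the expected sequences.
var scenarios = []struct {
	name             string
	files            []file
	wantOld, wantFix int
}{
	{"L4+L1 cold", []file{{1, 4}, {4, 1}}, 2, 4},
	{"All L4 cold", []file{{1, 4}, {4, 4}}, 5, 5},
	{"All L1 PlanLevel", []file{{1, 1}, {2, 1}, {3, 1}, {4, 1}, {5, 1}, {6, 1}, {7, 1}, {8, 1}}, 2, 2},
	{"L2 split PlanLevel", []file{{1, 2}, {1, 3}, {16, 2}}, 3, 3},
	{"L3+L1 cold", []file{{1, 3}, {2, 1}}, 2, 3},
	{"L4 split+L1 cold", []file{{1, 4}, {1, 5}, {1, 6}, {4, 1}}, 2, 4},
	{"L2 split+L1 cold", []file{{1, 2}, {1, 3}, {1, 4}, {4, 1}}, 2, 2},
}

func main() {
	for _, sc := range scenarios {
		got, fix := oldSeq(sc.files), fixedSeq(sc.files)
		ok := got == sc.wantOld && fix == sc.wantFix
		fmt.Printf("%-20s old=%d fix=%d matches table: %v\n", sc.name, got, fix, ok)
	}
}
```

The "L2 split+L1 cold" row is the one that separates a minimum-based level from a maximum-based one: taking the highest sequence in generation 1 would lift the output to 4.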

Copilot

Pull request overview

This PR fixes a critical bug in TSM file compaction where level 4 (L4) files could be regressed to level 2 (L2) during cold or forced compaction operations. The issue occurred when compact() determined the output filename using only the maximum sequence number within the maximum generation, ignoring higher-level files from older generations. This could cause already-optimized L4 data to be written at a lower level, potentially stranding files and requiring multiple 12+ hour compaction cycles to recover.

Changes:

Modified compact() to track minimum sequence per generation and ensure output level never regresses below the highest input level
Added comprehensive tests covering mixed-level compaction and split-file edge cases

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
`tsdb/engine/tsm1/compact.go`	Added `minSeqByGen` map to track minimum sequence per generation and logic to bump `maxSequence` to prevent level regression
`tsdb/engine/tsm1/compact_test.go`	Added `mustWriteTSMWithSeq` helper and two tests: one for mixed-level regression prevention, one for split-file level inflation prevention


gwossum · 2026-02-19T17:37:27Z

tsdb/engine/tsm1/compact.go

+	for _, minSeq := range minSeqByGen {
+		level := minSeq
+		if level > 4 {
+			level = 4
+		}
+		if maxSequence+1 < level {
+			maxSequence = level - 1
+		}


I find the logic here somewhat confusing. I've created a branch with an alternative implementation that gets the same result, but makes it clear what is being calculated.

For your consideration:
https://github.com/influxdata/influxdb/tree/gw/max_seq_fix_minInputLevel

Diff:
https://github.com/influxdata/influxdb/compare/DSB/max_seq_fix..gw/max_seq_fix_minInputLevel

Claude bows to your superiority:

The gw branch computes maxInputLevel as a single value and applies the bump in one comparison. My branch iterates the map and bumps incrementally inside the loop. Let me verify they're equivalent.

gw branch:

	var maxInputLevel int
	for _, minSeq := range minSeqByGen {
		maxInputLevel = max(maxInputLevel, min(minSeq, 4))
	}
	if maxSequence+1 < maxInputLevel {
		maxSequence = maxInputLevel - 1
	}

This branch:

	for _, minSeq := range minSeqByGen {
		level := minSeq
		if level > 4 {
			level = 4
		}
		if maxSequence+1 < level {
			maxSequence = level - 1
		}
	}

My loop bumps maxSequence incrementally per generation. Since the bump only increases maxSequence, after the full loop, maxSequence ends at max(original, maxLevel-1) where maxLevel is the highest level seen. That's the same as computing maxInputLevel first and then bumping once. They produce identical results for all inputs.

The gw branch is cleaner: it separates the computation from the mutation and avoids redundant comparisons inside the loop.

Adopting your changes.
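The equivalence claimed in the reply above can also be checked by brute force over small inputs. The following is a sketch only: `bumpIncremental`, `bumpOnce`, and `equivalent` are hypothetical names wrapping the two loop shapes quoted in this thread, and explicit comparisons replace the `min`/`max` builtins so the program also builds on toolchains older than Go 1.21.

```go
package main

import "fmt"

// maxLevel is the cap used in both variants quoted in the review.
const maxLevel = 4

// bumpIncremental models the original loop: bump inside the iteration.
func bumpIncremental(maxSequence int, minSeqByGen map[int]int) int {
	for _, minSeq := range minSeqByGen {
		level := minSeq
		if level > maxLevel {
			level = maxLevel
		}
		if maxSequence+1 < level {
			maxSequence = level - 1
		}
	}
	return maxSequence
}

// bumpOnce models the reviewer's variant: compute maxInputLevel first,
// then apply a single comparison.
func bumpOnce(maxSequence int, minSeqByGen map[int]int) int {
	maxInputLevel := 0
	for _, minSeq := range minSeqByGen {
		level := minSeq
		if level > maxLevel {
			level = maxLevel
		}
		if level > maxInputLevel {
			maxInputLevel = level
		}
	}
	if maxSequence+1 < maxInputLevel {
		maxSequence = maxInputLevel - 1
	}
	return maxSequence
}

// equivalent compares both variants for every starting maxSequence in
// [0, limit] and every three-generation map with minimum sequences in
// [1, limit].
func equivalent(limit int) bool {
	for start := 0; start <= limit; start++ {
		for a := 1; a <= limit; a++ {
			for b := 1; b <= limit; b++ {
				for c := 1; c <= limit; c++ {
					m := map[int]int{1: a, 2: b, 3: c}
					if bumpIncremental(start, m) != bumpOnce(start, m) {
						return false
					}
				}
			}
		}
	}
	return true
}

func main() {
	fmt.Println(equivalent(6))
}
```

An exhaustive check over a small range is not a proof, but it agrees with the argument that both forms end at the larger of the original maxSequence and the highest capped level minus one, regardless of map iteration order.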

Explicitly calculate maxInputLevel for clarity. (cherry picked from commit 142fcaa)

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated no new comments.


gwossum

LGTM

davidby-influx requested a review from gwossum February 18, 2026 23:27

davidby-influx self-assigned this Feb 18, 2026

davidby-influx added area/tsm kind/bug 1.x team/edge labels Feb 18, 2026

davidby-influx marked this pull request as draft February 19, 2026 00:24

davidby-influx marked this pull request as ready for review February 19, 2026 01:43

davidby-influx requested a review from Copilot February 19, 2026 05:00

Copilot started reviewing on behalf of davidby-influx February 19, 2026 05:00

Copilot AI reviewed Feb 19, 2026

gwossum reviewed Feb 19, 2026

chore: explicitly calculate maxInputLevel for clarity

12b6068

Explicitly calculate maxInputLevel for clarity. (cherry picked from commit 142fcaa)

davidby-influx requested review from Copilot and gwossum February 19, 2026 18:47

Copilot started reviewing on behalf of davidby-influx February 19, 2026 18:48

Copilot AI reviewed Feb 19, 2026

gwossum approved these changes Feb 19, 2026

davidby-influx linked an issue Feb 19, 2026 that may be closed by this pull request

compact() regresses L4 files to L2 when compacting mixed-level groups #27230

Closed

davidby-influx merged commit 381fd2d into master-1.x Feb 19, 2026
12 of 13 checks passed

davidby-influx deleted the DSB/max_seq_fix branch February 19, 2026 21:31

davidby-influx mentioned this pull request Feb 19, 2026

compact() regresses L4 files to L2 when compacting mixed-level groups [port to 1.12] #27232

Closed

davidby-influx merged 3 commits into master-1.x from DSB/max_seq_fix
