fix: Add support for compressed files for tail package by kalleep · Pull Request #5363 · grafana/alloy

kalleep · 2026-01-27T13:07:04Z

Brief description of Pull Request

When decompression is configured for loki.source.file, we now use the same internal code path as when it is not configured. This aligns features between the two cases, such as BOM detection.

Pull Request Details

Before this PR we had two different implementations for tailing a file: tailer and decompressor. The latter was used any time decompression was configured.

Ever since my major refactors to the tail package, I was certain that we could add support for compressed files and reuse the same implementation.

That is what I have done here, so most of the code is now shared. When compression is configured for tail.File, it will not wait if EOF is returned. Instead it checks whether there is any remaining data to flush and then returns EOF, which stops the tailer.
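The EOF behavior described above can be illustrated with a minimal sketch. It is not the tail package code: `readCompressedLines` and `gzipBytes` are hypothetical helpers, gzip stands in for whichever compression is configured, and the sketch reports a clean end of stream as a nil error rather than passing EOF on to a caller.

```go
package main

import (
	"bufio"
	"bytes"
	"compress/gzip"
	"errors"
	"fmt"
	"io"
	"strings"
)

// gzipBytes compresses s in memory so the example needs no files on disk.
func gzipBytes(s string) []byte {
	var buf bytes.Buffer
	zw := gzip.NewWriter(&buf)
	if _, err := zw.Write([]byte(s)); err != nil {
		panic(err)
	}
	if err := zw.Close(); err != nil {
		panic(err)
	}
	return buf.Bytes()
}

// readCompressedLines (hypothetical) decompresses data and returns its lines.
// A compressed file is not expected to grow, so io.EOF ends the read instead
// of causing the reader to wait for more data.
func readCompressedLines(data []byte) ([]string, error) {
	zr, err := gzip.NewReader(bytes.NewReader(data))
	if err != nil {
		return nil, err
	}
	defer zr.Close()

	br := bufio.NewReader(zr)
	var lines []string
	for {
		line, err := br.ReadString('\n')
		if errors.Is(err, io.EOF) {
			// Flush any remaining data, even without a trailing newline, then stop.
			if line != "" {
				lines = append(lines, line)
			}
			return lines, nil
		}
		if err != nil {
			return lines, err
		}
		lines = append(lines, strings.TrimSuffix(line, "\n"))
	}
}

func main() {
	lines, err := readCompressedLines(gzipBytes("first\nsecond\nno trailing newline"))
	if err != nil {
		panic(err)
	}
	for _, l := range lines {
		fmt.Println(l)
	}
}
```

A regular, uncompressed file would take the opposite branch at EOF and wait for the file to change, which is why the two cases can share everything except that decision.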

One issue is that the previous implementation tracked position by line number, whereas this implementation tracks the offset into the uncompressed data. I am not sure how we can handle that.
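To make the difference concrete, here is a rough sketch of what a position expressed as an offset into the uncompressed data looks like. The helper names (`linesFrom`, `gzipBytes`) are made up for illustration, and gzip is only an assumed example format. Because a gzip stream is not seekable, resuming at an offset means decompressing and discarding that many bytes first.

```go
package main

import (
	"bufio"
	"bytes"
	"compress/gzip"
	"errors"
	"fmt"
	"io"
	"strings"
)

// gzipBytes compresses s in memory so the example needs no files on disk.
func gzipBytes(s string) []byte {
	var buf bytes.Buffer
	zw := gzip.NewWriter(&buf)
	if _, err := zw.Write([]byte(s)); err != nil {
		panic(err)
	}
	if err := zw.Close(); err != nil {
		panic(err)
	}
	return buf.Bytes()
}

// linesFrom (hypothetical) resumes reading at an offset measured in
// uncompressed bytes and returns the remaining lines plus the new offset.
func linesFrom(data []byte, offset int64) ([]string, int64, error) {
	zr, err := gzip.NewReader(bytes.NewReader(data))
	if err != nil {
		return nil, offset, err
	}
	defer zr.Close()

	// Skip the bytes that were already consumed by decompressing and discarding them.
	if _, err := io.CopyN(io.Discard, zr, offset); err != nil {
		return nil, offset, err
	}

	br := bufio.NewReader(zr)
	var lines []string
	for {
		line, err := br.ReadString('\n')
		offset += int64(len(line)) // the offset advances by uncompressed bytes, newline included
		if line != "" {
			lines = append(lines, strings.TrimSuffix(line, "\n"))
		}
		if errors.Is(err, io.EOF) {
			return lines, offset, nil
		}
		if err != nil {
			return lines, offset, err
		}
	}
}

func main() {
	data := gzipBytes("one\ntwo\nthree\n")
	lines, offset, err := linesFrom(data, 4) // 4 bytes skips "one\n"
	if err != nil {
		panic(err)
	}
	fmt.Println(lines, offset)
}
```

A position stored by the previous implementation is a line count, not a byte count, so it cannot be passed to a function like this unchanged.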

In addition to decompression support, I fixed an issue I noticed by adding flush to the reader. When we hit EOF we drain all remaining data, but previously any data that did not end with a newline would never be consumed.
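The flush fix can be pictured with a small sketch of a line buffer. `lineBuffer`, `write`, and `flush` are illustrative names only and are not taken from the reader in this PR; the point is that complete lines are emitted as they arrive, while a trailing fragment stays buffered until something explicitly flushes it.

```go
package main

import (
	"bytes"
	"fmt"
)

// lineBuffer is a hypothetical stand-in for the reader's internal buffer.
type lineBuffer struct {
	pending []byte
}

// write appends newly read bytes and returns every complete line found so far.
func (b *lineBuffer) write(p []byte) []string {
	b.pending = append(b.pending, p...)
	var lines []string
	for {
		i := bytes.IndexByte(b.pending, '\n')
		if i < 0 {
			return lines // whatever remains has no newline yet
		}
		lines = append(lines, string(b.pending[:i]))
		b.pending = b.pending[i+1:]
	}
}

// flush returns the buffered data that never received a newline, if any.
// Without a call like this at EOF, that trailing data would be lost.
func (b *lineBuffer) flush() (string, bool) {
	if len(b.pending) == 0 {
		return "", false
	}
	s := string(b.pending)
	b.pending = nil
	return s, true
}

func main() {
	var b lineBuffer
	fmt.Println(b.write([]byte("a\nb\npartial")))
	if rest, ok := b.flush(); ok {
		fmt.Println(rest)
	}
}
```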

Issue(s) fixed by this Pull Request

Notes to the Reviewer

If we consume a compressed file we exit and remove the stored position. While Alloy is running this is fine, but if Alloy is restarted we will ingest the files again. This is true of both this implementation and the previous one, and we should fix it in another PR.

PR Checklist

- [ ] Documentation added
- [x] Tests updated
- [ ] Config converters updated

ptodev

LGTM. I only added a few minor comments.

One issue we have is that the previous implementation tracked position by line numbers but this implementation will track offset on uncompressed data

Is this going to be a problem for users who have a partially consumed archive? Is there a chance of logs being lost during Alloy upgrades?

if we had data that did not include a newline it would never be consumed.

It'd be good to note this with a fix: in the changelog.

internal/component/loki/source/file/tailer.go

Co-authored-by: Paulin Todev <paulin.todev@gmail.com>

kalleep · 2026-01-27T14:31:23Z

Is this going to be a problem for users who have a partially consumed archive? Is there a chance of logs being lost during Alloy upgrades?

It would be the other way around: we would most likely re-read lines that have already been consumed.

kalleep · 2026-01-27T15:43:36Z

I want to add integration tests for compression usage. If it's okay, I will leave that to a follow-up.

kalleep · 2026-01-27T15:45:32Z

It'd be good to note this with a fix: in the changelog.

Yeah this is probably good because now we get the same support for BOM detection when reading compressed data.

Backport of #5363 to release/v1.13 (cherry picked from commit 2347c1b). Original PR author: @kalleep. This backport was created automatically.

Co-authored-by: Karl Persson <23356117+kalleep@users.noreply.github.com>

kalleep added 10 commits January 26, 2026 15:24

rework bom functions and move file position handling to reader

039f2a2

initial support for compression

50ea765

Add tests for different compressions

8307149

Add Nop position implementation

692d5f6

fix: add function to flush buffered data from reader

ee29234

pass encoding as string

f6d4ee1

Use tailer for reader compressed files

a6ae2bd

spelling

0b63974

add comment

b5b5959

Add initial delay

6862676

kalleep requested a review from a team as a code owner January 27, 2026 13:07

kalleep changed the title ~~refactor: reuse tailer for compressed files~~ refactor: Reuse tailer for compressed files Jan 27, 2026

kalleep added 2 commits January 27, 2026 14:39

fix flaky tests

69dcbc4

remove prints

4ac9508

ptodev reviewed Jan 27, 2026

internal/component/loki/source/file/tailer.go (outdated, resolved)

internal/component/loki/source/file/tailer.go (outdated, resolved)

internal/component/loki/source/file/tailer.go (outdated, resolved)

ptodev self-assigned this Jan 27, 2026

kalleep and others added 3 commits January 27, 2026 15:29

fix flaky test

adbebf9

Update internal/component/loki/source/file/tailer.go

f6c6862

Co-authored-by: Paulin Todev <paulin.todev@gmail.com>

Update internal/component/loki/source/file/tailer.go

839bf83

Co-authored-by: Paulin Todev <paulin.todev@gmail.com>

kalleep requested a review from ptodev January 28, 2026 13:05

ptodev approved these changes Jan 28, 2026

kalleep changed the title ~~refactor: Reuse tailer for compressed files~~ fix: reuse code when compression is configured for loki.source.file Jan 28, 2026

kalleep changed the title ~~fix: reuse code when compression is configured for loki.source.file~~ fix: Add support for compressed files with tail packge Jan 28, 2026

kalleep changed the title ~~fix: Add support for compressed files with tail packge~~ fix: Add support for compressed files for tail package Jan 28, 2026

kalleep merged commit 2347c1b into main Jan 28, 2026
50 of 51 checks passed

kalleep deleted the kalleep/loki-source-file-compression branch January 28, 2026 14:01

grafana-alloybot bot mentioned this pull request Jan 28, 2026

Unreleased Changes #5170

Draft

kalleep added the backport/v1.13 Backport to release/v1.13 label Feb 2, 2026

grafana-alloybot bot mentioned this pull request Feb 2, 2026

fix: Add support for compressed files for tail package [backport] #5415

Merged

3 tasks

github-actions bot added the frozen-due-to-age label Feb 17, 2026

github-actions bot locked as resolved and limited conversation to collaborators Feb 17, 2026

kalleep merged 15 commits into main from kalleep/loki-source-file-compression
