Skip to content

Conversation

@stloyd
Copy link
Member

@stloyd stloyd commented Sep 24, 2025

Resolves: #1859

Change Log


Added

  • Allow detecting Excel files by reading first bytes

Fixed

Changed

Removed

Deprecated

Security

@norberttech
Copy link
Member

Sweet!!!

@norberttech norberttech enabled auto-merge (squash) September 24, 2025 12:55
@norberttech norberttech merged commit 01d60a1 into flow-php:1.x Sep 24, 2025
21 checks passed
@codecov
Copy link

codecov bot commented Sep 24, 2025

Codecov Report

❌ Patch coverage is 95.45455% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 82.48%. Comparing base (c0afa1e) to head (f3b8fd0).
⚠️ Report is 3 commits behind head on 1.x.

Additional details and impacted files
@@            Coverage Diff             @@
##              1.x    #1860      +/-   ##
==========================================
+ Coverage   82.46%   82.48%   +0.01%     
==========================================
  Files         772      772              
  Lines       21785    21800      +15     
==========================================
+ Hits        17966    17982      +16     
+ Misses       3819     3818       -1     
Components Coverage Δ
etl 89.25% <ø> (ø)
cli 85.91% <ø> (ø)
lib-array-dot 94.56% <ø> (ø)
lib-azure-sdk 61.35% <ø> (ø)
lib-doctrine-dbal-bulk 95.59% <ø> (ø)
lib-filesystem 80.25% <ø> (ø)
lib-types 53.55% <ø> (ø)
lib-parquet 85.50% <ø> (ø)
lib-parquet-viewer 83.11% <ø> (ø)
lib-snappy 90.23% <ø> (+0.46%) ⬆️
bridge-filesystem-async-aws 90.38% <ø> (ø)
bridge-filesystem-azure 89.92% <ø> (ø)
bridge-monolog-http 97.04% <ø> (ø)
bridge-openapi-specification 94.52% <ø> (ø)
symfony-http-foundation 74.41% <ø> (ø)
adapter-chartjs 86.70% <ø> (ø)
adapter-csv 88.85% <ø> (ø)
adapter-doctrine 91.21% <ø> (ø)
adapter-elasticsearch 97.23% <ø> (ø)
adapter-google-sheet 84.49% <ø> (ø)
adapter-http 58.10% <ø> (ø)
adapter-json 87.98% <ø> (ø)
adapter-logger 53.84% <ø> (ø)
adapter-meilisearch 97.95% <ø> (ø)
adapter-parquet 78.92% <ø> (ø)
adapter-text 84.44% <ø> (ø)
adapter-xml 82.73% <ø> (ø)
🚀 New features to boost your workflow:
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@github-actions
Copy link
Contributor

Flow PHP - Benchmarks

Results of the benchmarks from this PR are compared with the results from 1.x branch.

Extractors
+-----------------------+------------------------+------+-----+-----------------+------------------+-----------------+
| benchmark             | subject                | revs | its | mem_peak        | mode             | rstdev          |
+-----------------------+------------------------+------+-----+-----------------+------------------+-----------------+
| CSVExtractorBench     | bench_extract_10k      | 1    | 3   | 4.956mb -0.02%  | 433.008ms -0.76% | ±0.51% -40.81%  |
| ExcelExtractorBench   | bench_extract_10k_ods  | 1    | 3   | 66.212mb +0.00% | 1.083s -0.17%    | ±0.15% -63.54%  |
| ExcelExtractorBench   | bench_extract_10k_xlsx | 1    | 3   | 68.323mb +0.00% | 1.713s -1.10%    | ±0.64% -19.01%  |
| JsonExtractorBench    | bench_extract_10k      | 1    | 3   | 5.492mb +0.01%  | 1.163s -1.24%    | ±0.25% -39.43%  |
| ParquetExtractorBench | bench_extract_10k      | 1    | 3   | 10.788mb -0.26% | 9.434s -20.50%   | ±0.80% +75.55%  |
| TextExtractorBench    | bench_extract_10k      | 1    | 3   | 4.682mb -0.02%  | 62.477ms +1.37%  | ±1.11% +361.81% |
| XmlExtractorBench     | bench_extract_10k      | 1    | 3   | 4.665mb -0.02%  | 627.184ms -0.44% | ±0.47% -53.37%  |
+-----------------------+------------------------+------+-----+-----------------+------------------+-----------------+
Transformers
+---------------------------------+--------------------------+------+-----+------------------+-----------------+-----------------+
| benchmark                       | subject                  | revs | its | mem_peak         | mode            | rstdev          |
+---------------------------------+--------------------------+------+-----+------------------+-----------------+-----------------+
| RenameEachEntryTransformerBench | bench_transform_10k_rows | 1    | 3   | 18.687mb -0.01%  | 73.225ms +0.34% | ±0.59% +130.90% |
| RenameEntryTransformerBench     | bench_transform_10k_rows | 1    | 3   | 123.490mb -0.00% | 66.627ms -0.63% | ±1.62% +325.51% |
+---------------------------------+--------------------------+------+-----+------------------+-----------------+-----------------+
Loaders
+--------------------+----------------+------+-----+------------------+------------------+-----------------+
| benchmark          | subject        | revs | its | mem_peak         | mode             | rstdev          |
+--------------------+----------------+------+-----+------------------+------------------+-----------------+
| CSVLoaderBench     | bench_load_10k | 1    | 3   | 62.781mb -0.00%  | 90.809ms -1.09%  | ±1.20% -47.04%  |
| JsonLoaderBench    | bench_load_10k | 1    | 3   | 80.702mb -0.00%  | 102.675ms -0.37% | ±0.52% +5.37%   |
| ParquetLoaderBench | bench_load_10k | 1    | 3   | 819.384mb +0.04% | 20.347s -25.86%  | ±0.71% +222.09% |
| TextLoaderBench    | bench_load_10k | 1    | 3   | 17.982mb -0.01%  | 34.498ms +0.57%  | ±0.64% -9.97%   |
+--------------------+----------------+------+-----+------------------+------------------+-----------------+
Building Blocks
+-------------------+----------------------------+------+-----+------------------+------------------+-----------------+
| benchmark         | subject                    | revs | its | mem_peak         | mode             | rstdev          |
+-------------------+----------------------------+------+-----+------------------+------------------+-----------------+
| TypeDetectorBench | bench_type_detector        | 1    | 3   | 42.608mb -0.00%  | 405.478ms +0.66% | ±0.75% +28.64%  |
| TypeDetectorBench | bench_type_detector        | 1    | 3   | 11.665mb -0.01%  | 82.858ms +0.30%  | ±1.12% -11.76%  |
| EntryFactoryBench | bench_entry_factory        | 1    | 3   | 106.085mb -0.00% | 656.310ms +0.02% | ±0.50% -38.31%  |
| EntryFactoryBench | bench_entry_factory        | 1    | 3   | 55.363mb -0.00%  | 335.981ms +1.69% | ±1.72% +30.23%  |
| EntryFactoryBench | bench_entry_factory        | 1    | 3   | 14.949mb -0.01%  | 70.435ms -1.05%  | ±0.86% +360.33% |
| RowsBench         | bench_chunk_10_on_10k      | 2    | 3   | 93.553mb -0.00%  | 3.870ms +2.42%   | ±1.93% +72.13%  |
| RowsBench         | bench_diff_left_1k_on_10k  | 2    | 3   | 110.943mb -0.00% | 239.549ms -0.63% | ±0.46% -37.90%  |
| RowsBench         | bench_diff_right_1k_on_10k | 2    | 3   | 93.663mb -0.00%  | 24.576ms +2.70%  | ±0.73% -17.78%  |
| RowsBench         | bench_drop_1k_on_10k       | 2    | 3   | 94.428mb -0.00%  | 1.937ms +25.03%  | ±2.95% +185.42% |
| RowsBench         | bench_drop_right_1k_on_10k | 2    | 3   | 94.428mb -0.00%  | 1.940ms +30.79%  | ±1.62% -8.39%   |
| RowsBench         | bench_entries_on_10k       | 2    | 3   | 92.589mb -0.00%  | 3.778ms +7.36%   | ±2.87% +55.00%  |
| RowsBench         | bench_filter_on_10k        | 2    | 3   | 93.117mb -0.00%  | 15.662ms +1.00%  | ±0.82% -17.45%  |
| RowsBench         | bench_find_on_10k          | 2    | 3   | 93.117mb -0.00%  | 15.828ms +1.18%  | ±0.35% +30.79%  |
| RowsBench         | bench_find_one_on_10k      | 10   | 3   | 91.806mb -0.00%  | 2.000μs +0.30%   | ±0.00% -100.00% |
| RowsBench         | bench_first_on_10k         | 10   | 3   | 91.806mb -0.00%  | 0.400μs 0.00%    | ±0.00% 0.00%    |
| RowsBench         | bench_flat_map_on_1k       | 2    | 3   | 100.867mb -0.00% | 16.567ms +13.81% | ±1.12% +23.33%  |
| RowsBench         | bench_map_on_10k           | 2    | 3   | 130.294mb -0.00% | 71.839ms +4.59%  | ±1.21% +523.07% |
| RowsBench         | bench_merge_1k_on_10k      | 2    | 3   | 93.637mb -0.00%  | 1.674ms +34.82%  | ±2.56% -12.02%  |
| RowsBench         | bench_partition_by_on_10k  | 2    | 3   | 97.025mb -0.00%  | 62.821ms +1.72%  | ±0.71% +6.44%   |
| RowsBench         | bench_remove_on_10k        | 2    | 3   | 94.690mb -0.00%  | 3.980ms +12.22%  | ±2.63% +265.30% |
| RowsBench         | bench_sort_asc_on_1k       | 2    | 3   | 92.187mb -0.00%  | 39.946ms -1.65%  | ±1.19% +6.20%   |
| RowsBench         | bench_sort_by_on_1k        | 2    | 3   | 92.187mb -0.00%  | 41.095ms -1.44%  | ±2.15% +78.14%  |
| RowsBench         | bench_sort_desc_on_1k      | 2    | 3   | 92.187mb -0.00%  | 41.708ms -0.89%  | ±3.02% +46.01%  |
| RowsBench         | bench_sort_entries_on_1k   | 2    | 3   | 94.249mb -0.00%  | 8.295ms +2.61%   | ±1.21% -38.90%  |
| RowsBench         | bench_sort_on_1k           | 2    | 3   | 91.999mb -0.00%  | 29.118ms -7.68%  | ±0.57% -83.19%  |
| RowsBench         | bench_take_1k_on_10k       | 10   | 3   | 91.806mb -0.00%  | 14.680μs -3.14%  | ±1.15% -52.38%  |
| RowsBench         | bench_take_right_1k_on_10k | 10   | 3   | 91.806mb -0.00%  | 17.359μs +0.30%  | ±1.43% +425.11% |
| RowsBench         | bench_unique_on_1k         | 2    | 3   | 110.943mb -0.00% | 242.021ms -1.24% | ±0.74% +103.61% |
+-------------------+----------------------------+------+-----+------------------+------------------+-----------------+
Parquet Library
+--------------------+---------------------------------+------+-----+------------------+-------------------+-----------------+
| benchmark          | subject                         | revs | its | mem_peak         | mode              | rstdev          |
+--------------------+---------------------------------+------+-----+------------------+-------------------+-----------------+
| ParquetReaderBench | bench_page_headers              | 1    | 3   | 6.989mb -0.02%   | 3.308s -0.53%     | ±0.80% -34.00%  |
| ParquetReaderBench | bench_read_metadata             | 1    | 3   | 5.444mb -0.02%   | 18.383ms +1.21%   | ±0.46% -86.09%  |
| ParquetReaderBench | bench_read_schema               | 1    | 3   | 5.444mb -0.02%   | 18.146ms -0.79%   | ±0.34% -74.93%  |
| ParquetReaderBench | bench_read_values_all_columns   | 1    | 3   | 9.260mb -0.21%   | 5.690s -28.02%    | ±1.07% +56.98%  |
| ParquetReaderBench | bench_read_values_single_column | 1    | 3   | 6.491mb -0.31%   | 233.078ms -49.57% | ±1.11% +81.84%  |
| ParquetReaderBench | bench_read_values_with_limit    | 1    | 3   | 7.075mb -1.21%   | 29.378ms -12.05%  | ±0.35% -33.30%  |
| ParquetWriterBench | bench_write_batch               | 1    | 3   | 11.878mb -14.76% | 194.819ms -12.30% | ±0.53% +339.43% |
| ParquetWriterBench | bench_write_gzip                | 1    | 3   | 10.503mb +0.01%  | 219.362ms +0.79%  | ±0.27% +284.50% |
| ParquetWriterBench | bench_write_row_by_row          | 1    | 3   | 11.878mb -14.76% | 191.675ms -13.29% | ±0.59% +129.92% |
| ParquetWriterBench | bench_write_snappy              | 1    | 3   | 11.878mb -14.76% | 191.043ms -14.29% | ±0.87% -53.09%  |
| ParquetWriterBench | bench_write_uncompressed        | 1    | 3   | 10.124mb +0.01%  | 191.005ms +0.36%  | ±1.23% +15.89%  |
+--------------------+---------------------------------+------+-----+------------------+-------------------+-----------------+

@stloyd stloyd deleted the bugfix/1859 branch September 24, 2025 13:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Excel reader crashes on missing file extension

2 participants