Skip to content

Code: 117. DB::Exception: Dictionary page size out of bounds when reading a Parquet file with input_format_parquet_use_native_reader_v3 #87306

@aadant

Description

@aadant

Company or project name

got this error when reading this Parquet file (iris.parquet) in 25.8.1

clickhouse-local --query "select * from file('iris.parquet','Parquet') limit 1 format CSVWithNames  settings input_format_parquet_use_native_reader_v3 = 1"
"sepal.length","sepal.width","petal.length","petal.width","variety"
Code: 117. DB::Exception: Dictionary page size out of bounds: 39 > 26: read stage: MainData: column: variety: (in file/uri /iris.parquet): While executing ParquetV3BlockInputFormat: While executing File. (INCORRECT_DATA)

Describe what's wrong

it works with input_format_parquet_use_native_reader_v3 = 0 but not input_format_parquet_use_native_reader_v3 = 1, so probably a regression

Does it reproduce on the most recent release?

Yes

How to reproduce

download https://github.com/Altinity/ice/blob/master/examples/s3watch/iris.parquet

clickhouse-local --query "select * from file('iris.parquet','Parquet') limit 1 format CSVWithNames  settings input_format_parquet_use_native_reader_v3 = 1"

Expected behavior

No response

Error message and/or stacktrace

No response

Additional context

No response

Metadata

Metadata

Assignees

Labels

comp-formatsInput/output formats (CSV/JSON/Parquet/ORC/Arrow/Protobuf/etc.).potential bugTo be reviewed by developers and confirmed/rejected.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions