Skip to content

Unexpected uncompressed page size: read stage: MainData: column: bool - with parquet native reader v3 #87396

@Selfeer

Description

@Selfeer

Company or project name

No response

Describe what's wrong

I get the following exception when using native reader v3:

Error on processing query: Code: 117. DB::Exception: Unexpected uncompressed page size: read stage: MainData: column: bool: (in file/uri /var/lib/clickhouse/user_files/file_6bb64ca7_97a4_11f0_88de_498af214c503.Parquet): While executing ParquetV3BlockInputFormat: While executing File: data for INSERT was parsed from file. (INCORRECT_DATA) (version 25.8.4.13 (official build))

I wasn't able to create smaller steps to reproduce the issue but the sequence is ass follows:

  1. I create a table with a schema containing a lot of columns with different datatypes
  2. I insert the data from that table into a FUNCTION file()
  3. Insert the data from the generated parquet file back to the table

Here is the gist with all the steps as the queries are too big to fit here: https://gist.github.com/Selfeer/1acfea443d1155b7fc834025b2c33eda

Does it reproduce on the most recent release?

Yes

How to reproduce

ClickHouse version: 25.8.4.13
Settings: input_format_parquet_use_native_reader_v3 = 1

Expected behavior

No response

Error message and/or stacktrace

No response

Additional context

No response

Metadata

Metadata

Assignees

Labels

comp-formatsInput/output formats (CSV/JSON/Parquet/ORC/Arrow/Protobuf/etc.).potential bugTo be reviewed by developers and confirmed/rejected.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions