Skip to content

Conversation

@HangyuanLiu
Copy link
Contributor

@HangyuanLiu HangyuanLiu commented Aug 7, 2019

When you import the parquet type file directory generated by SPARK.
you might have multiple files, but some of them have only headers and no data.
But this is a normal situation that can lead to overall failure.

#1594

_cur_file_reader = new ParquetReaderWrap(file_reader.release());
return _cur_file_reader->init_parquet_reader(_src_slot_descs);
Status status = _cur_file_reader->init_parquet_reader(_src_slot_descs);
if(status.is_end_of_file()) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
if(status.is_end_of_file()) {
if (status.is_end_of_file()) {

@imay imay merged commit 9402456 into apache:master Aug 7, 2019
@imay imay mentioned this pull request Sep 26, 2019
@HangyuanLiu HangyuanLiu deleted the fix-parquet-empty-file branch May 30, 2020 12:54
luwei16 added a commit to luwei16/incubator-doris that referenced this pull request Apr 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants