-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-5880: [C++][Parquet] Use TypedBufferBuilder instead of ArrayBuilder in writer.cc #4827
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
emkornfield
commented
Jul 9, 2019
- Other small cleanups/comments based on my grokking of the code.
- Vectorize inner loop of List based LevelBuilder where possible
cpp/src/parquet/arrow/writer.cc
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| // arrays otherwise we would preallocate. An upper boun on memory | |
| // arrays otherwise we would preallocate. An upper bound on memory |
|
@bkietz thanks for the review, all good suggestions. |
|
Added some missing exports. Beware as you make more changes on the Parquet codebase that my patch #4841 is slightly disruptive to the src/parquet/arrow directory (but should not be too bad to rebase) |
|
@wesm thanks for the fixes. I will look more closely as I modify code to watch out for potential merge conflicts (I have enough other things on my plate that it might be a few days anyways before I get back to this). I restarted the failing travis-CI job which looked spurious |
a14fbe5 to
94ce3e5
Compare
Co-Authored-By: Benjamin Kietzman <bengilgit@gmail.com>
Co-Authored-By: Benjamin Kietzman <bengilgit@gmail.com>
Co-Authored-By: Benjamin Kietzman <bengilgit@gmail.com>
94ce3e5 to
2f77313
Compare
Codecov Report
@@ Coverage Diff @@
## master #4827 +/- ##
==========================================
+ Coverage 87.31% 89.05% +1.74%
==========================================
Files 894 718 -176
Lines 134066 100316 -33750
==========================================
- Hits 117055 89334 -27721
+ Misses 16659 10982 -5677
+ Partials 352 0 -352
Continue to review full report at Codecov.
|
wesm
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
…lder in writer.cc - Other small cleanups/comments based on my grokking of the code. - Vectorize inner loop of List based LevelBuilder where possible Closes #4827 from emkornfield/parquet_cleanup and squashes the following commits: 2f77313 <Wes McKinney> Export operator<< symbols for DataType, TimeUnit on Windows e1698e5 <Micah Kornfield> fix compilation issues 152a6ee <Micah Kornfield> change back to signed it seems to be what the API uses 67ccf3e <Micah Kornfield> update references ef7fade <Micah Kornfield> remove unneeded nesting 6776e61 <emkornfield> Update cpp/src/parquet/arrow/writer.cc 7cbc8e4 <emkornfield> Update cpp/src/parquet/arrow/writer.cc 83ee5ae <emkornfield> Update cpp/src/parquet/arrow/writer.cc 325af52 <Micah Kornfield> undo reserve 4516110 <Micah Kornfield> Small style issues 7795352 <Micah Kornfield> change from array builder to buffer builde Lead-authored-by: Micah Kornfield <emkornfield@gmail.com> Co-authored-by: emkornfield <emkornfield@gmail.com> Co-authored-by: Wes McKinney <wesm+git@apache.org> Signed-off-by: Wes McKinney <wesm+git@apache.org>