-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-13005: [C++] Add support for take implementation on dense union type #10606
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Thanks for opening a pull request! If this is not a minor PR. Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Opening JIRAs ahead of time contributes to the Openness of the Apache Arrow project. Then could you also rename pull request title in the following format? or See also: |
|
ARROW-13005: [C++] Implementation of take function for dense union data type. |
|
@ZMZ91 Thanks for tackling this slightly delicate issue. I've posted two comments about the chosen approach. |
… update variable declarations
|
Thank you @bkietz. All comments resolved. Please help check. |
|
Hi @bkietz and @pitrou, just a question not quite related to this pr and about dense union array. I've seen the limitation for array data is under 2GB. Does it apply to nested arrays or just the sub arrays inside? For example, dense union array, it has child id array, value offset array and a bunch of child arrays. Each of them should be under 2GB. And is it restricted that the sum under 2GB? |
|
The 2GB limitation applies to buffers and is type specific. It derives from usage of signed 32 bit integers to encode offsets, so for example a StringArray cannot have more than 2GB total character data because otherwise we could not express the final offset. IIUC it's allowed for a child of the union to be an Int8Array with In short, this is not a constraint you need to enforce in this PR. |
bkietz
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks!
I'll merge on green CI
https://issues.apache.org/jira/browse/ARROW-13005