-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Check validity of indices in take_fixed_size_binary #8948
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
This might slow down the take kernel. A second approach I've thought of is that we keep the same iterator and apply the null mask at the end (only if the null count is > 0 for indices). @alamb Could you maye be start the (EDIT: take kernel) benchmarks so we know the effect of this change? Thanks! |
|
run benchmark take_kernels |
|
🤖 |
|
🤖: Benchmark completed Details
|
Thanks! I guess this confirms the performance regression. I'll give the other approach (applying nulls at the end) a shot. |
|
I am always a bit suspicious about benchmarks in the usec range as they are so suseptable to noise. I'll run some more |
|
run benchmark take_kernels |
|
🤖 |
I agree but I think it's possible that the initial change introduced a performance regression. I've pushed a new version that performed better on my PC so hopefully we can see that in the benchmarks. 🤞 Nice robotic sheep for the bot btw! 🐑 |
|
🤖: Benchmark completed Details
|
|
Looking nice now! Let's run it again to make sure it is reproduceable |
|
run benchmark take_kernels |
|
🤖 |
|
🤖: Benchmark completed Details
|
alamb
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great to me -- more correct and faster -- thank you @tobixdev 🚀
2004e60 to
c3226a4
Compare
|
I am sorry -- I accidentally pushed to this branch. I will fix shortly |
|
vv |
|
I am so sorry @tobixdev . I messed up this PR by accident. I did save the code in a new PR: #8981 I think you can revive the PR if you want to, by going to this commit: 885f31e here is the command I used git reset --hard 885f31e26ecIf you push that branch again, I think we'll be able to reopen this PR Alternately, we can just go with #8981 |
|
Hahaha no worries. We can just go with the new one if we can find another reviewer. |
# Which issue does this PR close? - redo of #8948 from @tobixdev - closes #8947 # Rationale for this change I messed up @tobixdev 's PR / branch by accident in #8948 This PR contains the previous commits that were in that PR # What changes are included in this PR? - See #8948 # Are these changes tested? Yes, by CI # Are there any user-facing changes? Less bugs, faster code --------- Co-authored-by: Tobias Schwarzinger <tobias.schwarzinger@tuwien.ac.at>
Which issue does this PR close?
take_fixed_size_binaryDoes Not Consider NULL Indices #8947Rationale for this change
Fix incorrect behavior of take kernel for FixedSizeBinary arrays.
What changes are included in this PR?
Check that the index is valid in the iterator for the result array.
Are these changes tested?
Yes, there is a new test.
Are there any user-facing changes?
Yes, the kernel behaves correctly.