Skip to content

Conversation

@geoffreyclaude
Copy link
Contributor

Which issue does this PR close?

N/A - benchmark fix discovered during performance analysis of #18832.

Rationale for this change

The in_list benchmark (introduced in #4068) had inverted null generation logic: null_percent=0 was producing 100% nulls instead of 0% nulls.

What changes are included in this PR?

Fix the random_bool(null_percent).then(...) pattern to use random_bool(1.0 - null_percent) so that null_percent correctly represents the percentage of null values.

Are these changes tested?

Benchmark-only change. Verified by running the benchmark and observing expected performance characteristics.

Are there any user-facing changes?

No.

@adriangb adriangb force-pushed the fix/in_list_benchmark branch from c5e8898 to e1e4da3 Compare December 8, 2025 17:39
@adriangb adriangb added this pull request to the merge queue Dec 8, 2025
Merged via the queue into apache:main with commit 662a3ba Dec 8, 2025
14 checks passed
github-merge-queue bot pushed a commit that referenced this pull request Dec 8, 2025
## Which issue does this PR close?

N/A - benchmark improvement

Stacked on top of #19204

## Rationale for this change

We need to measure InList performance on both StringArray (Utf8) and
StringViewArray (Utf8View) to compare Arrow's string representations.

## What changes are included in this PR?

Add Utf8View benchmarks for InList, refactored with generics to make
adding new array types trivial.

## Are these changes tested?

Benchmark-only change.

## Are there any user-facing changes?

No.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

physical-expr Changes to the physical-expr crates

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants