-
Notifications
You must be signed in to change notification settings - Fork 4k
Closed
Milestone
Description
ARROW-13620 added a binding for n_distinct() but it only works for grouped aggregation, not whole-table aggregation.
This works:
Table$create(starwars) %>%
group_by(homeworld) %>%
summarise(n_distinct(species)) %>%
collect()but this errors:
Table$create(starwars) %>%
summarise(n_distinct(species)) %>%
collect()
#> Error: Key error: No function registered with name: count_distinctOnce we have a non-hash count_distinct aggregate kernel in the C++ library (ARROW-14035) we should bind the options for it in the R package and add a test.
Reporter: Ian Cook / @ianmcook
Assignee: Percy Camilo Triveño Aucahuasi / @aucahuasi
Related issues:
- [R] Binding for n_distinct() (relates to)
- [C++][Compute] Implement non-hash count_distinct aggregate kernel (depends upon)
Note: This issue was originally created as ARROW-14036. Please see the migration documentation for further details.