Skip to content

colexec: improve unordered distinct #44404

@yuzefovich

Description

@yuzefovich

When #42522 goes in, we will have the support for unordered distinct. It works by building a hash table over the whole input and storing all the tuples from the input in the hash table, and then iterating over it and outputting only "head" tuples. This is quite inefficient and should be improved. Namely, we'll need to modify the hash table to not store duplicates at all.

Metadata

Metadata

Assignees

Labels

C-enhancementSolution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions