Remove entry object from Column Statistics #1560

@norberttech

Description

When a pipeline is executed with analyze()->withColumnStatistics(), each column's statistics hold a reference to the first entry from the dataframe.

This is problematic because the entries in a column can differ from row to row. For example, the value in the first row might be null and the value in the second row might be a date_time. In that case the column statistics will hold a reference to a nullable StringEntry with metadata FROM_NULL, which does not describe the later date_time values.

The entry reference should be removed; it does not serve any purpose there.
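To make the problem concrete, here is a minimal, language-neutral sketch in Python. It is not Flow's code or API: Entry, infer_entry, ColumnStatsWithEntry and ColumnStats are hypothetical stand-ins, and the rule that a null value is inferred as a nullable string entry tagged FROM_NULL is assumed from the description above. It only illustrates why a stored first entry goes stale and what the statistics could look like without it.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Optional, Tuple


@dataclass(frozen=True)
class Entry:
    """Hypothetical stand-in for a dataframe entry."""
    name: str
    value: Any
    type: str
    metadata: Tuple[str, ...] = ()


def infer_entry(name: str, value: Any) -> Entry:
    # Assumption: a null value is inferred as a nullable string entry
    # tagged FROM_NULL, as the issue describes.
    if value is None:
        return Entry(name, None, "string", ("FROM_NULL",))
    if isinstance(value, datetime):
        return Entry(name, value, "date_time")
    return Entry(name, value, "string")


@dataclass
class ColumnStatsWithEntry:
    """Current behaviour as described: keeps the first entry it sees."""
    first_entry: Optional[Entry] = None
    total: int = 0
    nulls: int = 0

    def add(self, entry: Entry) -> None:
        if self.first_entry is None:
            self.first_entry = entry  # never updated afterwards
        self.total += 1
        if entry.value is None:
            self.nulls += 1


@dataclass
class ColumnStats:
    """Proposed shape: aggregates only, no entry reference."""
    name: str
    total: int = 0
    nulls: int = 0

    def add(self, entry: Entry) -> None:
        self.total += 1
        if entry.value is None:
            self.nulls += 1


# First row is null, second row is a date_time.
values = [None, datetime(2025, 1, 1)]

stale = ColumnStatsWithEntry()
clean = ColumnStats("created_at")
for value in values:
    entry = infer_entry("created_at", value)
    stale.add(entry)
    clean.add(entry)

# The stored entry still reflects only the first (null) row.
print(stale.first_entry.type, stale.first_entry.metadata)
# The aggregate-only statistics carry no misleading entry.
print(clean)
```

In this sketch the counters are identical in both classes; the only difference is the stored entry, which describes the first row and nothing else.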


Status: Done
