Skip to content

adding support for low memory dataframes#76

Merged
brifordwylie merged 4 commits intomasterfrom
dataframe_optimization
Sep 3, 2019
Merged

adding support for low memory dataframes#76
brifordwylie merged 4 commits intomasterfrom
dataframe_optimization

Conversation

@brifordwylie
Copy link
Copy Markdown
Member

PR based on some of the concepts in #75 but keeping the 'generator' for bro_log_reader (most folks using that class will definitely need/want a generator instead of a dataframe). Also instead of creating a large dataframe and then compressing we try to bring the data into the dataframe with categorical type directly. Also we make the changes compatible with the Parquet/Arrow conversions.

Note: This PR is still a WIP, we still have lots of improvements and testing to conduct.

@brifordwylie
Copy link
Copy Markdown
Member Author

Addresses issues in #23 and #71

@brifordwylie brifordwylie merged commit ba53323 into master Sep 3, 2019
@brifordwylie brifordwylie deleted the dataframe_optimization branch September 3, 2019 17:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant