@jqnatividad
Collaborator

to add more columns to the Data Dictionary from the Summary Statistics

Contributor

Copilot AI left a comment

Pull request overview

This PR adds the ability to include additional columns from Summary Statistics in the Data Dictionary generated by the describegpt command. This enhances the dictionary with statistical measures like mean, median, MAD, standard deviation, variance, and coefficient of variation.

Key Changes:

  • Introduces --addl-cols flag to enable additional columns and --addl-cols-list option to specify which columns to include (with a sensible default list of statistical measures)
  • Updates the default stats options to include --mad --quartiles --percentiles for richer statistical context
  • Modifies data structures (DictionaryEntry, StatsRecord) to use IndexMap for preserving column order
  • Enhances output formatting functions (markdown, JSON, TSV) to dynamically include additional columns
  • Adds command line information to attribution metadata
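
The column selection and dynamic header formatting described in the list above can be sketched roughly as follows. This is a minimal, std-only illustration and not the code from the PR: `select_addl_cols`, `markdown_header`, and the contents of `DEFAULT_ADDL_COLS` are hypothetical names and values chosen for the example (the real helper is called determine_addl_cols(), and its signature and default list are not shown here). A plain `Vec` stands in for the order-preserving IndexMap.

```rust
/// Hypothetical default list, based on the measures named in the PR overview.
/// The actual default used by describegpt may differ.
const DEFAULT_ADDL_COLS: &str = "mean,median,mad,stddev,variance,cv";

/// Decide which extra stats columns to show, keeping the requested order.
/// `enabled` mirrors --addl-cols, `list` mirrors --addl-cols-list, and
/// `available` is the set of column names present in the stats output.
fn select_addl_cols(enabled: bool, list: Option<&str>, available: &[&str]) -> Vec<String> {
    if !enabled {
        return Vec::new();
    }
    let mut selected: Vec<String> = Vec::new();
    for name in list.unwrap_or(DEFAULT_ADDL_COLS).split(',') {
        let name = name.trim();
        if name.is_empty() {
            continue;
        }
        // Keep only columns that exist in the stats output, and skip repeats.
        let known = available.iter().any(|a| a.eq_ignore_ascii_case(name));
        let duplicate = selected.iter().any(|s| s.eq_ignore_ascii_case(name));
        if known && !duplicate {
            selected.push(name.to_string());
        }
    }
    selected
}

/// Render a markdown header row: fixed columns first, then the extras.
fn markdown_header(fixed: &[&str], extra: &[String]) -> String {
    let mut cells: Vec<&str> = fixed.to_vec();
    cells.extend(extra.iter().map(String::as_str));
    format!("| {} |", cells.join(" | "))
}

fn main() {
    let available = ["field", "type", "mean", "median", "mad", "stddev"];
    // "skewness" is not in `available`, so it is dropped.
    let cols = select_addl_cols(true, Some("median, mean, skewness"), &available);
    println!("{}", markdown_header(&["Name", "Type"], &cols));
}
```

In this sketch, unknown column names are silently dropped; whether the actual command warns or errors on them is not stated in the overview.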

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| src/cmd/describegpt.rs | Core implementation adding --addl-cols and --addl-cols-list options with support for dynamic additional columns in dictionary output across all formats (markdown, JSON, TSV). Includes new helper function determine_addl_cols() and updates to parsing/formatting logic. |
| Cargo.toml | Enables the "serde" feature for the indexmap dependency to support serialization of the new IndexMap fields. |

@jqnatividad jqnatividad merged commit 52c3d9a into master Dec 10, 2025
13 of 14 checks passed
@jqnatividad jqnatividad deleted the describegpt-addlcols branch December 10, 2025 12:49

2 participants