Use example IDs when specifying `sample_ids` #613

desilinguist · 2023-07-10T22:51:34Z

Assume that sample_ids are now actual example IDs instead of indices.
Raise a ValueError if anything goes wrong in select_examples.
Update test_explanation_utils.py
- Use actual string IDs in the test data.
- Update all expected outputs to use these new string IDs.
Update relevant test in test_experiment_rsmexplain.py to IDs.
Update rsmexplain documentation
- Update description of sample_ids and move it to the end.
- Fix some typos.

Closes #609.

- Assume that `sample_ids` are now actual example IDs instead of indices. - Raise a `ValueError` if anything goes wrong in `select_examples`.

- Use actual string IDs in the test data. - Update all expected outputs to use these new string IDs.

- Update description of `sample_ids` and move it to the end. - Fix some typos.

codecov · 2023-07-10T23:35:58Z

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (e0a1392) 95.89% compared to head (1cee0cc) 95.90%.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #613   +/-   ##
=======================================
  Coverage   95.89%   95.90%           
=======================================
  Files          59       59           
  Lines        9286     9297   +11     
=======================================
+ Hits         8905     8916   +11     
  Misses        381      381

Impacted Files	Coverage Δ
tests/test_experiment_rsmexplain.py	`96.29% <ø> (ø)`
rsmtool/rsmexplain.py	`92.03% <100.00%> (+0.12%)`	⬆️
tests/test_explanation_utils.py	`98.80% <100.00%> (+0.12%)`	⬆️

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

damien2012eng

LGTM!

It's possible that a given data file has integers as ID. RSMTool will read them as integers and not strings. In that case, we need to make sure that any sample IDs specified in the configuration file are also converted to the same data type as the featureset IDs before being compared.

desilinguist added 6 commits July 10, 2023 17:35

refactor: use actual example IDs for sample_ids

ad52d42

- Assume that `sample_ids` are now actual example IDs instead of indices. - Raise a `ValueError` if anything goes wrong in `select_examples`.

test: update test_explanation_utils.py

37a3400

- Use actual string IDs in the test data. - Update all expected outputs to use these new string IDs.

docs: update rsmexplain documentation

acf75d1

- Update description of `sample_ids` and move it to the end. - Fix some typos.

chore: update gitignore file

e34bf57

fix: allow whitespaces in sample_ids.

383f3f9

test: update test with sample_ids

cdc8e55

desilinguist requested review from damien2012eng and tamarl08 July 10, 2023 22:51

damien2012eng approved these changes Jul 11, 2023

View reviewed changes

desilinguist added 2 commits July 11, 2023 11:36

test: new tests for integer featureset IDs

1cee0cc

tamarl08 approved these changes Jul 11, 2023

View reviewed changes

desilinguist merged commit d372d66 into main Jul 11, 2023

delete-merged-branch bot deleted the 609-use-example-ids-in-samples branch July 11, 2023 17:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use example IDs when specifying `sample_ids` #613

Use example IDs when specifying `sample_ids` #613

Uh oh!

desilinguist commented Jul 10, 2023 •

edited

Loading

Uh oh!

codecov bot commented Jul 10, 2023 •

edited

Loading

Uh oh!

damien2012eng left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use example IDs when specifying sample_ids #613

Use example IDs when specifying sample_ids #613

Uh oh!

Conversation

desilinguist commented Jul 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jul 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

damien2012eng left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use example IDs when specifying `sample_ids` #613

Use example IDs when specifying `sample_ids` #613

desilinguist commented Jul 10, 2023 •

edited

Loading

codecov bot commented Jul 10, 2023 •

edited

Loading