Add W&B logging #761

tamarl08 · 2024-01-11T19:28:50Z

This PR adds W&B logging for all task types.

Changes include:

wandb logging for each task - see links and details below.
using the return value of _classify_featureset directly (This part is not tested for the grid use case).
exposing some parameters in the result dict so wandb logger can access them. this will affect the results.json file.

Examples:

train: currently only logs the configuration, model file path and train set size (in run summary).

predict: logs configuration, train and test sizes and predictions file

learning curve: logs config, plots and some raw data from the learning curve results (in run summary)

evaluate: logs config and all evaluation metrics and confusion matrix as a chart.

cross_validate: logs evaluation per fold + average of folds

TODO:

update test data
add tests of new functionality (?)
revert changes in examples cfg files before merging

desilinguist

@tamarl08 this looks pretty elegant for the most part! I like that you added what you needed to the learner result dictionaries and then just use those for logging most of the time.

I made some minor suggestions based on the code itself but without testing what the various W&B outputs look like. I will do that next.

skll/utils/wandb.py

skll/experiments/__init__.py

skll/experiments/output.py

desilinguist · 2024-01-11T21:08:03Z

@tamarl08 any idea why gitlab is failing and Azure is passing?

tamarl08 · 2024-01-11T21:13:17Z

Will check. I didn't expect anything to pass!

desilinguist · 2024-01-12T14:41:32Z

I am seeing things like this in my run:

I am also seeing charts with a single data point (e.g., the accuracy values etc.) which aren't useful. I wonder if we can tell W&B to create relevant charts from the summary file on the fly? Let's sit down next week and try to figure out what things are actually valuable to log and how to make a run appear useful right when someone opens it.

tamarl08 · 2024-01-12T15:46:22Z

This is why I tried to log to summary instead, but I couldn't always get rid of these charts. I'll try some more and let's talk next week.

…in tests and fix some tests. Some code fixes and improvement.

pep8speaks · 2024-01-25T16:29:40Z

Hello @tamarl08! Thanks for updating this PR.

In the file tests/test_regression.py:

Line 452:64: E203 whitespace before ':'

Comment last updated at 2024-01-26 21:47:38 UTC

tamarl08 · 2024-01-25T16:40:53Z

@desilinguist @mulhod @damien2012eng @Frost45
This is ready for review now. I changed the logging of evaluation/cv tasks so that no unneeded charts are logged.

See the updated evaluation logging here - look only at the most recent run.

Changes to tests are due to: a change I made to the job name/output file names; data added to the result dict; bug fixes in some tests.

codecov · 2024-01-25T18:12:41Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (7c09d07) 95.33% compared to head (1dec165) 95.44%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #761      +/-   ##
==========================================
+ Coverage   95.33%   95.44%   +0.11%     
==========================================
  Files          30       30              
  Lines        3598     3688      +90     
==========================================
+ Hits         3430     3520      +90     
  Misses        168      168

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

desilinguist

Looks really good! I ran some test experiments and W&B logging now looks much better than before. I made some minor suggestions. I am assuming that you are going to add documentation in a separate MR?

skll/utils/wandb.py

tests/test_examples.py

tests/test_output.py

tamarl08 · 2024-01-26T21:06:11Z

Thanks for the review @desilinguist! All suggestions applied. I did add a section in the docs, and also a comment about the output file names. Will add more examples later.

desilinguist

Just some minor tweaks for the documentation.

doc/run_experiment.rst

Co-authored-by: Nitin Madnani <nmadnani@ets.org>

mulhod

Since I see others commenting on the docs, I will leave some comments here for now and continue my review after refreshing and/or waiting, etc.

doc/run_experiment.rst

desilinguist

LGTM!

Add logging of all task types to wandb

4b3aedc

tamarl08 requested review from Frost45, damien2012eng, desilinguist and mulhod January 11, 2024 19:28

desilinguist requested changes Jan 11, 2024

View reviewed changes

Tamar Lavee added 7 commits January 18, 2024 09:24

address PR comments: docstring and code improvements. fix most tests

24175f3

fix typo

b9c6135

update docstring

a2c837f

revert removal of test method and fix more PR comments

9a8a6c3

fix object name

60e3d42

add wandb to mypy dependencies

52b75af

W&B log summary instead of individual metrics. Update expected paths …

4086655

…in tests and fix some tests. Some code fixes and improvement.

import wandb run types

a5a47d5

tamarl08 changed the title ~~Draft: Add W&B logging~~ Add W&B logging Jan 25, 2024

Tamar Lavee added 2 commits January 25, 2024 12:13

add model files for voting learner tests

2589778

revert change in example config

5815b79

Tamar Lavee added 2 commits January 25, 2024 20:56

add wandb tests to cover all wandb module

f3242f3

update documentation

6e75262

desilinguist requested changes Jan 26, 2024

View reviewed changes

fix and improve docstrings and test methods

8f279b0

desilinguist reviewed Jan 26, 2024

View reviewed changes

doc/run_experiment.rst Outdated Show resolved Hide resolved

Update doc/run_experiment.rst

1dec165

Co-authored-by: Nitin Madnani <nmadnani@ets.org>

mulhod reviewed Jan 26, 2024

View reviewed changes

doc/run_experiment.rst Outdated Show resolved Hide resolved

doc/run_experiment.rst Outdated Show resolved Hide resolved

doc/run_experiment.rst Outdated Show resolved Hide resolved

desilinguist approved these changes Jan 29, 2024

View reviewed changes

mulhod approved these changes Jan 30, 2024

View reviewed changes

tamarl08 merged commit 6c0c0d4 into main Jan 30, 2024

delete-merged-branch bot deleted the 733-wandb-integration branch January 30, 2024 16:32

Add W&B logging #761

Add W&B logging #761

Uh oh!

Conversation

tamarl08 commented Jan 11, 2024

Uh oh!

desilinguist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

desilinguist commented Jan 11, 2024

Uh oh!

tamarl08 commented Jan 11, 2024

Uh oh!

desilinguist commented Jan 12, 2024

Uh oh!

tamarl08 commented Jan 12, 2024

Uh oh!

pep8speaks commented Jan 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Comment last updated at 2024-01-26 21:47:38 UTC

Uh oh!

tamarl08 commented Jan 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jan 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

desilinguist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tamarl08 commented Jan 26, 2024

Uh oh!

desilinguist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mulhod left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

desilinguist left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pep8speaks commented Jan 25, 2024 •

edited

Loading

tamarl08 commented Jan 25, 2024 •

edited

Loading

codecov bot commented Jan 25, 2024 •

edited

Loading

mulhod left a comment •

edited

Loading