
Conversation

@desilinguist (Collaborator) commented Feb 15, 2021

This PR mainly closes #488. It also closes #524 and closes #584.

New VotingLearner class

Relevant files: learner/voting.py and learner/utils.py.

  • The main contribution of this PR is to allow SKLL to use the VotingClassifier and VotingRegressor learners from scikit-learn. This was not as straightforward as adding a regular learner since this is the first true meta-learner class we have added to SKLL. By meta-learner, I mean a class that builds on top of SKLL's Learner class. This meta-learner class is called VotingLearner.

  • The implementation of VotingLearner provides the same 7 methods as the original Learner class: train(), cross_validate(), evaluate(), predict(), learning_curve(), from_file(), and save(). There is some minor code duplication between these methods and the corresponding Learner methods, but most of it was avoided thanks to last year's refactoring of the common code into utility functions; some of those functions were changed to accommodate the new meta-learner. In addition, there is some new refactoring, primarily for the from_file() and save() methods of the two classes, which now use the refactored functions _save_learner_to_disk() and _load_learner_from_disk().

  • The implementation supports passing in keyword arguments to the underlying learners, as well as samplers and sampler arguments.

  • The train() implementation nicely incorporates grid search such that the learners underlying the voting meta-learner are automatically tuned (assuming the user requests it) before their predictions are used for voting. Grid search also works with cross_validate() although, as expected, this is much slower since the per-fold grid search is now done for each underlying learner. Both methods also accept a list of parameter grids for tuning the underlying learners.

  • The evaluate() and predict() methods support returning (and writing out) not only the meta-learner's predictions but also the predictions from the underlying learners that were used in the voting process. (A minimal usage sketch follows this list.)
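
To make the API concrete, here is a minimal usage sketch. The import path and the keyword names shown (voting, grid_objective, individual_predictions) are assumptions based on the description above rather than verified signatures; doc/api/learner.rst has the authoritative documentation.

```python
from skll.data import Reader
from skll.learner.voting import VotingLearner

# Load training and test data into SKLL FeatureSets.
fs_train = Reader.for_path("train.jsonlines").read()
fs_test = Reader.for_path("test.jsonlines").read()

# A soft-voting classifier over three underlying SKLL learners.
# (The keyword name is assumed here; the configuration-file
# equivalent described below is voting_type.)
voter = VotingLearner(["LogisticRegression", "RandomForestClassifier", "MultinomialNB"],
                      voting="soft")

# Each underlying learner is tuned with grid search (if requested)
# before its predictions are used for voting.
voter.train(fs_train, grid_objective="accuracy")

# Ask for the underlying learners' predictions along with the final vote;
# the exact shape of the return value is described in the API docs.
predictions = voter.predict(fs_test, individual_predictions=True)
```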

Integration with run_experiment

Relevant files: config/__init__.py, experiments/__init__.py, and experiments/utils.py.

  • The VotingLearner class can be used via the SKLL API as described in the previous section. This PR also includes hooks that allow users to specify VotingClassifier and VotingRegressor as their chosen learners in an experiment configuration file as input to run_experiment. The hooks are set up to require users to specify the underlying estimators as fixed parameters using the estimator_names key. The following additional fixed parameters can also be specified: voting_type, estimator_fixed_parameters, estimator_samplers, estimator_sampler_parameters, and estimator_param_grids. These parameters are fully documented and example configuration files are also included (see documentation section below for details).

  • A new configuration field called save_votes is added to allow the user to save the predictions from the underlying learners in addition to the predictions from the VotingClassifier or VotingRegressor. The default value for this field is False.

  • A new JSON encoder called PipelineTypeEncoder() was added to support serializing VotingLearner instances to JSON, which was necessary since these instances in turn contain Pipeline instances. (A rough sketch of such an encoder follows this list.)
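
For illustration only, here is a rough sketch of that kind of encoder. It is not the actual implementation in experiments/utils.py, just the general pattern of teaching json.dumps() to handle values that contain a scikit-learn Pipeline.

```python
import json

from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline


class PipelineTypeEncoder(json.JSONEncoder):
    """Sketch of a JSON encoder that tolerates scikit-learn Pipeline values."""

    def default(self, obj):
        # Serialize Pipeline objects as a readable string instead of
        # letting json.dumps() raise a TypeError; fall back to the
        # default behavior for everything else.
        if isinstance(obj, Pipeline):
            return str(obj)
        return super().default(obj)


# Example: a dictionary containing a Pipeline can now be serialized.
info = {"learner": "VotingClassifier",
        "pipeline": Pipeline([("clf", LogisticRegression())])}
print(json.dumps(info, cls=PipelineTypeEncoder))
```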

New tests

Relevant files: tests/test_voting_learners_api_*.py, tests/test_voting_learners_expts_*.py, and tests/utils.py.

  • The artificially constructed datasets we use for the existing tests are not very useful for VotingLearner tests since they are either too small or too toy-like. Therefore, we use the digits and California housing datasets (included in scikit-learn) for the classification and regression tests respectively. To make it easy to use these datasets, two new utility functions are added: make_digits_data() and make_california_housing_data(). (A rough sketch of make_digits_data() appears after this list.)

  • Note that we were already using a version of the digits dataset in the learning curve tests for the Learner class. This use was refactored to use the new make_digits_data() utility function.

  • All of the API tests exercise the methods of the VotingLearner class by first calling a method on an instantiated VotingLearner in SKLL space, then using only scikit-learn functions to perform (nearly) identical operations, and then comparing the two sets of results to make sure they are as close as possible. Most of the classification tests only compare those results up to 2 decimal places because there are some inherent differences between scikit-learn and SKLL that make it difficult to replicate SKLL operations exactly in scikit-learn space. Most of the regression tests are able to compare more decimal places since no probabilities are involved.

  • Since the API tests are so comprehensive, there is no real reason for the experiment tests to run real experiments since the same API methods are called from within run_configuration() anyway. For this reason, we focus all of the experiment tests on making sure that the right methods are called based on the "task" value and that they are called with the right arguments derived from the fields specified in the configuration file. To do so, we mock the appropriate API methods, call run_configuration() on different configuration files, and check that the mocked methods were called the expected number of times and with the expected arguments.

  • To make experiment testing easier, we add a new utility function called fill_in_config_options_for_voting_learners() that takes an empty configuration file template (tests/configs/test_voting_learner.template.cfg) and populates it with the right values depending on the arguments with which the function was called. In addition, a new class called BoolDict is added that returns False as the default value for key lookups rather than None, which simplifies the fill_in_config_options_for_voting_learners() function significantly. (A sketch of BoolDict also appears after this list.)

  • New tests were added to test_input.py for the save_votes field. In addition, existing tests in the same file were updated to accommodate this new field.

  • Note that there is still some code duplication between the test files, but combining them all into a single file would add a lot more complexity (if statements and the like). This way, each test file is fully self-contained and can be run independently.
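
As a concrete illustration of the dataset utilities mentioned in the first bullet above, here is a rough sketch of what make_digits_data() might look like. The real version in tests/utils.py does more, so the signature and behavior below are illustrative only.

```python
from sklearn.datasets import load_digits

from skll.data import FeatureSet


def make_digits_data(num_examples=None):
    """Wrap scikit-learn's digits data in a SKLL FeatureSet (illustrative sketch)."""
    digits = load_digits()
    data, labels = digits.data, digits.target
    if num_examples is not None:
        data, labels = data[:num_examples], labels[:num_examples]

    # SKLL FeatureSets are built from parallel lists of IDs and labels
    # plus a list of feature dictionaries.
    features = [{f"f{j:02d}": value for j, value in enumerate(row)}
                for row in data]
    ids = [f"EXAMPLE_{i}" for i in range(len(features))]
    return FeatureSet("digits", ids, labels=list(labels), features=features)
```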
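
Similarly, one possible shape for the BoolDict helper mentioned above; the actual class in tests/utils.py may differ in detail.

```python
class BoolDict(dict):
    """Dictionary that returns False for missing keys instead of raising
    KeyError (or returning None, as dict.get() would)."""

    def __getitem__(self, key):
        # Defaulting to False lets fill_in_config_options_for_voting_learners()
        # look up any boolean option without first checking whether the
        # caller actually supplied it.
        return super().get(key, False)
```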

Documentation

Relevant files: doc/run_experiment.rst, doc/api/learner.rst, examples/iris/voting.cfg, examples/boston/voting.cfg, and others (see below).

  • Updated the run_experiment documentation to include a detailed description of VotingClassifier and VotingRegressor and added an entry for the save_votes configuration field. Note that some of the links will only work after the PR is merged.
  • Updated doc/api/learner.rst to include the VotingLearner class and improved sub-headings.
  • Added new configuration files illustrating the use of VotingClassifier and VotingRegressor for the Iris and Boston examples respectively.
  • Updated doc/contributing.rst page for readability and fixed links to existing methods. Note that some of the links will only work after the PR is merged.
  • Since top-level imports have been removed, deleted api/skll.rst and updated api/quickstart.rst and the Tutorial notebook.
  • Updated doc/conf.py to fix imports and changed year from 2019 to 2021.

Other changes

  • There was a major bug when using samplers: we were calling fit_transform() on the test set rather than transform(). This was fixed and tests/test_classification.py:test_sampler:test_sparse_predict_sampler() was updated. (See the sketch after this list.)

  • A new parallel build job was added to both Travis and Azure CI builds to accommodate the new tests. Test files were redistributed across all 6 jobs to make sure that the overall build time is still optimized.

  • The Travis CI configuration now creates a new conda environment rather than using the default miniconda one. In addition, it does not activate said environment when running the tests. Finally, it also configures nosetests via environment variables for simplicity.

  • Both Azure CI and Travis CI now set the logging level for tests to WARNING and do not use --nologcapture, which significantly reduces the size of the logs produced.

  • The warning and error messages in Learner.learning_curve() have been tweaked to be more concise.
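
For context on the sampler fix mentioned above, here is the general scikit-learn pattern (with randomly generated feature arrays purely for illustration): the sampler is fitted on the training features only and then merely applied to the test features.

```python
import numpy as np
from sklearn.kernel_approximation import Nystroem

rng = np.random.RandomState(42)
train_features = rng.rand(100, 20)
test_features = rng.rand(25, 20)

sampler = Nystroem(n_components=50, random_state=42)

# Correct: fit the sampler on the training set only ...
train_transformed = sampler.fit_transform(train_features)

# ... and re-use the already-fitted sampler on the test set.
# The bug was effectively calling fit_transform() here as well,
# which silently re-fits the sampler on the test data.
test_transformed = sampler.transform(test_features)
```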

Commit notes

- This is useful since it can then be used by both the regular learners and the voting learners.
- Add a docstring for `cross_validate()`.
- Add `learning_curve()`.
- Use refactored code where possible.
- Use positional arguments instead of the `.args` and `.kwargs` accessors, which do not seem to be supported on Python 3.7.
- Fix existing tests.
- Set BINDIR since we are now running tests without activating the conda environment.
- Set nose options as environment variables to make the command shorter.
- Use `travis_wait` for the longest-running test job to avoid early termination.
- Update the run_experiment documentation to include a detailed description of `VotingClassifier` and `VotingRegressor`. Also add an entry for the `save_votes` configuration field.
- Add new voting configuration files for the Iris and Boston examples.
- Update the contributing page for readability and fix links.
- Remove all top-level imports from documentation pages and the tutorial notebook.
- Update the Learner API documentation to include the `VotingLearner` class and improve sub-headings.
- Update docstrings for voting learners to make them more readable.
- Update the Sphinx configuration (year and imports).
@desilinguist requested review from a user, aoifecahill, and mulhod on February 15, 2021 at 23:14

codecov bot commented Feb 15, 2021

Codecov Report

Merging #665 (63fe693) into main (13c19aa) will increase coverage by 1.66%.
The diff coverage is 98.42%.


@@            Coverage Diff             @@
##             main     #665      +/-   ##
==========================================
+ Coverage   95.09%   96.76%   +1.66%     
==========================================
  Files          27       63      +36     
  Lines        3100     9077    +5977     
==========================================
+ Hits         2948     8783    +5835     
- Misses        152      294     +142     
Impacted Files Coverage Δ
skll/learner/utils.py 93.39% <84.21%> (-0.94%) ⬇️
skll/experiments/utils.py 93.16% <90.90%> (-0.30%) ⬇️
skll/experiments/__init__.py 95.14% <95.91%> (-0.05%) ⬇️
tests/utils.py 97.89% <96.10%> (ø)
tests/test_voting_learners_expts_1.py 98.38% <98.38%> (ø)
skll/learner/voting.py 98.42% <98.42%> (ø)
tests/test_voting_learners_expts_5.py 98.48% <98.48%> (ø)
tests/test_voting_learners_expts_4.py 98.76% <98.76%> (ø)
tests/test_voting_learners_expts_2.py 98.82% <98.82%> (ø)
tests/test_voting_learners_expts_3.py 98.82% <98.82%> (ø)
... and 47 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

Last update 13c19aa...63fe693.

@mulhod (Contributor) left a comment

I have a couple questions and comments. (I have not done a full review yet, though. I just want to break it up since I won't get back to this till tomorrow probably.)

It looks really awesome so far! Exciting.

@mulhod (Contributor) left a comment

Minor typos.

@mulhod (Contributor) left a comment

Really nice! Exciting changes!

@aoifecahill (Collaborator) left a comment

Looks great, thanks! I've tried it out (including navigating the documentation) and managed to successfully run and revise an experiment based on the outputs/documentation.

@desilinguist merged commit dc5ed05 into main on Feb 23, 2021
The delete-merged-branch bot deleted the add-voting-learners branch on February 23, 2021 at 01:58