
Fix _get_observation_pairs for conditional parameters. #1166

Merged
ytsmiling merged 7 commits into optuna:master from y0z:conditional-ei
May 14, 2020

Conversation

@y0z
Member

@y0z y0z commented Apr 25, 2020

This PR fixes split_observations so that EI values are calculated consistently for conditional parameters.

Motivation

The current implementation of split_observations in the TPESampler works as follows:

  1. The sampler discards the trials that do not contain the parameter named param_name when it collects observation pairs. As a result, len(config_vals) differs across conditional parameters.
        if param_name not in trial.params:
  2. The sampler then calculates the y* value based on gamma = min(int(np.ceil(0.1 * len(config_vals))), 25).
        n_below = self._gamma(len(config_vals))
  3. Finally, the sampler splits config_vals into the observations with loss_vals < y* (i.e., the observations for l(x)) and the remaining ones (i.e., the observations for g(x)).

Based on my understanding, y*, which determines the integration interval for the EI calculation, should be the same value for all parameters (cf. the TPE paper).
However, the current strategy may set a different y* value for each conditional parameter.
Here is an example of this situation (note: this example is a maximization problem).
In this example, y* for params_classifier is 0.9466666666666667, whereas y* for params_svc_c is 0.32.
This seems to cause a curious EI calculation, i.e., a different integration interval is used for each parameter.
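The mismatch above can be sketched in a few lines (a minimal, self-contained illustration, not Optuna's actual code: the loss values, the y_star helper, and the smaller-is-better ordering are all hypothetical simplifications):

```python
import numpy as np

def gamma(n):
    # TPESampler's default split rule: n_below = min(ceil(0.1 * n), 25).
    return min(int(np.ceil(0.1 * n)), 25)

def y_star(loss_vals):
    # Hypothetical helper: the boundary loss separating l(x) from g(x),
    # assuming smaller losses are better.
    loss_vals = sorted(loss_vals)
    return loss_vals[gamma(len(loss_vals)) - 1]

# Hypothetical data: every trial has params_classifier, but only the
# trials that chose SVC have params_svc_c.
all_trial_losses = [0.1 * i for i in range(1, 13)]   # 12 trials
svc_trial_losses = [0.3, 0.7, 1.1]                   # 3 trials with params_svc_c

# Because trials lacking the parameter are discarded *before* the split,
# each parameter ends up with its own y*:
print(y_star(all_trial_losses))   # threshold for params_classifier
print(y_star(svc_trial_losses))   # a different threshold for params_svc_c
```

The two printed thresholds differ, so each parameter's l(x)/g(x) split uses a different integration interval.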

Description of the changes

This PR changes the split_observations procedure as follows:

  1. The sampler does not discard the trials that lack the parameter named param_name when it collects observation pairs.
  2. The sampler then calculates the y* value based on gamma = min(int(np.ceil(0.1 * len(config_vals))), 25). This y* is consistent across all hyperparameters.
  3. After that, the sampler splits config_vals into the observations for l(x) and the observations for g(x).
  4. Finally, the sampler discards, from the observations for l(x) and for g(x), the config_vals whose trials do not contain the parameter named param_name.
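The reordered steps can be sketched as follows (an illustrative simplification, not the actual patch; encoding parameter absence as None/NaN is an assumption made for this sketch):

```python
import numpy as np

def gamma(n):
    # Same split rule as before: n_below = min(ceil(0.1 * n), 25).
    return min(int(np.ceil(0.1 * n)), 25)

def split_observation_pairs(config_vals, loss_vals):
    """Sketch of the proposed order: config_vals[i] is None when the
    i-th trial does not contain param_name (smaller loss is better)."""
    config_vals = np.asarray(config_vals, dtype=float)  # None becomes NaN
    loss_vals = np.asarray(loss_vals)

    # Steps 1-2: y* is computed from ALL trials, so it is shared by
    # every parameter.
    n_below = gamma(len(config_vals))
    order = np.argsort(loss_vals)
    below = config_vals[order[:n_below]]  # observations for l(x)
    above = config_vals[order[n_below:]]  # observations for g(x)

    # Steps 3-4: split first, then drop trials where the parameter is
    # absent (NaN entries).
    return below[~np.isnan(below)], above[~np.isnan(above)]

below, above = split_observation_pairs(
    [3.0, None, 1.0, None, 2.0],   # param value, or None if absent
    [0.1, 0.2, 0.3, 0.4, 0.5],     # corresponding losses
)
```

With five trials, gamma(5) is 1, so only the best trial lands in l(x); trials missing the parameter are filtered out of both sides only after the shared split.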

I have also confirmed that the split_observations procedure in the original TPE implementation in HyperOpt works this way.
https://github.com/hyperopt/hyperopt/blob/master/hyperopt/tpe.py#L625
In that implementation, o_vals contains only the config_vals from trials that include the parameter named param_name, whereas l_vals contains all of the losses observed so far.

My apologies if I am misunderstanding the algorithm and please close this PR.

@codecov-io

codecov-io commented Apr 25, 2020

Codecov Report

Merging #1166 into master will increase coverage by 0.03%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #1166      +/-   ##
==========================================
+ Coverage   90.99%   91.02%   +0.03%     
==========================================
  Files         142      142              
  Lines       12179    12280     +101     
==========================================
+ Hits        11082    11178      +96     
- Misses       1097     1102       +5     
Impacted Files Coverage Δ
optuna/samplers/tpe/parzen_estimator.py 96.96% <100.00%> (+0.09%) ⬆️
optuna/samplers/tpe/sampler.py 88.40% <100.00%> (+0.23%) ⬆️
.../samplers_tests/tpe_tests/test_parzen_estimator.py 90.90% <100.00%> (ø)
tests/samplers_tests/tpe_tests/test_sampler.py 99.57% <100.00%> (+0.01%) ⬆️
optuna/pruners/hyperband.py 86.95% <0.00%> (-5.24%) ⬇️
optuna/samplers/random.py 84.44% <0.00%> (-1.56%) ⬇️
tests/samplers_tests/test_cmaes.py 100.00% <0.00%> (ø)
.../integration_tests/allennlp_tests/test_allennlp.py 100.00% <0.00%> (ø)
tests/test_study.py 98.26% <0.00%> (+0.02%) ⬆️
... and 11 more

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f614d69...f3ebf31. Read the comment docs.

@crcrpar crcrpar added the optuna.samplers label (Related to the `optuna.samplers` submodule; automatically labeled by github-actions) Apr 25, 2020
Member

@ytsmiling ytsmiling left a comment


Sincere thanks for your contribution.

Comment on lines +143 to +147

    if not self._parzen_estimator_parameters.consider_prior and (n_below == 0 or n_above == 0):
        return self._random_sampler.sample_independent(
            study, trial, param_name, param_distribution
        )

Member


Is this fall-back consistent with the hyperopt implementation? cc: @HideakiImamura

Member Author


I have looked through the HyperOpt code.
It seems that HyperOpt does not provide an option to disable the use of a prior.
Therefore, HyperOpt has no fall-back for a situation like this.

The current fall-back is the same as the one used when n < self._n_startup_trials.
https://github.com/optuna/optuna/blob/master/optuna/samplers/tpe/sampler.py#L139

Another idea is to simply ignore the self._parzen_estimator_parameters.consider_prior flag and enable the prior in this situation.

Member


I agree with @y0z. hyperopt does not have any option to disable the prior.
I think ignoring the consider_prior flag is a good idea in this situation, since it is consistent with the normal situation.

@ytsmiling
Member

When there are multiple workers, a similar problem will remain even after this PR is merged. Another PR should address this issue. Related issue: #1170.

…flag when the number of observations is zero.
@y0z
Member Author

y0z commented Apr 28, 2020

Thank you for your feedback.
I have changed the fall-back procedure.
Now, when the number of observations is zero, the Parzen estimator ignores the consider_prior flag and utilizes a prior.
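The agreed fall-back can be sketched as follows (illustrative only: build_estimator_weights and its uniform weighting are hypothetical stand-ins, not the ParzenEstimator's real weighting scheme):

```python
def build_estimator_weights(observations, consider_prior):
    # Force the prior when there is nothing to estimate from, so the
    # resulting mixture is never empty; otherwise respect the flag.
    use_prior = consider_prior or len(observations) == 0
    n_components = len(observations) + (1 if use_prior else 0)
    # Uniform mixture weights over the observations plus the optional prior.
    return [1.0 / n_components] * n_components
```

For example, with zero observations and consider_prior=False, the mixture still contains exactly one component (the prior), so sampling remains well defined.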

@github-actions
Contributor

This pull request has not seen any recent activity.

@github-actions github-actions bot added the stale label (Exempt from stale bot labeling) May 12, 2020
HideakiImamura
HideakiImamura previously approved these changes May 13, 2020
Member

@HideakiImamura HideakiImamura left a comment


Sorry for the delay in reviewing your update. LGTM!

@HideakiImamura HideakiImamura removed the stale label (Exempt from stale bot labeling) May 13, 2020
@HideakiImamura HideakiImamura dismissed their stale review May 13, 2020 07:47

Sorry for confusion. I think we need some benchmark experiments to quantify the proposed improvement.

@HideakiImamura
Member

@y0z I think we need a benchmark comparison between the existing TPE and the proposed one to quantify the improvement, but that may take quite a lot of time. I think it is fine to leave a comment like the following in the code without running any benchmark experiments:

Note: When the number of observations is zero, the Parzen estimator ignores the consider_prior flag and utilizes a prior. Validation of this approach is future work.

Member

@HideakiImamura HideakiImamura left a comment


Thank you for your quick response! LGTM!

Member

@ytsmiling ytsmiling left a comment


Thank you for your PR (and sorry for the late decision).
LGTM!

@ytsmiling ytsmiling merged commit 8e813c9 into optuna:master May 14, 2020
@hvy hvy added this to the v1.5.0 milestone May 14, 2020
@hvy hvy added the enhancement label (Change that does not break compatibility and does not affect public interfaces, but improves performance) May 14, 2020
@hvy hvy changed the title Fix split_observations for conditional parameters Fix _get_observation_pairs for conditional parameters. May 14, 2020

6 participants