
PiBO Pull Request 1 - Bugfixes#221

Merged
mfeurer merged 71 commits into automl:master from hvarfner:bugfixes
Mar 2, 2022

Conversation

@hvarfner
Contributor

PiBO Pull Request 1 - Bugfixes

Bugfixes / changes to existing repo

Bugfix - Normal HPs

  • Fixed sampling in Normal HPs - previously, logged variables were not sampled from the correct distribution
  • Fixed the definition of parameters in logged Normal HPs - they now follow a log-normal distribution, so mu=1 for a logged variable gives the mode at e, and mu=0 gives the mode at 1
  • Fixed all tests accordingly
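A small numpy sketch (not the ConfigSpace code itself; mu and sigma are illustrative) of what sampling a logged Normal HP should do: draw in log space and exponentiate, so the result is log-normally distributed.

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma = 1.0, 0.5

# Draw in log space, then exponentiate: the samples follow a
# log-normal distribution rather than a transformed/clipped normal.
samples = np.exp(rng.normal(mu, sigma, size=200_000))

# For a log-normal, the median is exp(mu) (mu=1 centers the
# distribution around e, mu=0 around 1); the mode is exp(mu - sigma**2).
print(np.median(samples))
```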

Bugfix - to_uniform

  • Removed unnecessary rounding for float HPs

Bugfix - to_integer

  • Changed incorrect int() rounding, which produced the same result as np.floor().astype(int)
  • Now the lower bound of the float range is rounded up (lower=-1.5 --> -1 for an integer HP) and the upper bound is rounded down (6.5 --> 6), which is obviously guaranteed to be valid, whereas rounding down in both cases is not
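A minimal sketch of the corrected bound conversion (the function name here is mine, not ConfigSpace's): ceil the lower bound and floor the upper bound, so every integer in the result lies inside the original float range.

```python
import math

def to_integer_bounds(lower: float, upper: float) -> tuple[int, int]:
    # Round the lower bound up and the upper bound down; every integer
    # in [lo, hi] is then guaranteed to lie inside [lower, upper].
    return math.ceil(lower), math.floor(upper)

# Flooring both bounds instead would turn lower=-1.5 into -2,
# which lies outside the original range.
print(to_integer_bounds(-1.5, 6.5))  # (-1, 6)
```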

Change - Categorical HPs

  • Categorical HPs always have probabilities (uniform if nothing else is specified)
  • Changed tests accordingly

Commits (truncated titles):

  • hyperparameters.pyx - changed typing
  • Normal parameters - now generated the same way as the uniform ones (but scaled, so that it makes sense even though they are not normalized)
  • Changed back to the standard local search procedure for the normal hyperparameters
  • Moved functionality from the base class down to the subclasses Float, Constant, Integer, Categorical and Ordinal hyperparameters (as these almost certainly can have a well-defined pdf, or it would make sense to consider a pdf for these HP types)
  • The other parameter types - follows as closely as possible
@hvarfner
Contributor Author

Unfortunately, no improvement. I'm unable to get the test_round_trip test to fail locally, so I really don't know how to approach this. The assertion is made on a pair of 32 000-token strings (both of which have the same length in the failed test), so I'm at a loss on this one. Do you have any idea? Is the source code different for the tests?

@eddiebergman
Contributor

eddiebergman commented Feb 28, 2022

I ran this branch locally and got no test failures either. I will rerun these GitHub Actions tests; if they fail again, I'm willing to push this through anyway and just log that this happened as an issue.

And no, the tests running through GitHub that you see below are the exact same tests as you have in the source code. The only difference is the environment they run in (Python versions, conda, how it's installed, Windows/macOS/Linux).

@hvarfner
Contributor Author

Thanks, much appreciated!

@eddiebergman
Contributor

I did some digging and it seems our automated tests are not pulling your latest commit on this PR; they're pulling the merge commit instead. I have no idea why that is... However, I can confirm your latest commit is actually correct, so I would vote to disregard these checks but try to figure out why it pulls the merge commit instead of the most recent one.

@mfeurer thoughts?

@eddiebergman
Contributor

Can you try one last thing, just push an empty commit

git commit --allow-empty -m "Fix?"

@hvarfner
Contributor Author

Sorry, was away for lunch. It's done!

@eddiebergman
Contributor

No problem, thanks for that! It appears it's still pulling the old version and not the latest commit. I'm going to merge this and finally progress on to stage 2. If the error crops up again there, we will need to fix it somehow, but hopefully it disappears.

@eddiebergman
Contributor

Okay, I figured out the problem and could reproduce the issue. The main problem is that your fork is a good few commits behind. You can see this on your fork's page, where it says that you are "10 commits behind automl:master"; this also means that your branches are 10 commits behind.

To make everything up to date and to be able to reproduce the error, you must:

  • Update your master branch fork through github
  • Go to your coding environment and pull in the latest master.
  • Merge in the newest changes from master into your branch bugfixes
  • pytest -k "test_round_trip" will now produce the same error.
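The steps above might look like this on the command line (the remote names origin and upstream are assumptions about the local setup, not something stated in the thread):

```shell
# Assumes 'origin' is your fork and 'upstream' is the automl repository.
git checkout master
git pull upstream master          # bring your local master up to date
git push origin master            # update the fork on GitHub
git checkout bugfixes
git merge master                  # merge in the missing commits
pytest -k "test_round_trip"       # should now reproduce the failure
```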

As for why this error occurs: ideally we should have a test that is a bit more informative about why the two are not equivalent, but I will look through the changes in the 10 commits that diverge and see if anything pops up that I could point you to.

Apologies this has been so complicated. This package is in need of updating all around -_-

@eddiebergman
Contributor

The failing pcs file is SparrowToRiss-cssc14.pcs, which is read in, written out, read back in, and then checked for equality. Somewhere this fails; I'm assuming it has to do with the pcs changes that are in the next PR?

@eddiebergman
Contributor

I checked through the other 2 PRs, and it doesn't seem the pcs file format was fixed at any point?

@hvarfner
Contributor Author

Well, the PCS format was not fixed, but rather disabled for ConfigSpaces that include categoricals.

@eddiebergman
Contributor

eddiebergman commented Feb 28, 2022

I'm not sure that's something we can really do? It basically means ConfigSpace won't allow any kind of exporting for people that use categoricals. I think we would need some solution to this by the end of this PR.

@mfeurer

The only solution I can think of is to only disable it for the new feature, essentially disabling pcs export/import when a categorical distribution is non-uniform. However this is not ideal and really the pcs file format should be updated.

@hvarfner
Contributor Author

Well, since every categorical has weights with this solution, we'd have to make categoricals have weights only when the user specifies them. That seemed like a less consistent solution to me, but I guess it can be changed if you deem it to be the best solution.

@mfeurer
Contributor

mfeurer commented Mar 1, 2022

Good morning. I'm on this. The bug appears because during the JSON serialization of the ConfigSpace SparrowToRiss-cssc14.pcs we lose some precision in the hyperparameter init-act:

From PCS: (0.14285714285714285, 0.14285714285714285, 0.14285714285714285, 0.14285714285714285, 0.14285714285714285, 0.14285714285714285, 0.14285714285714285)
From JSON: (0.14285714285714288, 0.14285714285714288, 0.14285714285714288, 0.14285714285714288, 0.14285714285714288, 0.14285714285714288, 0.14285714285714288)

Now the question is: what's going on here? I'll keep you posted.

@mfeurer
Contributor

mfeurer commented Mar 1, 2022

And the issue is

return tuple(weights / np.sum(weights))

The hyperparameter is serialized to JSON and then de-serialized from JSON. Up to that point everything is fine. But when the constructor is called, it invokes the function _get_probabilities, which in turn tries to normalize the probabilities, creating this numeric instability. The actual issue might be the fact that we renormalize the probabilities, or that the probabilities don't sum to 1 in the first place.

Potentially we need to store and serialize the weights, and have the probabilities as an internal representation only. Then the probabilities will only be computed on __init__ from the original weights, and should always be equal. What do you think?
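A rough sketch of this proposal (the class and method names here are modeled on, but not identical to, ConfigSpace's): serialize only the raw weights and treat the normalized probabilities as a derived, internal value, so a JSON round trip recomputes bit-identical probabilities.

```python
import json

class CategoricalSketch:
    def __init__(self, choices, weights=None):
        self.choices = tuple(choices)
        # Keep the user-supplied weights untouched; only these are serialized.
        self.weights = tuple(weights) if weights is not None else None
        # Probabilities are computed once, from the original weights, so
        # deserializing and re-running __init__ yields identical floats.
        self.probabilities = self._get_probabilities()

    def _get_probabilities(self):
        if self.weights is None:
            return tuple(1.0 / len(self.choices) for _ in self.choices)
        total = sum(self.weights)
        return tuple(w / total for w in self.weights)

    def to_json(self):
        return json.dumps({"choices": self.choices, "weights": self.weights})

    @classmethod
    def from_json(cls, s):
        d = json.loads(s)
        return cls(d["choices"], d["weights"])

hp = CategoricalSketch("abcdefg", weights=[1] * 7)
restored = CategoricalSketch.from_json(hp.to_json())
print(restored.probabilities == hp.probabilities)  # True
```

Because the probabilities are never themselves serialized and renormalized, the round trip cannot introduce the 0.14285714285714285 vs 0.14285714285714288 discrepancy seen above.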

@hvarfner
Contributor Author

hvarfner commented Mar 1, 2022

If you're asking me, I think that seems like a no-brainer. I don't really see why the probabilities need to be stored externally if the weights are already there.

@mfeurer
Contributor

mfeurer commented Mar 1, 2022

If you're asking me, I think that seems like a no-brainer. I don't really see why the probabilities need to be stored externally if the weights are already there.

Great :) Could you please go ahead and store and serialize the weights?

@hvarfner
Contributor Author

hvarfner commented Mar 1, 2022

Sure thing!

@mfeurer
Contributor

mfeurer commented Mar 1, 2022

To simplify testing you can temporarily prune the file SparrowToRiss-cssc14.pcs to only contain:

init-act  {0,1,2,3,4,5,6}[1]       # initialize activities (0=none,1=inc-lin,2=inc-geo,3=dec-lin,4=dec-geo,5=rnd,6=abs(jw))
init-pol  {0,1,2,3,4,5}[2]       # initialize polarity (0=none,1=JW-pol,2=JW-neg,3=MOMS,4=MOMS-neg,5=rnd)

This will continue to fail as long as the bug is there.

@hvarfner
Contributor Author

hvarfner commented Mar 2, 2022

Okay, I guess we're getting closer - it seems to me that the final test passed, right? For the remaining failing tests and PCS files, I would like some input on what the actual issue is, and what to do with the categorical HPs in the PCS format. Opinions?

Contributor

@mfeurer mfeurer left a comment


The tests look good, and we can take care of the failing doctest after this PR is merged. As a last step I would suggest replacing the list of weights with a tuple of weights, because tuples are immutable.

@mfeurer mfeurer merged commit 208136d into automl:master Mar 2, 2022
mfeurer added a commit that referenced this pull request Mar 2, 2022
mfeurer added a commit that referenced this pull request Mar 3, 2022
mfeurer added a commit that referenced this pull request Mar 7, 2022
* Fix bug introduced by #221: now, PCS can be serialized again if no weight was given

* Doctest

* Bump version number of the json writer

* Apply suggestions from code review

Co-authored-by: Eddie Bergman <eddiebergmanhs@gmail.com>

Co-authored-by: Eddie Bergman <eddiebergmanhs@gmail.com>
github-actions bot pushed a commit that referenced this pull request Mar 7, 2022