Fix a minor bug in GPSampler for objective that returns `inf` by nabenabe0928 · Pull Request #5995 · optuna/optuna

nabenabe0928 · 2025-02-27T03:01:52Z

Motivation

This PR fixes a minor bug in GPSampler.
The bug of my interest was embedded here:

84502dc#diff-fe893dd86f2d66f2d847ab31510f07079718c70f104067a431616cd4117214b8

In principle, this change does not affect most users because this statement is triggered only if a user-defined objective function returns non-finite values.

The bug was originally introduced because of the following misunderstanding:

whether to give the initial argument in np.max and np.min does not affect the result as long as the array of interest is not empty.

However, this is not actually true.
For example, np.max([1], initial=10) gives us 10 instead of 1, which we would like to yield in fact.

To this end, I address this issue by modifying the corresponding part.
Note that emmr also needs to be fixed due to the same issue.

Description of the changes

Address the issue stated above

HideakiImamura · 2025-02-28T08:50:27Z

@c-bata @gen740 @sawa3030 Could you review this PR?

sawa3030 · 2025-03-06T01:37:58Z

I believe the previous logic computed the maximum over all elements in values, whereas the new implementation takes the maximum along axis=0. When values.shape = (n, m) with m >= 2, this change affects the return values. For example, if values = [[inf, 10], [inf, 10]], the previous implementation would return [[10, 10], [10, 10]], while the new implementation returns [[0, 10], [0, 10]]. Here, consstraint_vals can have multiple columns when multiple constraints are provided, which I think may lead to unintended changes in behavior.

nabenabe0928 · 2025-03-06T01:47:26Z

Verification Code

import numpy as np


def original(values: np.ndarray) -> tuple[np.ndarray, np.ndarray]:
    finite_vals = values[np.isfinite(values)]
    best_finite_val = np.max(finite_vals, axis=0, initial=0.0)
    worst_finite_val = np.min(finite_vals, axis=0, initial=0.0)
    return best_finite_val, worst_finite_val


def this_pr(values: np.ndarray) -> tuple[np.ndarray, np.ndarray]:
    finite_vals_with_nan = np.where(np.isfinite(values), values, np.nan)
    is_any_finite = np.any(np.isfinite(finite_vals_with_nan), axis=0)
    best_finite_vals = np.where(is_any_finite, np.nanmax(finite_vals_with_nan, axis=0), 0.0)
    worst_finite_vals = np.where(is_any_finite, np.nanmin(finite_vals_with_nan, axis=0), 0.0)
    return best_finite_vals, worst_finite_vals

nabenabe0928 · 2025-03-06T02:01:12Z

I believe the previous logic computed the maximum over all elements in values, whereas the new implementation takes the maximum along axis=0. When values.shape = (n, m) with m >= 2, this change affects the return values. For example, if values = [[inf, 10], [inf, 10]], the previous implementation would return [[10, 10], [10, 10]], while the new implementation returns [[0, 10], [0, 10]]. Here, consstraint_vals can have multiple columns when multiple constraints are provided, which I think may lead to unintended changes in behavior.

@sawa3030
In fact, this is a good point, and it seems the original implementation embedded the bug you pointed out.

First of all, we expect the shapes of best_finite_vals and worst_finite_vals to be (m, ) while we actually get the shape of (1, ), which is already incorrect based on the verification code:

Fix a minor bug in GPSampler for objective that returns inf #5995 (comment)

So we should probably add some unit tests for this part.

But again, this routine isn't really executed for most use cases, so negative user impacts are very limited.

nabenabe0928 · 2025-03-06T02:35:07Z

@sawa3030
I added some unit tests to ensure the code behavior, PTAL")

sawa3030 · 2025-03-06T10:00:09Z

Thank you for the explanation and for adding the unit tests. LGTM

c-bata

@nabenabe0928 Changes look almost good to me. I left one comment though.

c-bata · 2025-03-11T08:33:19Z

tests/gp_tests/test_gp.py

+@pytest.mark.parametrize(
+    "values,ans",
+    [
+        (np.array([-1, 0, 1]), np.array([-1, 0, 1])),
+        (np.array([-1, -np.inf, 0, np.inf, 1]), np.array([-1, -1, 0, 1, 1])),
+        (np.array([[-1, 2], [0, -2], [1, 0]]), np.array([[-1, 2], [0, -2], [1, 0]])),
+        (
+            np.array([[-1, 2], [-np.inf, np.inf], [0, -np.inf], [np.inf, -2], [1, 0]]),
+            np.array([[-1, 2], [-1, 2], [0, -2], [1, -2], [1, 0]]),
+        ),
+        (
+            np.array(
+                [
+                    [-100, np.inf, 10],
+                    [-np.inf, np.inf, 100],
+                    [-10, -np.inf, np.inf],
+                    [np.inf, np.inf, -np.inf],
+                ]
+            ),
+            np.array([[-100, 0, 10], [-100, 0, 100], [-10, 0, 100], [-10, 0, 10]]),
+        ),
+        (np.array([-np.inf, np.inf]), np.array([0, 0])),
+        (np.array([]), np.array([])),
+    ],
+)
+def test_warn_and_convert_inf(values: np.ndarray, ans: np.ndarray) -> None:
+    assert np.allclose(warn_and_convert_inf(values), ans)
+    if len(values.shape) == 1:
+        # Test also with the shape of (n, 1) to ensure the batched version.
+        assert np.allclose(warn_and_convert_inf(values[:, np.newaxis]), ans[:, np.newaxis])


This test code would be more readable if it were split into two test cases: one for single-dimensional arrays and another for two-dimensional arrays. What do you think?

I addressed your comment, PTAL!

c-bata

LGTM after CI passes!

gen740

LGTM!

nabenabe0928 added 2 commits February 27, 2025 03:55

Fix a bug in GPSampler

324f1f6

Apply formatter

d56031f

nabenabe0928 added the bug Issue/PR about behavior that is broken. Not for typos/examples/CI/test but for Optuna itself. label Feb 28, 2025

nabenabe0928 added this to the v4.3.0 milestone Feb 28, 2025

nabenabe0928 changed the title ~~Fix a bug in GPSampler~~ Fix a minor bug in GPSampler for objective that returns inf Feb 28, 2025

HideakiImamura assigned c-bata and gen740 Feb 28, 2025

nabenabe0928 added 3 commits March 6, 2025 03:13

Share the convert function

a3ada16

Refactor

26db483

Add the unit tests for convert inf

b27d1b6

nabenabe0928 added 2 commits March 6, 2025 04:39

Refactor

e82b8b0

Refactor

dd5c659

c-bata reviewed Mar 11, 2025

View reviewed changes

nabenabe0928 added 2 commits March 11, 2025 09:42

Address c-bata's comment

2cfb417

Apply formatter

c51ce28

c-bata approved these changes Mar 11, 2025

View reviewed changes

c-bata removed their assignment Mar 13, 2025

gen740 approved these changes Mar 14, 2025

View reviewed changes

gen740 merged commit 63179d9 into optuna:master Mar 14, 2025
14 checks passed

gen740 removed their assignment Jun 3, 2025

nabenabe0928 mentioned this pull request Oct 16, 2025

Developments of Gaussian-Process Based Bayesian Optimization (GPSampler) nabenabe0928/my-skills#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix a minor bug in GPSampler for objective that returns `inf`#5995

Fix a minor bug in GPSampler for objective that returns `inf`#5995
gen740 merged 9 commits intooptuna:masterfrom
nabenabe0928:debug-inf-filter-in-gp

nabenabe0928 commented Feb 27, 2025 •

edited

Loading

Uh oh!

HideakiImamura commented Feb 28, 2025

Uh oh!

sawa3030 commented Mar 6, 2025

Uh oh!

nabenabe0928 commented Mar 6, 2025

Uh oh!

nabenabe0928 commented Mar 6, 2025 •

edited

Loading

Uh oh!

nabenabe0928 commented Mar 6, 2025

Uh oh!

sawa3030 commented Mar 6, 2025

Uh oh!

c-bata left a comment

Uh oh!

c-bata Mar 11, 2025

Uh oh!

nabenabe0928 Mar 11, 2025

Uh oh!

c-bata left a comment

Uh oh!

gen740 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Conversation

nabenabe0928 commented Feb 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Description of the changes

Uh oh!

HideakiImamura commented Feb 28, 2025

Uh oh!

sawa3030 commented Mar 6, 2025

Uh oh!

nabenabe0928 commented Mar 6, 2025

Uh oh!

nabenabe0928 commented Mar 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nabenabe0928 commented Mar 6, 2025

Uh oh!

sawa3030 commented Mar 6, 2025

Uh oh!

c-bata left a comment

Choose a reason for hiding this comment

Uh oh!

c-bata Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

nabenabe0928 Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

c-bata left a comment

Choose a reason for hiding this comment

Uh oh!

gen740 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

nabenabe0928 commented Feb 27, 2025 •

edited

Loading

nabenabe0928 commented Mar 6, 2025 •

edited

Loading