Enhance SBXCrossover by hrntsm · Pull Request #6008 · optuna/optuna

hrntsm · 2025-03-12T09:50:48Z

Motivation

I would like to reduce the difference in the distribution of children between the reference paper and optuna's SBX as shown in this discussion

Description of the changes

The fixed values of “establishment” and “probability” are used as arguments to reproduce the same distribution as in the paper.

I have two questions.

Would this change be better for Optunahub?
Although establishment and probability are names that originally appeared in comments in the code, I think that expressions such as variable_crossover_prob, for example, would be easier to understand.

… constructor

c-bata · 2025-03-17T01:58:54Z

@sawa3030 @y0z @HideakiImamura Could you review this PR?

sawa3030 · 2025-03-19T08:01:10Z

@hrntsm Thank you for your PR! I’ve left some comments—please take a look when you have time!

hrntsm · 2025-03-20T04:49:07Z

@sawa3030
Thanks for the review.
There does not appear to be any comments specifically attached, where can I check?

sawa3030 · 2025-03-19T07:47:18Z

optuna/samplers/nsgaii/_crossovers/_sbx.py

-                else:
-                    child_params_list.append(c2_i)
+            if rng.rand() < self._establishment:
+                options = (c1_i, c2_i) if rng.rand() < self._probability else (c2_i, c1_i)


Suggested change

options = (c1_i, c2_i) if rng.rand() < self._probability else (c2_i, c1_i)

if index_prob < self._probability:

child_params_list.append(c1_i)

else:

child_params_list.append(c2_i)

The variable options and the function select_parameter seems unnecessary here since we could directly append c1_i or c2_i based on index_prob.

@sawa3030

Once I accepted the suggestion and committed. But it did not work, so I am commenting on it.

The proposed method would not result in the combination of individuals I thought it would unless I branched out with each of the following probabilities.

for c1_i, c2_i, x1_i, x2_i in zip(c1, c2, parents_params[0], parents_params[1]): if rng.rand() < self._probability: if rng.rand() < self._establishment: if index_prob < 0.5: child_params_list.append(c1_i) else: child_params_list.append(c2_i) else: if index_prob < 0.5: child_params_list.append(c2_i) else: child_params_list.append(c1_i) else: if rng.rand() < self._establishment: if index_prob < 0.5: child_params_list.append(x1_i) else: child_params_list.append(x2_i) else: if index_prob < 0.5: child_params_list.append(x2_i) else: child_params_list.append(x1_i)

Instead, I believe that it would be better to create the combination as it is now and then determine whether to select one or the other created after the combination is created, which would reduce duplication of code.

Based on my understanding, the code presented in #6008 (comment) is equivalent to the code below.
Then, establishment is converted to 1 - establishment with probability 0.5 (i.e., index_prob <= 0.5).
How can I interpret this behavior? I feel that it is not intuitive to understand it from the description in the docstring " establishment is the probability of uniform crossover between two individuals selected as candidate child individuals."

index_prob = rng.rand() child_params_list = [] establishment = self._establishment if index_prob >= 0.5 else 1 - self._establishment for c1_i, c2_i, x1_i, x2_i in zip(c1, c2, parents_params[0], parents_params[1]): if rng.rand() < self._probability: if rng.rand() < establishment: child_params_list.append(c1_i) else: child_params_list.append(c2_i) else: if rng.rand() < establishment: child_params_list.append(x1_i) else: child_params_list.append(x2_i) child_params = np.array(child_params_list)

@y0z
As you point out, this implementation is confusing because it is simultaneously doing the change for each variable and choosing between the two candidate child individuals.
The final behavior is the same as pointed out, but it would be easier to understand the intent if it were reimplemented as follows.

Generate two children c1, c2 based on equation(4) & (5) in SBX paper.

Check that the generated children's variable will be used.(_probability)

Check that uniform crossover applies each children's variable.(_establishment)

This is repeated for each variable to create two candidate child individuals.(child1_params_list, child2_params_list)

Finally, Optuna can only return one individual, so choose one of them.

Equivalent to index_prob but used only once, so omitted.

child1_params_list = [] child2_params_list = [] for c1_i, c2_i, x1_i, x2_i in zip(c1, c2, parents_params[0], parents_params[1]): if rng.rand() < self._probability: # If the probability of "applying" uniform crossover is `_establishment`, then the reverse of the equal sign may be more appropriate. if rng.rand() < self._establishment: child1_params_list.append(c1_i) child2_params_list.append(c2_i) else: child1_params_list.append(c2_i) child2_params_list.append(c1_i) else: if rng.rand() < self._establishment: child1_params_list.append(x1_i) child2_params_list.append(x2_i) else: child1_params_list.append(x2_i) child2_params_list.append(x1_i) child_params_list = child1_params_list if rng.rand() < 0.5 else child2_params_list child_params = np.array(child_params_list)

Thank you! It is more understandable!
It also sounds good to rename the establishment uniform_crossover_prob as suggested.

optuna/samplers/nsgaii/_crossovers/_sbx.py

sawa3030 · 2025-03-20T05:19:58Z

@hrntsm Sorry about that! Can you check them now?

HideakiImamura

Thanks for the PR. I have two comments. PTAL.

optuna/samplers/nsgaii/_crossovers/_sbx.py

tests/samplers_tests/test_nsgaii.py

y0z · 2025-03-21T08:00:13Z

@hrntsm Thank you for your contribution.

I would like to share my opinion.

SBXCrossOver generates two children in the original. However, Optuna's Operator must generate only one child. Therefore, the existing implementation generates a child that is a mix of the two children. To get closer to the original implementation, it seems to be better to randomly generate one of the two children.

Also, how about changing the default value of eta to 1 for single-objective optimization? Currently, the setting is 2, but I couldn't find a reason for it.
https://github.com/optuna/optuna/blob/master/optuna/samplers/nsgaii/_crossovers/_sbx.py#L57

I tried such an implementation and it seems to work well.

FYI: I also fixed vSBXCrossOver (#6000 (reply in thread)).

    def crossover(
        self,
        parents_params: np.ndarray,
        rng: np.random.RandomState,
        study: Study,
        search_space_bounds: np.ndarray,
    ) -> np.ndarray:
        # https://www.researchgate.net/profile/M-M-Raghuwanshi/publication/267198495_Simulated_Binary_Crossover_with_Lognormal_Distribution/links/5576c78408ae7536375205d7/Simulated-Binary-Crossover-with-Lognormal-Distribution.pdf
        # Section 2 Simulated Binary Crossover (SBX)

        # To avoid generating solutions that violate the box constraints,
        # alpha1, alpha2, xls and xus are introduced, unlike the reference.
        xls = search_space_bounds[..., 0]
        xus = search_space_bounds[..., 1]

        xs_min = np.min(parents_params, axis=0)
        xs_max = np.max(parents_params, axis=0)
        if self._eta is None:
            eta = 20.0 if study._is_multi_objective() else 1.0  # Suggestion 2: change default eta to 1.0
        else:
            eta = self._eta

        xs_diff = np.clip(xs_max - xs_min, 1e-10, None)
        beta1 = 1 + 2 * (xs_min - xls) / xs_diff
        beta2 = 1 + 2 * (xus - xs_max) / xs_diff
        alpha1 = 2 - np.power(beta1, -(eta + 1))
        alpha2 = 2 - np.power(beta2, -(eta + 1))

        us = rng.rand(len(search_space_bounds))
        mask1 = us > 1 / alpha1  # Equation (3).
        betaq1 = np.power(us * alpha1, 1 / (eta + 1))  # Equation (3).
        betaq1[mask1] = np.power((1 / (2 - us * alpha1)), 1 / (eta + 1))[mask1]  # Equation (3).

        mask2 = us > 1 / alpha2  # Equation (3).
        betaq2 = np.power(us * alpha2, 1 / (eta + 1))  # Equation (3)
        betaq2[mask2] = np.power((1 / (2 - us * alpha2)), 1 / (eta + 1))[mask2]  # Equation (3).

        c1 = 0.5 * ((xs_min + xs_max) - betaq1 * xs_diff)  # Equation (4).
        c2 = 0.5 * ((xs_min + xs_max) + betaq2 * xs_diff)  # Equation (5).

        # SBX applies crossover with establishment 0.5, and with probability 0.5,
        # the gene of the parent individual is the gene of the child individual.
        # The original SBX creates two child individuals,
        # but optuna's implementation creates only one child individual.
        # Therefore, when there is no crossover,
        # the gene is selected with equal probability from the parent individuals x1 and x2.

        # child_params_list = []
        # for c1_i, c2_i, x1_i, x2_i in zip(c1, c2, parents_params[0], parents_params[1]):
        #     if rng.rand() < 0.5:
        #         if rng.rand() < 0.5:
        #             child_params_list.append(c1_i)
        #         else:
        #             child_params_list.append(c2_i)
        #     else:
        #         if rng.rand() < 0.5:
        #             child_params_list.append(x1_i)
        #         else:
        #             child_params_list.append(x2_i)
        # child_params = np.array(child_params_list)

        return c1 if rng.rand() < 0.5 else c2  # Suggestion 1: return one of the two children randomly.

hrntsm · 2025-03-24T02:00:41Z

Also, how about changing the default value of eta to 1 for single-objective optimization? Currently, the setting is 2, but I couldn't find a reason for it. https://github.com/optuna/optuna/blob/master/optuna/samplers/nsgaii/_crossovers/_sbx.py#L57

Here, in K.Deb's SBX paper below, it is mentioned that n=2~5 is close to a single-point crossover in equation 19. (In this paper, n is the equivalent of eta.)
There is no direct mention of good or bad, but I am wondering if it is safe to leave it at 2 for continuity with previous implementations.

Simulated Binary Crossover for Continuous Search Space

This point is subject to the judgment of OptunaTeam.

HideakiImamura · 2025-03-25T01:45:57Z

This is not strong opinion, but I think it is reasonable to leave the value of eta as it is (i.e., 2) since it is suggested to set the value in the range of [2, 5] according to the Simulated Binary Crossover for Continuous Search Space.

y0z · 2025-03-25T04:43:02Z

Thanks for providing the reference.

I think it is reasonable to leave the value of eta as it is (i.e., 2)

+1

hrntsm · 2025-03-25T06:51:38Z

I have been looking into which individuals to return. I have looked at several papers, but I feel that they only mention returning two generated individuals c1 and c2, as y0z mentioned.

On the other hand, I checked the implementation of some optimization libraries.

lib name	comment
DEAP	The only variable is eta, and the two individuals generated are returned as is.
pymoo	The variables are eta, prob_bin(establishment), prob_var(probability), similar to the implementation as in this PR.
jMetal	Same as the current implementation of Optuna, with establishment and probability fixed at 0.5
PlatEMO	The only variable is eta, and the two individuals generated are returned.

After all, the two arguments other than eta were not specified in any of the papers cited in any of the implementations. Please let me know if I am missing something.

As I understand, establishment is the probability of binary crossover (uniform crossover) among 2 parents and 2 children, probability is the probability of applying the SBX crossover (1 - probability of using the parent values as it is)

This is my opinion, but considering the current implementation of Optuna and the reproduction of the paper, I think the arguments should be eta, establishment, and probability.
On the other hand, I thought we need to discuss what the default values should be.

I am thinking either of the following two.

The value as it is in the current implimentation (eta=2, establishment=0.5, probability=0.5)
A value that reproduces the distribution of the paper (eta=2, establishment=0.0, probability=1.0)

y0z · 2025-03-31T05:01:05Z

@hrntsm

Thank you for your survey.

Based on the discussions, introducing establishment and probability parameters with default values that are the same as in the current implementation, i.e., not changing the current Optuna behavior, seems good.

hrntsm · 2025-03-31T06:02:01Z

@sawa3030 @HideakiImamura
Thank you for all your comments!

The all checks passed, Could you please review?.
The implementation is as follows. The default has been the same result so far.

default(eta=2,establishment=probability=0.5)	reproduct ref paper(eta=1,establishment=0.0,probability=1)

optuna/samplers/nsgaii/_crossovers/_sbx.py

sawa3030 · 2025-04-02T00:14:50Z

Since a probability of 0 means the child is never selected, I think it's better to ensure that 0 < _probability ≤ 1.

sawa3030 · 2025-04-03T05:06:27Z

@hrntsm Thank you for your updates! I have a few suggestions and clarifications I'd like to share:

As discussed in this comment, the implementation of this PR behaves the same when _establishment is set to 0 or 1, which suggests that the index_prob variable may be unnecessary. I suggest simplifying the logic as shown below. This approach preserves the current Optuna behavior, as noted here, and could also align with the distribution described in the original paper, as discussed here.

for c1_i, c2_i, x1_i, x2_i in zip(c1, c2, parents_params[0], parents_params[1]):
    if rng.rand() < self._probability:
        if rng.rand() < self._establishment:
            child_params_list.append(c1_i)
        else:
            child_params_list.append(c2_i)
    else:
        if rng.rand() < self._establishment:
            child_params_list.append(x1_i)
        else:
            child_params_list.append(x2_i)

In both the current implementation and the code above, c1 is implicitly associated with x1, and c2 with x2. To be more precise, it might be better to decouple these associations and consider introducing separate control variable for selecting between the two child candidates (c1 and c2) and between the two parent values (x1 and x2).
As pointed out in the description of this PR, the variable name establishment doesn't clearly reflect its actual role in the crossover operation. I would suggest renaming it to uniform_crossover_prob, and likewise renaming probability to use_child_gene_prob to better communicate their intent. Of course, these are just suggestions — I'm happy to discuss other naming ideas if anyone has alternatives.

y0z · 2025-04-03T06:02:20Z

@sawa3030

The code in this PR and the one you suggested seem to have different behavior.
With establishment=0.1 and probability=0.5, we find the difference.
So, this change is not suitable for preserving the current behavior.

As discussed in #6008 (comment), the implementation of this PR behaves the same when _establishment is set to 0 or 1, which suggests that the index_prob variable may be unnecessary. I suggest simplifying the logic as shown below. This approach preserves the current Optuna behavior, as noted #6008 (comment), and could also align with the distribution described in the original paper, as discussed #6000 (reply in thread).
for c1_i, c2_i, x1_i, x2_i in zip(c1, c2, parents_params[0], parents_params[1]):
   if rng.rand() < self._probability:
       if rng.rand() < self._establishment:
           child_params_list.append(c1_i)
       else:
           child_params_list.append(c2_i)
   else:
       if rng.rand() < self._establishment:
           child_params_list.append(x1_i)
       else:
           child_params_list.append(x2_i)

hrntsm · 2025-04-03T07:02:01Z

@sawa3030

As for 1 & 2, please check #6008 (comment) for a summary of the intent of the implementation.

As for 3(the variable name), it more accurately describes the behavior than the one I gave at the beginning, and I agree with your suggestion.

hrntsm · 2025-04-03T09:11:24Z

Based on the comments, I have modified the docstring and implementation for clarity.

optuna/samplers/nsgaii/_crossovers/_sbx.py

Co-authored-by: Yoshihiko Ozaki <30489874+y0z@users.noreply.github.com>

y0z

Thank you for your careful discussion.
LGTM.

sawa3030 · 2025-04-04T04:33:34Z

Thank you so much for your clear classification and explanation — it really helped me understand the logic.
I have one more question: since the behavior is essentially the same when uniform_crossover_prob is set to 0 or 1, and it’s a bit unclear how users should choose the value, would it make sense to restrict the range to [0.0, 0.5]? This isn’t a strong preference, as values greater than 0.5 don’t cause any issues in practice.

hrntsm · 2025-04-04T06:03:42Z

@sawa3030

would it make sense to restrict the range to [0.0, 0.5]?

I once thought the same thing, but the “probability” range of [0, 0.5] seemed counter-intuitive, so I decided to use [0,1].
I will add followings to the docstring to make it more understandable.

If the uniform_crossover_prob exceeds 0.5, the result is equivalent to 1-uniform_crossover_prob, because it returns one of the two individuals of the crossover result.

sawa3030

Thank you for all the discussions and clarifications. LGTM!

hrntsm added 2 commits March 12, 2025 18:23

enhance SBXCrossover: add establishment and probability parameters to…

138e42c

… constructor

fix expected output for SBXCrossover test cases

427c2ac

c-bata assigned y0z and HideakiImamura Mar 17, 2025

c-bata added the feature Change that does not break compatibility, but affects the public interfaces. label Mar 17, 2025

sawa3030 reviewed Mar 20, 2025

View reviewed changes

address comments

f75b91c

HideakiImamura reviewed Mar 21, 2025

View reviewed changes

optuna/samplers/nsgaii/_crossovers/_sbx.py Outdated Show resolved Hide resolved

tests/samplers_tests/test_nsgaii.py Outdated Show resolved Hide resolved

address comments

8f96217

HideakiImamura reviewed Apr 1, 2025

View reviewed changes

optuna/samplers/nsgaii/_crossovers/_sbx.py Outdated Show resolved Hide resolved

HideakiImamura reviewed Apr 1, 2025

View reviewed changes

optuna/samplers/nsgaii/_crossovers/_sbx.py Outdated Show resolved Hide resolved

fix sbx docstring

bc9669f

Make implementation and docstring more clear

58fea87

y0z reviewed Apr 3, 2025

View reviewed changes

optuna/samplers/nsgaii/_crossovers/_sbx.py Outdated Show resolved Hide resolved

fix typo

004f5c7

Co-authored-by: Yoshihiko Ozaki <30489874+y0z@users.noreply.github.com>

y0z approved these changes Apr 3, 2025

View reviewed changes

y0z removed their assignment Apr 3, 2025

update docstring

acb810a

sawa3030 approved these changes Apr 4, 2025

View reviewed changes

y0z merged commit 19296c3 into optuna:master Apr 7, 2025
14 checks passed

y0z unassigned HideakiImamura Apr 7, 2025

y0z added this to the v4.3.0 milestone Apr 7, 2025

hrntsm deleted the update-sbx branch April 7, 2025 03:50

hrntsm mentioned this pull request Apr 7, 2025

Update vsbx #6033

Merged

Uh oh!

Conversation

hrntsm commented Mar 12, 2025

Motivation

Description of the changes

Uh oh!

c-bata commented Mar 17, 2025

Uh oh!

sawa3030 commented Mar 19, 2025

Uh oh!

hrntsm commented Mar 20, 2025

Uh oh!

sawa3030 Mar 19, 2025

Choose a reason for hiding this comment

Uh oh!

hrntsm Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

y0z Apr 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hrntsm Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

y0z Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sawa3030 commented Mar 20, 2025

Uh oh!

HideakiImamura left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

y0z commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hrntsm commented Mar 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HideakiImamura commented Mar 25, 2025

Uh oh!

y0z commented Mar 25, 2025

Uh oh!

hrntsm commented Mar 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

y0z commented Mar 31, 2025

Uh oh!

hrntsm commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sawa3030 commented Apr 2, 2025

Uh oh!

sawa3030 commented Apr 3, 2025

Uh oh!

y0z commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hrntsm commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hrntsm commented Apr 3, 2025

Uh oh!

Uh oh!

y0z left a comment

Choose a reason for hiding this comment

Uh oh!

sawa3030 commented Apr 4, 2025

Uh oh!

hrntsm commented Apr 4, 2025

Uh oh!

sawa3030 left a comment

hrntsm Mar 21, 2025 •

edited

Loading

y0z Apr 1, 2025 •

edited

Loading

hrntsm Apr 3, 2025 •

edited

Loading

y0z commented Mar 21, 2025 •

edited

Loading

hrntsm commented Mar 24, 2025 •

edited

Loading

hrntsm commented Mar 25, 2025 •

edited

Loading

hrntsm commented Mar 31, 2025 •

edited

Loading

y0z commented Apr 3, 2025 •

edited

Loading

hrntsm commented Apr 3, 2025 •

edited

Loading