Fix log PDF of discrete trunc log-norm distribution for `TPESampler` by not522 · Pull Request #6258 · optuna/optuna

not522 · 2025-08-25T07:08:51Z

Motivation

TPESampler handles suggest_int(..., log=True) inconsistently. During sampling, the Parzen estimator uses the log-normal distribution's mass over the interval around each integer, but for the log PDF it uses the log-normal distribution's value at the sampled point. This PR fixes the behavior so that the log PDF also uses the interval.
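To make the distinction concrete, here is a minimal sketch, not Optuna's implementation, that contrasts the two quantities for a single truncated log-normal kernel. The function names `interval_log_pmf` and `point_log_density` are hypothetical, and the sketch assumes the integer x is associated with the interval [x - 0.5, x + 0.5], with the truncation bounds at low - 0.5 and high + 0.5.

```python
import math


def _norm_cdf(z: float) -> float:
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def _log_space_cdf(v: float, mu: float, sigma: float) -> float:
    """CDF of N(mu, sigma) evaluated at log(v)."""
    return _norm_cdf((math.log(v) - mu) / sigma)


def interval_log_pmf(x: int, mu: float, sigma: float, low: int, high: int) -> float:
    """Log of the mass on [x - 0.5, x + 0.5], renormalized to the truncated range."""
    total = _log_space_cdf(high + 0.5, mu, sigma) - _log_space_cdf(low - 0.5, mu, sigma)
    mass = _log_space_cdf(x + 0.5, mu, sigma) - _log_space_cdf(x - 0.5, mu, sigma)
    return math.log(mass) - math.log(total)


def point_log_density(x: int, mu: float, sigma: float, low: int, high: int) -> float:
    """Log of the truncated normal density evaluated at the single point log(x)."""
    total = _log_space_cdf(high + 0.5, mu, sigma) - _log_space_cdf(low - 0.5, mu, sigma)
    z = (math.log(x) - mu) / sigma
    return -0.5 * z * z - math.log(sigma * math.sqrt(2.0 * math.pi)) - math.log(total)


# Illustrative parameters only (assumed, not taken from the PR).
mu, sigma, low, high = math.log(10), 1.0, 1, 100
for x in (1, 10, 100):
    print(x, interval_log_pmf(x, mu, sigma, low, high), point_log_density(x, mu, sigma, low, high))
```

The interval masses sum to one over the integers from low to high, whereas the point densities do not. The width of the interval in log space, log((x + 0.5) / (x - 0.5)), shrinks roughly as 1/x, so the point density gives large integers relatively more weight than the interval mass does.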

Description of the changes

The first commit (4ede700) introduced _BatchedTruncLogNormDistributions and _BatchedDiscreteTruncLogNormDistributions to clarify the behavior of each distribution. It is a refactoring and does not change the behavior.
The second commit (a5ab97c) fixed the log PDF of _BatchedDiscreteTruncLogNormDistributions.

Using the code below, I confirmed that the optimization performance remained nearly unchanged.

import numpy as np
import matplotlib.pyplot as plt
import optuna

def objective(trial):
    params = np.empty(5)
    for i in range(5):
        params[i] = trial.suggest_int(f"x{i}", 1, 100, log=True)
    return np.sum((np.log(params) - np.log(10)) ** 2)

N_STUDIES = 100
N_TRIALS = 100

y = [[] for _ in range(N_STUDIES)]
for seed in range(N_STUDIES):
    sampler = optuna.samplers.TPESampler(seed=seed)
    study = optuna.create_study(sampler=sampler)
    study.optimize(objective, n_trials=N_TRIALS)
    y[seed].append(study.trials[0].value)
    for i in range(1, N_TRIALS):
        y[seed].append(min(y[seed][-1], study.trials[i].value))

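y = np.asarray(y)
# Run this part once on master and once on the PR branch,
# switching the output file name between the two runs.
np.save("master.npy", y)
# np.save("pr.npy", y)

# Plot once both master.npy and pr.npy exist.
y = np.load("master.npy")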
mean = np.mean(y, axis=0)
std = np.std(y, axis=0)
plt.fill_between(range(N_TRIALS), mean + std, mean - std, alpha=0.1)
plt.plot(range(N_TRIALS), mean, label="master")

y = np.load("pr.npy")
mean = np.mean(y, axis=0)
std = np.std(y, axis=0)
plt.fill_between(range(N_TRIALS), mean + std, mean - std, alpha=0.1)
plt.plot(range(N_TRIALS), mean, label="fixed")

plt.xlabel("trial number")
plt.ylabel("loss")
plt.legend()
plt.savefig("logint.png")
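The inner loop that builds the best-so-far curve is a running minimum. A minimal sketch using only the standard library; the helper name `best_so_far` is hypothetical and is not part of the script above or of Optuna:

```python
from itertools import accumulate


def best_so_far(values):
    """Return the running minimum of a sequence of objective values."""
    return list(accumulate(values, min))


print(best_so_far([3.0, 5.0, 2.0, 4.0, 1.0]))  # [3.0, 3.0, 2.0, 2.0, 1.0]
```

For NumPy arrays, `np.minimum.accumulate(values)` computes the same curve.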

not522 · 2025-08-26T07:42:37Z

After examining the behavior of TPESampler, I discovered that suggest_int(..., log=True) exhibits a bias toward sampling smaller values. To properly address this issue, I will temporarily convert this PR to draft status.

not522 · 2025-08-27T04:15:58Z

This PR introduces the following behavioral changes, and I've confirmed that the results may vary depending on the objective function.

The original implementation tends to sample larger values.
This PR makes sigmas greater.

While there may be room for discussion about how the probability distributions should be determined, I think this PR can be treated as a bug fix.

====

# Variant 1: 5 parameters, optimum at the lower bound (x = 1).
def objective(trial):
    params = np.empty(5)
    for i in range(5):
        params[i] = trial.suggest_int(f"x{i}", 1, 100, log=True)
    return np.sum((np.log(params) - np.log(1)) ** 2)

# Variant 2: 20 parameters, optimum at x = 10.
def objective(trial):
    params = np.empty(20)
    for i in range(20):
        params[i] = trial.suggest_int(f"x{i}", 1, 100, log=True)
    return np.sum((np.log(params) - np.log(10)) ** 2)

github-actions · 2025-09-04T23:06:11Z

This pull request has not seen any recent activity.

github-actions · 2025-09-14T23:05:29Z

This pull request has not seen any recent activity.

github-actions · 2025-09-22T23:05:56Z

This pull request has not seen any recent activity.

github-actions · 2025-10-02T23:06:22Z

This pull request has not seen any recent activity.

github-actions · 2025-10-12T23:05:31Z

This pull request has not seen any recent activity.

github-actions · 2025-10-21T23:06:15Z

This pull request has not seen any recent activity.

kAIto47802 · 2025-10-22T10:56:16Z

optuna/samplers/_tpe/probability_distributions.py

+        lows_cont = []
+        highs_cont = []

Suggested change

-        lows_cont = []
-        highs_cont = []
+        lows_num, highs_num = [], []

How about renaming these to "num"? Using "cont" for both continuous and discrete distributions is a bit misleading.

How about numeric? (num might be confusing with number)

Thank you for your response.
That's right. numeric seems to be better.

Suggested change

-        lows_cont = []
-        highs_cont = []
+        lows_number, highs_number = [], []

Thank you. I've updated it.

kAIto47802

Thank you for the PR!
It's almost LGTM; I've left only a minor comment.

kAIto47802

Thank you for the update! LGTM

github-actions · 2025-11-09T23:05:50Z

This pull request has not seen any recent activity.

not522 · 2025-11-12T04:29:57Z

@y0z Could you review this PR?

y0z

LGTM

not522 added the bug label Aug 25, 2025

nabenabe0928 self-assigned this Aug 25, 2025

nabenabe0928 added this to the v4.6.0 milestone Aug 25, 2025

not522 force-pushed the fix-tpe-logint branch from 5d76070 to 03a1dd1 Compare August 25, 2025 07:21

not522 added 2 commits August 25, 2025 16:24

Introduce log-norm distributions

4ede700

Fix log PDF of discrete trunc log-norm distribution

a5ab97c

not522 force-pushed the fix-tpe-logint branch from 03a1dd1 to a5ab97c Compare August 25, 2025 07:25

Fix test_init_parzen_estimator

cca5658

not522 marked this pull request as draft August 26, 2025 07:42

not522 marked this pull request as ready for review August 27, 2025 04:16

nabenabe0928 assigned kAIto47802 Sep 2, 2025

github-actions bot added the stale label Sep 4, 2025

nabenabe0928 removed the stale label Sep 5, 2025

github-actions bot added the stale label Sep 14, 2025

nabenabe0928 removed the stale label Sep 15, 2025

github-actions bot added the stale label Sep 22, 2025

nabenabe0928 removed the stale label Sep 25, 2025

github-actions bot added the stale label Oct 2, 2025

nabenabe0928 removed the stale label Oct 3, 2025

github-actions bot added the stale label Oct 12, 2025

nabenabe0928 removed the stale label Oct 14, 2025

github-actions bot added the stale label Oct 21, 2025

c-bata removed the stale label Oct 22, 2025

kAIto47802 reviewed Oct 22, 2025

Merge branch 'optuna:master' into fix-tpe-logint

48ccbbf

kAIto47802 reviewed Oct 24, 2025

Rename lows_cont and highs_cont to lows_numeric and highs_numeric

af89bbf

y0z removed this from the v4.6.0 milestone Oct 30, 2025

kAIto47802 approved these changes Oct 31, 2025

kAIto47802 removed their assignment Oct 31, 2025

github-actions bot added the stale label Nov 9, 2025

c-bata removed the stale label Nov 12, 2025

not522 assigned y0z and unassigned nabenabe0928 Nov 12, 2025

y0z approved these changes Nov 17, 2025

y0z merged commit 0297d1b into optuna:master Nov 17, 2025
12 checks passed

y0z removed their assignment Nov 17, 2025

not522 deleted the fix-tpe-logint branch November 18, 2025 02:08

not522 added this to the v4.7.0 milestone Jan 14, 2026
