Vectorize `ndtri_exp` by nabenabe0928 · Pull Request #6229 · optuna/optuna

nabenabe0928 · 2025-08-01T09:43:16Z

Motivation

This PR vectorizes ndtri_exp to enable a future speedup.
In principle, TPESampler can be sped up further by vectorizing _truncnorm.rvs, which in turn requires a vectorized ndtri_exp.
This PR also improves numerical stability for large y, that is, log-probabilities close to zero.

Description of the changes

Replace math with numpy in ndtri_exp
Calculate -x first if x is positive, and then flip the sign later
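
The sign flip in the second item relies on the symmetry ndtri(p) = -ndtri(1 - p). The sketch below illustrates why working with the complementary log-probability helps when y is close to zero. It is not the PR's code: `log_complement` is a hypothetical helper name, and `statistics.NormalDist` is used only as an independent reference.

```python
import math
from statistics import NormalDist

import numpy as np


def log_complement(y: np.ndarray) -> np.ndarray:
    """Return log(1 - exp(y)) for y < 0; expm1 avoids cancellation near y = 0."""
    return np.log(-np.expm1(y))


y_tiny = np.array([-1e-20])  # log-probability extremely close to zero
with np.errstate(divide="ignore"):
    # exp(y) rounds to exactly 1.0, so the naive form collapses to log(0).
    naive = np.log(1.0 - np.exp(y_tiny))
stable = log_complement(y_tiny)  # stays finite, about log(1e-20)

# Symmetry behind the sign flip: ndtri(p) == -ndtri(1 - p).
std_normal = NormalDist()
p = 0.975
z = log_complement(np.array([math.log(p)]))[0]  # log(1 - p)
x_direct = std_normal.inv_cdf(p)
x_flipped = -std_normal.inv_cdf(math.exp(z))
```

Solving for the lower-tail quantile and negating it keeps the problem in the regime where the log-CDF is far from zero, which is where it carries the most floating-point resolution.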

nabenabe0928 · 2025-08-04T03:16:03Z

@kAIto47802 @sawa3030 Could you review this PR?

nabenabe0928 · 2025-08-04T03:18:40Z

Let me reassign this from @sawa3030 to @not522!

not522

LGTM! I confirmed the speed and the precision are improved.

====

import optuna

def objective(trial):
    x = trial.suggest_float("x", -10, 10)
    y = trial.suggest_float("y", -10, 10)
    z = trial.suggest_float("z", -10, 10, step=0.5)
    return x ** 2 + y ** 2 + z ** 2

sampler = optuna.samplers.TPESampler(seed=42)
study = optuna.create_study(sampler=sampler)
study.optimize(objective, n_trials=2000)

master: 7.81s
PR: 7.49s
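
The comment does not say how these timings were collected. One plausible way, shown here as a sketch with hypothetical helper names (`timed`, `relative_speedup`), is to wrap the call in a wall-clock timer and compare the two runs.

```python
import time
from typing import Callable


def timed(fn: Callable[[], object]) -> float:
    """Return the wall-clock seconds taken by a single call to fn."""
    start = time.perf_counter()
    fn()
    return time.perf_counter() - start


def relative_speedup(baseline_s: float, candidate_s: float) -> float:
    """Return the fraction of the baseline runtime saved by the candidate."""
    return (baseline_s - candidate_s) / baseline_s


# Stand-in workload; the real measurement would time the study instead.
elapsed = timed(lambda: sum(range(100_000)))

# The two timings quoted above: roughly a 4% reduction in runtime.
saving = relative_speedup(7.81, 7.49)
```

With Optuna installed, `timed(lambda: study.optimize(objective, n_trials=2000))` would time the benchmark itself. A single run is noisy, so repeating the measurement a few times would give a more reliable comparison.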

====

import math
import numpy as np
import matplotlib.pyplot as plt
import mpmath
import optuna


_norm_pdf_C = math.sqrt(2 * math.pi)
_norm_pdf_logC = math.log(_norm_pdf_C)
_ndtri_exp_approx_C = math.sqrt(3) / math.pi
_log_2 = math.log(2)


def _ndtri_exp(y: np.ndarray) -> np.ndarray:
    # Flip the sign of y close to zero for better numerical stability and flip back the sign later.
    flipped = y > -1e-2
    z = y.copy()
    z[flipped] = np.log(-np.expm1(y[flipped]))
    x = np.empty_like(y)
    if (small_inds := np.nonzero(z < -5))[0].size:
        x[small_inds] = -np.sqrt(-2.0 * (z[small_inds] + _norm_pdf_logC))
    if (moderate_inds := np.nonzero(z >= -5))[0].size:
        x[moderate_inds] = -_ndtri_exp_approx_C * np.log(np.expm1(-z[moderate_inds]))

    for _ in range(100):
        log_ndtr_x = optuna.samplers._tpe._truncnorm._log_ndtr(x)
        log_norm_pdf_x = -0.5 * x**2 - _norm_pdf_logC
        # NOTE(nabenabe): Use exp(log_ndtr_x - log_norm_pdf_x) instead of ndtr_x / norm_pdf_x for
        # numerical stability.
        dx = (log_ndtr_x - z) * np.exp(log_ndtr_x - log_norm_pdf_x)
        x -= dx
        if np.all(np.abs(dx) < 1e-8 * np.abs(x)):  # NOTE: rtol controls the precision.
            # Equivalent to np.isclose with atol=0.0 and rtol=1e-8.
            break
    x[flipped] *= -1
    # NOTE(nabe): x[y == 0.0] = np.inf, x[np.isneginf(y)] = -np.inf are necessary for the accurate
    # computation, but we omit them as the ppf applies clipping, removing the need for them.
    return x


def _ndtri_exp_single_mp(y):
    a = -1e9
    b = +1e9
    for _ in range(1000):
        m = (a + b) / 2
        if mpmath.log(mpmath.ncdf(m)) < y:
            a = m
        else:
            b = m
    return (a + b) / 2

mpmath.mp.dps = 100

y = np.asarray([-(10**i) for i in np.arange(-50, 10, 0.1)])
x_master = np.asarray([optuna.samplers._tpe._truncnorm._ndtri_exp_single(yi) for yi in y])
x_pr = _ndtri_exp(y)
x_mp = np.asarray([_ndtri_exp_single_mp(yi) for yi in y])
plt.plot(-y, x_master - x_mp, label="master")
plt.plot(-y, x_pr - x_mp, label="PR")
plt.xscale("log")
plt.legend()
plt.savefig("6229.png")
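
The loop in `_ndtri_exp` above can be read as Newton's method applied to the log-CDF. The derivation below is a reading of the code rather than something stated in the thread; $\Phi$ and $\varphi$ denote the standard normal CDF and PDF, and $z$ is the log-probability after the optional flip.

```latex
\begin{aligned}
f(x) &= \log\Phi(x) - z, \qquad f'(x) = \frac{\varphi(x)}{\Phi(x)}, \\
x &\leftarrow x - \frac{f(x)}{f'(x)}
   = x - \bigl(\log\Phi(x) - z\bigr)\,
         \exp\bigl(\log\Phi(x) - \log\varphi(x)\bigr), \\
\log\varphi(x) &= -\tfrac{1}{2}x^{2} - \log\sqrt{2\pi}, \\
x_0 &=
\begin{cases}
  -\sqrt{-2\,\bigl(z + \log\sqrt{2\pi}\bigr)}, & z < -5, \\[4pt]
  -\dfrac{\sqrt{3}}{\pi}\,\log\bigl(e^{-z} - 1\bigr), & z \ge -5.
\end{cases}
\end{aligned}
```

The first starting point appears to come from the tail approximation $\log\Phi(x) \approx \log\varphi(x) - \log(-x)$ with the $\log(-x)$ term dropped, and the second from approximating $\Phi$ by a logistic CDF with unit variance. For entries with $y > -10^{-2}$ the script solves for $z = \log(1 - e^{y})$ instead and negates the result, using $\Phi(-x) = 1 - \Phi(x)$.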

github-actions · 2025-08-12T23:06:35Z

This pull request has not seen any recent activity.

kAIto47802

Thank you for the PR! I followed the equation transformations and their implementation, and confirmed that they are correct.

One minor suggestion is to use np.flatnonzero() instead of np.nonzero()[0].
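
For a 1-D array, the two spellings return the same index array, so the suggestion is purely about readability. The sketch below shows the equivalence and how the initial-guess branches might look with `np.flatnonzero`. The function name `initial_guess` is hypothetical, and this is not necessarily what was merged.

```python
import math

import numpy as np

_norm_pdf_logC = math.log(math.sqrt(2 * math.pi))
_ndtri_exp_approx_C = math.sqrt(3) / math.pi

z = np.array([-7.0, -0.5, -12.0, -3.0])

# np.nonzero returns a tuple with one index array per dimension,
# while np.flatnonzero returns the index array of the flattened input directly.
idx_nonzero = np.nonzero(z < -5)[0]
idx_flat = np.flatnonzero(z < -5)


def initial_guess(z: np.ndarray) -> np.ndarray:
    """Compute the starting points for the Newton iteration, branch by branch."""
    x = np.empty_like(z)
    if (small_inds := np.flatnonzero(z < -5)).size:
        x[small_inds] = -np.sqrt(-2.0 * (z[small_inds] + _norm_pdf_logC))
    if (moderate_inds := np.flatnonzero(z >= -5)).size:
        x[moderate_inds] = -_ndtri_exp_approx_C * np.log(np.expm1(-z[moderate_inds]))
    return x


x0 = initial_guess(z)
```

Note that the two calls differ for arrays with more than one dimension, where `np.flatnonzero` indexes into the flattened array.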

nabenabe0928 added 3 commits August 1, 2025 11:32

Vectorize ndtri_exp

38371cf

Refactor

6f9209f

Add more comments

dcfd947

nabenabe0928 added the enhancement label ("Change that does not break compatibility and not affect public interfaces, but improves performance.") Aug 1, 2025

nabenabe0928 added 13 commits August 1, 2025 12:05

Fix

e8254db

Rename test

a100084

Fix

e3d61be

Use vectorized rvs

6fcd9cd

Fix almost

0852259

Fix ndtri exp

a753392

Revert as much as possible

89fecd6

Revert as much as possible

7a29c15

Revert as much as possible

1446d73

Fix CI

cd88067

Refactor

5dca4bf

Refactor

96c7882

Add an inline comment

b223be9

nabenabe0928 assigned kAIto47802 and sawa3030 Aug 4, 2025

nabenabe0928 assigned not522 and unassigned sawa3030 Aug 4, 2025

Modify comments

03e2d58

not522 approved these changes Aug 5, 2025


not522 removed their assignment Aug 5, 2025

github-actions bot added the stale label ("Exempt from stale bot labeling.") Aug 12, 2025

nabenabe0928 removed the stale label ("Exempt from stale bot labeling.") Aug 12, 2025

kAIto47802 approved these changes Aug 13, 2025


nabenabe0928 merged commit e6b2e42 into optuna:master Aug 13, 2025
14 checks passed

nabenabe0928 added this to the v4.5.0 milestone Aug 13, 2025

nabenabe0928 unassigned kAIto47802 Aug 13, 2025

nabenabe0928 mentioned this pull request Oct 16, 2025

Speedup of TPESampler in Optuna nabenabe0928/my-skills#4

Open

nabenabe0928 merged 17 commits into optuna:master from nabenabe0928:enhance/vectorize-ndtri-exp
