Reduce `SELECT` statements of `_CachedStorage.get_all_trials` by fixing filtering conditions by HideakiImamura · Pull Request #5704 · optuna/optuna

HideakiImamura · 2024-10-11T09:46:43Z

Motivation

This PR aims to reduce the number of SELECT statements issued by _CachedStorage.get_all_trials. The current filtering in RDBStorage._get_trials works by removing "excluded" trials, but it can be simplified by selecting "included" trials instead.

Description of the changes

Fix filtering conditions
Fix existing tests
Add unit tests for the new argument trial_id_cursor
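
To illustrate the motivation, here is a minimal sketch in plain Python with no database involved. The in-memory `trials` dict and the two `select_by_*` functions are hypothetical stand-ins, not Optuna APIs; only the parameter names `excluded_trial_ids`, `included_trial_ids`, and `trial_id_greater_than` follow the names used in this thread. It assumes trial IDs grow over time, so anything newer than a cursor can be picked up with a single comparison.

```python
from typing import Dict, List, Set

# Hypothetical in-memory "table": trial_id -> state. Not an Optuna API.
trials: Dict[int, str] = {i: "COMPLETE" for i in range(1000)}
trials.update({1000: "RUNNING", 1001: "RUNNING", 1002: "COMPLETE", 1003: "RUNNING"})


def select_by_exclusion(excluded_trial_ids: Set[int]) -> List[int]:
    """Exclusion style: return every trial that is NOT in the excluded set."""
    return sorted(t for t in trials if t not in excluded_trial_ids)


def select_by_inclusion(included_trial_ids: Set[int], trial_id_greater_than: int) -> List[int]:
    """Inclusion style: return the listed trials plus anything newer than the cursor."""
    return sorted(
        t for t in trials if t in included_trial_ids or t > trial_id_greater_than
    )


# Assume the cache already holds finished trials 0..999 and knows that
# trials 1000 and 1001 were still unfinished at the last refresh.
cached_finished = set(range(1000))
unfinished = {1000, 1001}
last_finished = 999

old = select_by_exclusion(cached_finished)
new = select_by_inclusion(unfinished, last_finished)
assert old == new == [1000, 1001, 1002, 1003]
print(len(cached_finished), len(unfinished))  # size of each filter: 1000 2
```

Both styles return the same trials here, but the exclusion filter grows with the number of finished trials, while the inclusion filter only grows with the number of unfinished ones.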

HideakiImamura · 2024-10-14T23:47:25Z

@y0z @c-bata Could you review this PR?

c-bata · 2024-10-18T01:26:36Z

I assigned @porink0424 as an additional reviewer, considering his recent contributions to RDBStorage. I'll proceed with reviewing the changes after @porink0424 has approved this PR.

porink0424

I've left a few initial comments for now👍 I may add more comments later.

optuna/storages/_rdb/storage.py

optuna/storages/_cached_storage.py

c-bata

Let me leave some early feedback comments.

Do you think we should update the unfinished_trial_ids and last_finished_trial_id in _CachedStorage.set_trial_state_values?
https://github.com/optuna/optuna/pull/5704/files#diff-8ce5c3176c6b5fa3a21ed11da3f19d07c7c0291b7b7b6b67221b057855b7ac50R187

c-bata · 2024-10-18T07:32:05Z

I ran a micro-benchmark and confirmed that this PR makes study.optimize() significantly faster, especially for studies with many trials.

Benchmark Script

import optuna
import time
import numpy as np

from concurrent.futures import ThreadPoolExecutor

n_trials = 10000
# Disable calling storage.get_best_trial for logging
optuna.logging.set_verbosity(optuna.logging.WARNING)


def objective(trial: optuna.Trial) -> float:
    s = 0.0
    for i in range(18):
        trial.set_user_attr(f"attr{i}", "attr value")
    for i in range(8):
        s += trial.suggest_float(f"x{i}", -10, 10) ** 2
    return s


storage = optuna.storages.RDBStorage("mysql+pymysql://optuna:password@127.0.0.1:3306/optuna")
start = time.time()
study = optuna.create_study(storage=storage, sampler=optuna.samplers.RandomSampler())
study.optimize(objective, n_trials, n_jobs=10)
print(f"study.optimize elapsed: {time.time() - start}")

start = time.time()
with ThreadPoolExecutor(max_workers=10) as pool:
    for i in range(20):
        pool.submit(storage.get_all_trials, study._study_id)
print(f"storage.get_all_trials: {time.time() - start}")

#!/bin/sh

docker pull mysql:8.0
docker stop optuna-mysql
sleep 3

set -e

docker run -d --rm -p 3306:3306 -v ./mysql-conf:/etc/mysql/conf.d \
    -e MYSQL_USER=optuna -e MYSQL_DATABASE=optuna -e MYSQL_PASSWORD=password -e MYSQL_ALLOW_EMPTY_PASSWORD=yes \
    --name optuna-mysql \
    mysql:8.0

echo "Wait ready for mysql"
sleep 20

python3 benchmark_for_pr5704.py

Benchmark Results

`study.optimize` (10 threads)

n_trials	master (s)	this PR (s)	diff
1000	76.90139293670654	74.6462049484253	-2.93%
10000	1038.7614076137543	762.2547173500061	-26.61%
50000	13536.754137277603	4220.0592222213745	-68.82%

`storage.get_all_trials` x 20 (10 threads)

n_trials	master (s)	this PR (s)	diff
1000	24.590099096298218	24.788296461105347	+0.80%
10000	237.29632568359375	242.50016379356384	+2.19%
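
As a reading aid, the diff column can be recomputed from the elapsed times. The sketch below assumes the column is the relative change `(this PR - master) / master`; that interpretation is not stated in the thread, and the last digit may differ slightly from the table depending on rounding. The function name is hypothetical.

```python
def relative_change_percent(master: float, this_pr: float) -> float:
    """Relative change of `this_pr` against `master`, in percent."""
    return (this_pr - master) / master * 100.0


# (n_trials, master seconds, this-PR seconds) copied from the study.optimize table.
optimize_rows = [
    (1000, 76.90139293670654, 74.6462049484253),
    (10000, 1038.7614076137543, 762.2547173500061),
    (50000, 13536.754137277603, 4220.0592222213745),
]

for n_trials, master, this_pr in optimize_rows:
    change = relative_change_percent(master, this_pr)
    speedup = master / this_pr  # how many times faster this PR is
    print(f"{n_trials:>6}: {change:+.2f}% ({speedup:.2f}x)")
```

Under that assumption, the 50000-trial row corresponds to roughly a 3.2x speedup of study.optimize.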

HideakiImamura · 2024-10-21T02:45:51Z

@porink0424 @c-bata Thanks for the review comments. I applied your suggestions. Please take a look.

> Do you think we should update the unfinished_trial_ids and last_finished_trial_id in _CachedStorage.set_trial_state_values?

I don't have a strong opinion, but I think we don't have to update them in set_trial_state_values. Currently, they are updated only when _read_trials_from_remote_storage is called (and when a new trial is created). There are several reasons.

(1) The existing finished_trial_ids is also updated only under these same conditions.
(2) If we use a sampler that does not depend on the optimization history, i.e., one that does not call get_all_trials, then it is unnecessary to update unfinished_trial_ids and last_finished_trial_id during the optimization. The first call of get_all_trials for later analysis would be slow, but this does not have a significant impact, and that call is actually sped up by the current changes when there are many trials.
(3) If we use a sampler that depends on the optimization history, then unfinished_trial_ids and last_finished_trial_id are updated at each sampling. Therefore, there is no significant impact.
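
The bookkeeping described above can be sketched with a toy cache. This is a simplified illustration, not the implementation in Optuna: `ToyBackend`, `ToyCache`, and `read_from_remote` are hypothetical names, states are plain strings, and concurrency and other edge cases are ignored. Only the field names `unfinished_trial_ids` and `last_finished_trial_id` come from the discussion. The point it shows is that the two fields are updated only when the cache reads from the backend, not when a trial's state changes.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple

FINISHED = {"COMPLETE", "PRUNED", "FAIL"}


@dataclass
class ToyBackend:
    """Hypothetical stand-in for a remote storage: trial_id -> state."""

    states: Dict[int, str] = field(default_factory=dict)

    def get_trials(self, included: Set[int], greater_than: int) -> List[Tuple[int, str]]:
        # Return listed trials plus every trial newer than the cursor.
        return sorted(
            (tid, st) for tid, st in self.states.items()
            if tid in included or tid > greater_than
        )


@dataclass
class ToyCache:
    backend: ToyBackend
    cached: Dict[int, str] = field(default_factory=dict)
    unfinished_trial_ids: Set[int] = field(default_factory=set)
    last_finished_trial_id: int = -1

    def read_from_remote(self) -> int:
        """Refresh the cache and return how many rows were fetched."""
        rows = self.backend.get_trials(self.unfinished_trial_ids, self.last_finished_trial_id)
        for tid, state in rows:
            self.cached[tid] = state
            if state in FINISHED:
                self.unfinished_trial_ids.discard(tid)
                self.last_finished_trial_id = max(self.last_finished_trial_id, tid)
            else:
                self.unfinished_trial_ids.add(tid)
        return len(rows)


backend = ToyBackend({0: "COMPLETE", 1: "RUNNING", 2: "COMPLETE"})
cache = ToyCache(backend)
first = cache.read_from_remote()   # empty cache: fetches all three trials
backend.states[1] = "COMPLETE"     # state changes remotely; the cache is not told
backend.states[3] = "RUNNING"
second = cache.read_from_remote()  # fetches only trial 1 and trial 3
print(first, second, sorted(cache.unfinished_trial_ids), cache.last_finished_trial_id)
# prints: 3 2 [3] 2
```

In this toy, the stale entry for trial 1 is corrected at the next read, which matches the argument in (3) that a history-dependent sampler refreshes the fields on every sampling anyway.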

c-bata

Thank you for the pull request. Changes look almost good to me. I left one minor suggestion though.

c-bata · 2024-10-23T03:01:48Z

optuna/storages/_rdb/storage.py

-                    .all()
-                )
+                elif trial_id_greater_than > -1:
+                    _query = query.filter(models.TrialModel.trial_id > trial_id_greater_than)


Can we reuse the query variable and remove the else block?

Suggested change

- _query = query.filter(models.TrialModel.trial_id > trial_id_greater_than)
+ query = query.filter(models.TrialModel.trial_id > trial_id_greater_than)

HideakiImamura · 2024-10-24

The variable query is also used in the except block, where we don't add the filter to the SQL query, so I used a different variable name here.

c-bata · 2024-10-29

I see. I noticed another issue and have one concern:

Currently, selectinload is used even in the except block, which is likely unintentional.

Although this isn’t related to your changes, the logic in the except block isn't tested at all.

What are your thoughts on these?

HideakiImamura · 2024-10-29

Thanks for the comments.

> Currently, selectinload is used even in the except block, which is likely unintentional.

I think we have already used selectinload in this except block (ref). Do you think we should avoid this? Processing enters the except block because of SQLite's limit of 999 variables in an IN clause, as noted in the code comments above and in the documentation, so I think it is reasonable to use selectinload here as we do in the other cases.

> Although this isn’t related to your changes, the logic in the except block isn't tested at all.

In my understanding, this except block is already tested here. However, it is not obvious what this test case tests.

c-bata · 2024-10-29

> I think we have already used selectinload in this except block (ref).

Ah, you are absolutely correct!

> In my understanding, this except block is already tested here. However, it is not obvious what this test case tests.

As for unit tests, it seems that the except block isn’t actually tested in the master branch.

$ git co master
$ python3
>>> import optuna
>>> storage = optuna.storages.RDBStorage("sqlite:///test.db")
>>> study_id = storage.create_new_study(directions=[optuna.study.StudyDirection.MINIMIZE])
[I 2024-10-29 18:29:53,124] A new study created in RDB with name: no-name-56f4ff56-e9e8-488a-8c93-d88e312ad97b
>>> storage.create_new_trial(study_id)
1
>>> trials = storage._get_trials(study_id, states=None, excluded_trial_ids=set(range(500000)))
>>>

That said, this PR also addresses the test case, so my concern is effectively resolved. Thank you for your great work!

$ gh pr checkout 5704
$ python3
>>> import optuna
>>> storage = optuna.storages.RDBStorage("sqlite:///test.db")
>>> study_id = storage.create_new_study(directions=[optuna.study.StudyDirection.MINIMIZE])
[I 2024-10-29 18:29:53,124] A new study created in RDB with name: no-name-56f4ff56-e9e8-488a-8c93-d88e312ad97b
>>> trial_id = storage.create_new_trial(study_id)
>>> trial_id_greater_than = trial_id + 500000
>>> trials = storage._get_trials(
...     study_id,
...     states=None,
...     included_trial_ids=set(range(500000)),
...     trial_id_greater_than=trial_id_greater_than,
... )
: (Background on this error at: https://sqlalche.me/e/20/e3q8). Falling back to a slower alternative.
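
The fallback pattern discussed in this thread can be sketched with the standard-library sqlite3 module instead of SQLAlchemy. This is an illustration under stated assumptions, not the code in optuna/storages/_rdb/storage.py: the `trials` table, the `get_trial_ids` function, and the SQL strings are hypothetical. It shows why the filtered statement gets its own variable name: the unfiltered base statement must stay available for the except block. The actual variable limit depends on how SQLite was built (the thread cites 999), so the large call below may or may not take the fallback path.

```python
import sqlite3
from typing import List, Set


def get_trial_ids(
    conn: sqlite3.Connection, included_trial_ids: Set[int], trial_id_greater_than: int
) -> List[int]:
    base_sql = "SELECT trial_id FROM trials"  # kept unfiltered for the fallback
    try:
        placeholders = ",".join("?" * len(included_trial_ids))
        # A separate name, so `base_sql` stays usable in the except block.
        filtered_sql = f"{base_sql} WHERE trial_id IN ({placeholders}) OR trial_id > ?"
        params = [*included_trial_ids, trial_id_greater_than]
        rows = conn.execute(filtered_sql, params).fetchall()
    except sqlite3.OperationalError:
        # For example "too many SQL variables": fetch everything, filter in Python.
        rows = [
            r for r in conn.execute(base_sql).fetchall()
            if r[0] in included_trial_ids or r[0] > trial_id_greater_than
        ]
    return sorted(r[0] for r in rows)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE trials (trial_id INTEGER PRIMARY KEY)")
conn.executemany("INSERT INTO trials VALUES (?)", [(i,) for i in range(1, 11)])

small = get_trial_ids(conn, {2, 3}, 8)
large = get_trial_ids(conn, set(range(100_000)), 8)  # may exceed the variable limit
print(small)       # [2, 3, 9, 10]
print(len(large))  # 10, whichever path was taken
```

Both paths return the same rows; the fallback is only slower because it transfers every row and filters on the client side.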

porink0424

LGTM, as long as the comment above from @c-bata is addressed.

optuna/storages/_rdb/storage.py

tests/storages_tests/test_storages.py

HideakiImamura · 2024-10-25T00:10:13Z

@c-bata @not522 @y0z Thanks for the comments. I updated the code and replied to your suggestions. PTAL.

y0z

LGTM

c-bata

Looks perfect to me! 💯

HideakiImamura added 2 commits October 11, 2024 15:26

Update filter conditions for the cached storage

29134bb

Fix bug

a938bf4

HideakiImamura marked this pull request as draft October 11, 2024 09:46

HideakiImamura added 2 commits October 15, 2024 08:37

Merge branch 'master' into update-filter-condition-for-cached-storage

7e97dab

Add unit tests for trial_id_cursor

7dc4ff3

HideakiImamura marked this pull request as ready for review October 14, 2024 23:46

HideakiImamura assigned c-bata and y0z Oct 14, 2024

c-bata added the enhancement label (Change that does not break compatibility and not affect public interfaces, but improves performance.) Oct 15, 2024

Merge branch 'master' into update-filter-condition-for-cached-storage

dd475ea

c-bata assigned porink0424 Oct 16, 2024

porink0424 reviewed Oct 18, 2024

optuna/storages/_rdb/storage.py

optuna/storages/_rdb/storage.py

c-bata reviewed Oct 18, 2024

optuna/storages/_cached_storage.py

c-bata reviewed Oct 18, 2024

HideakiImamura added 2 commits October 21, 2024 11:20

Merge branch 'master' into update-filter-condition-for-cached-storage

070aa4f

Apply review comments

a940abc

HideakiImamura added 2 commits October 21, 2024 11:57

Fix

05542e3

Update according to the feedback from internal discussions

93d482c

c-bata reviewed Oct 23, 2024

c-bata added this to the v4.1.0 milestone Oct 23, 2024

porink0424 approved these changes Oct 23, 2024

c-bata unassigned porink0424 Oct 23, 2024

not522 reviewed Oct 23, 2024

optuna/storages/_rdb/storage.py

y0z reviewed Oct 23, 2024

tests/storages_tests/test_storages.py

Follow review comments

70aa7e9

HideakiImamura added 2 commits October 25, 2024 13:53

Apply review comments

d359c55

Merge branch 'master' into update-filter-condition-for-cached-storage

c90d6dd

y0z approved these changes Oct 29, 2024

y0z removed their assignment Oct 29, 2024

c-bata approved these changes Oct 29, 2024

c-bata merged commit ee67746 into optuna:master Oct 29, 2024

HideakiImamura deleted the update-filter-condition-for-cached-storage branch October 29, 2024 09:41

not522 mentioned this pull request Oct 22, 2025

Fix incremental update algorithm in _CachedStorage's _read_trials_from_remote_storage #6310

Merged
