[ML] Refactor AutodetectMemoryLimitIT to use memory limit constants and improve model size assertions #135526
Conversation
- Removed muted tests for `testManyDistinctOverFields` and `testTooManyByAndOverFields`.
- Introduced constants for memory limits in `AutodetectMemoryLimitIT`.
- Updated assertions to check effective model size against calculated limits.
Pinging @elastic/ml-core (Team:ML)
jan-elastic left a comment:

Generally looks good. Some small comments.
- Changed variable names from `memoryLimit` to `memoryLimitMb` for clarity.
- Updated memory limit assertions to reflect the new variable naming.
- Ensured consistency in memory limit usage across multiple test cases.
 */
public class AutodetectMemoryLimitIT extends MlNativeAutodetectIntegTestCase {

    private static final long PROCESS_OVERHEAD_BYTES = ByteSizeValue.ofMb(20).getBytes();
Where is this value coming from?
This value was estimated empirically. I used a small, simple anomaly detection job with a trivial model to establish a lower bound on the process memory usage.
Would it be helpful to add a comment explaining that? If there's a risk of that value changing at some point in the future, it would be good to know what needs to be done to recalculate and update it.
On second thought, checking whether the autodetect process has a specific memory overhead makes these tests unnecessarily brittle. I removed that check and only kept the check for hard_limit. In ml-cpp we have unit tests to ensure there are no memory leaks.
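The trade-off described above can be sketched in plain Java. All names below are illustrative, not taken from the PR: the point is that asserting a fixed byte overhead bakes an empirical measurement into the test, while asserting only the memory status is platform-independent.

```java
// Minimal sketch of the robustness trade-off (illustrative names only).
public class HardLimitCheckSketch {
    enum MemoryStatus { OK, SOFT_LIMIT, HARD_LIMIT }

    // Removed-style check: compares against an empirically estimated constant
    // (~20 MB of native process overhead) and breaks if that value ever shifts.
    static final long PROCESS_OVERHEAD_BYTES = 20L * 1024 * 1024;

    static boolean exceedsOverhead(long processBytes, long modelBytes) {
        return processBytes - modelBytes >= PROCESS_OVERHEAD_BYTES;
    }

    // Kept check: only asserts that the job actually hit its memory limit.
    static boolean hitHardLimit(MemoryStatus status) {
        return status == MemoryStatus.HARD_LIMIT;
    }

    public static void main(String[] args) {
        System.out.println(hitHardLimit(MemoryStatus.HARD_LIMIT)); // prints "true"
    }
}
```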
assertThat(modelSizeStats.getModelBytes(), lessThan(120500000L));
assertThat(modelSizeStats.getModelBytes(), greaterThan(70000000L));
assertThat(getEffectiveModelSize(modelSizeStats.getModelBytes()), lessThan(ByteSizeValue.ofMb(memoryLimitMb).getBytes() * 1.05));
assertThat(modelSizeStats.getMemoryStatus(), equalTo(ModelSizeStats.MemoryStatus.HARD_LIMIT));
The reporting of memory usage differs across platforms: on Linux and Windows we report actual process memory usage, while on macOS this information is not available, so we report estimated memory usage. This makes testing for a specific memory limit brittle. To improve robustness, I only check that the job is in the hard_limit state.
assertThat(modelSizeStats.getModelBytes(), lessThan(72000000L));
assertThat(modelSizeStats.getModelBytes(), greaterThan(24000000L));
assertThat(getEffectiveModelSize(modelSizeStats.getModelBytes()), lessThan(ByteSizeValue.ofMb(memoryLimitMb).getBytes() * 1.05));
assertThat(modelSizeStats.getMemoryStatus(), equalTo(ModelSizeStats.MemoryStatus.HARD_LIMIT));
Only check whether the job is in the hard_limit state, to increase test robustness across platforms.
assertThat(modelSizeStats.getModelBytes(), greaterThan(24000000L));

assertThat(getEffectiveModelSize(modelSizeStats.getModelBytes()), lessThan(ByteSizeValue.ofMb(memoryLimitMb).getBytes() * 1.05));
assertThat(
Only check for hard_limit to increase test robustness.
assertThat(modelSizeStats.getModelBytes(), lessThan(45000000L));
assertThat(modelSizeStats.getModelBytes(), greaterThan(25000000L));
assertThat(getEffectiveModelSize(modelSizeStats.getModelBytes()), lessThan(ByteSizeValue.ofMb(memoryLimitMb).getBytes() * 1.05));
assertThat(modelSizeStats.getMemoryStatus(), equalTo(ModelSizeStats.MemoryStatus.HARD_LIMIT));
Only check hard_limit for robustness.
Previously, the upper bound for model memory checks was set in absolute terms, which is hard to understand and brittle. I adjusted the assertions to ensure that memory usage does not exceed the memory limit by more than 5%. Additionally, on Linux we now report the process size (see #131981), which includes approximately 20 MB of native code overhead. I made handling this overhead more explicit.
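The arithmetic behind the new assertion can be sketched as follows. The helper name `getEffectiveModelSize` and the ~20 MB overhead figure come from the PR discussion; the standalone implementation below is an assumption for illustration and may differ from the actual test code:

```java
// Illustrative sketch of the relative memory-limit check (not the actual test code).
public class EffectiveModelSizeSketch {
    // On Linux the reported bytes include the whole process size, with
    // roughly 20 MB of native-code overhead (per the PR discussion).
    static final long NATIVE_OVERHEAD_BYTES = 20L * 1024 * 1024;

    // Subtract the overhead so the result is comparable to the model memory limit.
    static long getEffectiveModelSize(long reportedModelBytes) {
        return reportedModelBytes - NATIVE_OVERHEAD_BYTES;
    }

    // Relative upper bound: effective size must stay within 5% above the limit.
    static boolean withinLimit(long reportedModelBytes, long memoryLimitMb) {
        double allowedBytes = memoryLimitMb * 1024.0 * 1024.0 * 1.05;
        return getEffectiveModelSize(reportedModelBytes) < allowedBytes;
    }

    public static void main(String[] args) {
        // A 30 MB limit: a reported 50 MB process maps to a 30 MB effective
        // model size, which is below the 31.5 MB allowance.
        System.out.println(withinLimit(50L * 1024 * 1024, 30)); // prints "true"
    }
}
```

The relative bound survives changes to the configured limit automatically, whereas the old absolute byte counts had to be re-measured whenever the limit or the native process changed.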
More details: unmutes testManyDistinctOverFields and testTooManyByAndOverFields in AutodetectMemoryLimitIT.

Closes #132308
Closes #132310
Closes #132611