[RemoteModels] OpenAI batch Support #9156
Conversation
# Conflicts: # tests/serving/test_async_flow.py
Update test to use v2modelserver as model_mode instead of None
Add simpler test: test_monitoring_with_model_runner_batch_infer
royischoss left a comment:
Hey, looks good. Some comments below.
Force-pushed from 1d3c600 to 33ef902
Looks good so far, just need to go over the tests.
One thing we should still think about is how the naive execution mechanism interacts with the asyncio event loop.
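To make that concern concrete, here is a minimal sketch (the pool size and function names are hypothetical, not from the PR) of keeping a blocking sync batch call off the event loop via run_in_executor:

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor

# A blocking (sync) batch call executed directly inside a coroutine stalls
# the entire event loop; dispatching it to a thread pool avoids that.
_executor = ThreadPoolExecutor(max_workers=4)  # assumed pool size

async def invoke_batch_nonblocking(sync_batch_fn, inputs):
    loop = asyncio.get_running_loop()
    # Offload the blocking call so other coroutines keep making progress.
    return await loop.run_in_executor(_executor, sync_batch_fn, inputs)
```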
davesh0812 left a comment:
LGTM, one comment for a follow-up PR.
```python
try:
    # gather() stops on first exception - fast fail
    return await asyncio.gather(*tasks)
except:
```
Please add a relevant list of exceptions here.
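For example, assuming the OpenAI client and timeouts are the expected failure sources, the bare except could be narrowed to something like the following (the exact exception list is an assumption, not the PR's final choice):

```python
import asyncio

import openai

async def run_batch(tasks):
    try:
        # gather() stops on first exception - fast fail
        return await asyncio.gather(*tasks)
    except (openai.OpenAIError, asyncio.TimeoutError) as exc:
        # Catch only the failures expected from the client; let genuine bugs
        # (e.g. TypeError) propagate unmasked.
        raise RuntimeError("batch inference failed") from exc
```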
📝 Description
Added support for OpenAI batch processing, including asynchronous batch execution.
Added integration and unit tests.
Depends on #9117
🛠️ Changes Made
Added batch invocation support (sync via ThreadPoolExecutor, async via asyncio.gather) with two-level concurrency control: (1) an instance-level thread pool (_executor) for sync batches, with a per-batch semaphore limiting concurrent threads, and (2) a class-level global async semaphore (_global_async_semaphore) shared across all instances to enforce the total async request limit, while per-batch semaphores ensure fair distribution, preventing API rate-limit violations and resource monopolization. A sketch of this mechanism appears under Additional Notes below.

✅ Checklist
🧪 Testing
🔗 References
🚨 Breaking Changes?
🔍️ Additional Notes
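As a reference for the two-level concurrency control described above, here is a minimal sketch; the semaphore names follow the description, but the class name, limits, and the client call are illustrative assumptions rather than the PR's actual code:

```python
import asyncio

class OpenAIBatchInvoker:
    # Class-level semaphore shared by all instances: caps total in-flight
    # async requests across every batch (assumed limit of 32).
    _global_async_semaphore = asyncio.Semaphore(32)

    def __init__(self, per_batch_limit: int = 8):
        # The per-batch limit keeps one large batch from monopolizing the
        # global budget, so concurrent batches share capacity fairly.
        self._per_batch_limit = per_batch_limit

    async def invoke_batch(self, client, requests):
        per_batch = asyncio.Semaphore(self._per_batch_limit)

        async def _one(request):
            # Acquire both levels: batch-local fairness, then the global cap.
            async with per_batch, self._global_async_semaphore:
                return await client.chat.completions.create(**request)

        # gather() fails fast on the first exception (as in the PR snippet).
        return await asyncio.gather(*(_one(r) for r in requests))
```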