[model-gateway] Add e2e tests of streaming events and tool choice for response api by XinyueZhang369 · Pull Request #13880 · sgl-project/sglang

XinyueZhang369 · 2025-11-25T03:32:06Z

Motivation

This PR adds more integration test cases for e2e responses API, gRPC backend.

Modifications

Add streaming event tests for output_index, OutputItemDone, and reasoning content in output array
Add tool choice tests

Test result

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.
Work with maintainers to merge your PR. See the PR Merge Process

gemini-code-assist · 2025-11-25T03:32:24Z

Summary of Changes

Hello @XinyueZhang369, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the robustness of the model-gateway's Response API by introducing comprehensive end-to-end integration tests. These new tests validate critical functionalities such as the accurate handling of streaming events, including zero-based indexing and event completion, and the correct behavior of the tool_choice parameter across various configurations and tool types. This ensures the API's reliability and predictability for both streaming responses and advanced tool integration.

Highlights

Streaming Event Validation: New end-to-end tests have been added to validate the behavior of streaming events in the Response API, ensuring correct zero-based output_index handling, proper emission of OutputItemDone events, and accurate construction of the output array in completed responses.
Tool Choice Parameter Testing: Comprehensive end-to-end tests for the tool_choice parameter have been introduced, covering various scenarios such as auto, required, and specific function choices, as well as verifying its functionality in streaming mode and with mixed tool types (function and MCP tools).
Harmony Backend Specific Tests: Dedicated tests for the Harmony backend ensure that reasoning content within streaming events correctly adheres to zero-based output_index and is properly included in the final output array of completed responses.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request adds valuable end-to-end tests for streaming events and the tool_choice parameter in the response API. The tests are well-structured and cover a good range of scenarios, including different backends and edge cases like mixed tool types. My main feedback is focused on improving the maintainability of the new test file test_tool_choice.py by refactoring duplicated and inconsistent tool definitions into shared constants. This will make the tests cleaner and easier to manage in the future.

key4ng

overall lgtm. noticed the ci running time increased to around 8 min. currently does every time we add a new class it will have to restart the backend?

XinyueZhang369 · 2025-11-27T01:55:05Z

overall lgtm. noticed the ci running time increased to around 8 min. currently does every time we add a new class it will have to restart the backend?

Sadly yes, I merged mcp , function call and tool choice tests into 1 test class to save some time

XinyueZhang369 · 2025-11-27T03:01:17Z

Also noticing that some tests like test_basic_function_call, can be a bit flaky, thinking about adding the retry for all responses e2e tests, what do you think?

key4ng · 2025-12-01T23:24:47Z

There is a ci-workflow change. May need @slin1237 's approval

… response api (sgl-project#13880) Co-authored-by: Simo Lin <linsimo.mark@gmail.com>

Add more e2e tests for response api

586eb94

XinyueZhang369 requested review from CatherineSue and key4ng as code owners November 25, 2025 03:32

github-actions Bot added the model-gateway label Nov 25, 2025

gemini-code-assist Bot reviewed Nov 25, 2025

View reviewed changes

Comment thread sgl-router/py_test/e2e_response_api/features/test_tool_choice.py Outdated

slin1237 added 2 commits November 26, 2025 11:42

Merge branch 'main' into xinyue/response-api-e2e-tests

c8ea5c6

Merge branch 'main' into xinyue/response-api-e2e-tests

5aacbe8

slin1237 added the run-ci label Nov 26, 2025

Xinyue Zhang added 2 commits November 26, 2025 13:30

Merge branch 'main' into xinyue/response-api-e2e-tests

ff4fb51

Merge branch 'main' into xinyue/response-api-e2e-tests

4e1bf1b

key4ng reviewed Nov 26, 2025

View reviewed changes

Address comments

bff2e6c

Xinyue Zhang added 2 commits November 26, 2025 18:09

Merge branch 'main' into xinyue/response-api-e2e-tests

5de2d77

Add retry on responses tests

824bd7b

XinyueZhang369 requested review from Fridge003, Kangyan-Zhou, ispobock and merrymercy as code owners November 27, 2025 02:57

Merge branch 'main' into xinyue/response-api-e2e-tests

94d9313

key4ng reviewed Dec 1, 2025

View reviewed changes

Comment thread scripts/ci/ci_install_dependency.sh Outdated

install pytest-rerunfailures in pr-test-rust.yml

b199095

key4ng approved these changes Dec 1, 2025

View reviewed changes

slin1237 approved these changes Dec 1, 2025

View reviewed changes

slin1237 merged commit 1d66a14 into sgl-project:main Dec 1, 2025
55 checks passed

XinyueZhang369 deleted the xinyue/response-api-e2e-tests branch December 2, 2025 00:10

harvenstar pushed a commit to harvenstar/sglang that referenced this pull request Dec 4, 2025

[model-gateway] Add e2e tests of streaming events and tool choice for…

4619136

… response api (sgl-project#13880) Co-authored-by: Simo Lin <linsimo.mark@gmail.com>

tonyluj pushed a commit to openanolis/sglang that referenced this pull request Dec 5, 2025

[model-gateway] Add e2e tests of streaming events and tool choice for…

674aa63

… response api (sgl-project#13880) Co-authored-by: Simo Lin <linsimo.mark@gmail.com>

yuchengz816-bot pushed a commit to yuchengz816-bot/sglang that referenced this pull request Dec 8, 2025

[model-gateway] Add e2e tests of streaming events and tool choice for…

819f8ec

… response api (sgl-project#13880) Co-authored-by: Simo Lin <linsimo.mark@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[model-gateway] Add e2e tests of streaming events and tool choice for response api#13880

[model-gateway] Add e2e tests of streaming events and tool choice for response api#13880
slin1237 merged 10 commits intosgl-project:mainfrom
XinyueZhang369:xinyue/response-api-e2e-tests

XinyueZhang369 commented Nov 25, 2025 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented Nov 25, 2025

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

key4ng left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

XinyueZhang369 commented Nov 27, 2025

Uh oh!

XinyueZhang369 commented Nov 27, 2025

Uh oh!

Uh oh!

key4ng commented Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

XinyueZhang369 commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

gemini-code-assist Bot commented Nov 25, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

key4ng left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

XinyueZhang369 commented Nov 27, 2025

Uh oh!

XinyueZhang369 commented Nov 27, 2025

Uh oh!

Uh oh!

key4ng commented Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

XinyueZhang369 commented Nov 25, 2025 •

edited

Loading