extproc: properly stream chat completions by mathetake · Pull Request #468 · envoyproxy/ai-gateway

mathetake · 2025-03-07T19:49:13Z

Commit Message

This fixes a bug in the extproc when handling stream=true requests. Previously, mode_override was set at the request body handling phase, and it was not set in the response headers phase. That resulted in buffering the entire response body which is clearly not ideal as from clients point of view, they will receive the entire streaming vs line by line. This refactors around the mode override and properly handle it.

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

mathetake · 2025-03-07T19:51:45Z

internal/extproc/chatcompletion_processor.go

this is where the actual change took place: Instead of setting mode override in ProcessRequestBody, this will instead set it in ProcessResponseHeaders

aabchoo

overall looks good to me

aabchoo · 2025-03-07T20:43:18Z

internal/extproc/translator/translator.go

 		tokenUsage LLMTokenUsage,
 		err error,
 	)
-


Why remove this?

Cause it's not used :)

mathetake · 2025-03-07T21:05:15Z

i am trying to write a regression test that actually verifies the streaming behavior

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

mathetake · 2025-03-07T21:31:50Z

ok the regression test added (verified it's failing on the current main)

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

**Commit Message** This removes the low log levels accidentally enabled in #468 **Related Issues/PRs (if applicable)** Follow up on #468 Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

**Commit Message** This fixes a bug in the extproc when handling stream=true requests. Previously, mode_override was set at the request body handling phase, and it was not set in the response headers phase. That resulted in buffering the entire response body which is clearly not ideal as from clients point of view, they will receive the entire streaming vs line by line. This refactors around the mode override and properly handle it. --------- Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

**Commit Message** PR to backport `mockChatCompletionMetrics`, chat completion stream fix, and openai content type. Including: - #459 (468 uses mock components introduced here) - #468 - #486 --------- Signed-off-by: Huamin Chen <hchen@redhat.com> Signed-off-by: Ignasi Barrera <ignasi@tetrate.io> Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com> Signed-off-by: Aaron Choo <achoo30@bloomberg.net> Co-authored-by: Ignasi Barrera <ignasi@tetrate.io> Co-authored-by: Takeshi Yoneda <t.y.mathetake@gmail.com> Co-authored-by: Dan Sun <dsun20@bloomberg.net>

extproc: properly stream chat completions

546b066

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

mathetake marked this pull request as ready for review March 7, 2025 19:49

mathetake requested a review from a team as a code owner March 7, 2025 19:49

mathetake commented Mar 7, 2025

View reviewed changes

aabchoo approved these changes Mar 7, 2025

View reviewed changes

mathetake added 2 commits March 7, 2025 13:29

regression

a7bb15e

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

regression

52b6282

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

mathetake added 2 commits March 7, 2025 13:32

unnecessary

125fa37

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

more

1c5c408

Signed-off-by: Takeshi Yoneda <t.y.mathetake@gmail.com>

mathetake merged commit b7c658a into main Mar 7, 2025
15 checks passed

mathetake deleted the streamingcorrectly branch March 7, 2025 21:38

mathetake mentioned this pull request Mar 7, 2025

test: removes unnecessary logs #469

Merged

aabchoo mentioned this pull request Mar 14, 2025

backport: completion stream + metrics and assistant content #497

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extproc: properly stream chat completions#468

extproc: properly stream chat completions#468
mathetake merged 5 commits intomainfrom
streamingcorrectly

mathetake commented Mar 7, 2025

Uh oh!

mathetake Mar 7, 2025

Uh oh!

aabchoo left a comment

Uh oh!

aabchoo Mar 7, 2025

Uh oh!

mathetake Mar 7, 2025

Uh oh!

mathetake commented Mar 7, 2025 •

edited

Loading

Uh oh!

mathetake commented Mar 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mathetake commented Mar 7, 2025

Uh oh!

mathetake Mar 7, 2025

Choose a reason for hiding this comment

Uh oh!

aabchoo left a comment

Choose a reason for hiding this comment

Uh oh!

aabchoo Mar 7, 2025

Choose a reason for hiding this comment

Uh oh!

mathetake Mar 7, 2025

Choose a reason for hiding this comment

Uh oh!

mathetake commented Mar 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mathetake commented Mar 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mathetake commented Mar 7, 2025 •

edited

Loading