Implement `return_hidden_states` for the OpenAI API #6137

zhyncs merged 16 commits into sgl-project:main

Conversation
@zhyncs - let me know if you have any feedback! We'd like to get this feature merged.
@Qiaolin-Yu could you take a look please?
Sure. Very happy to help. |
thank you @Qiaolin-Yu @zhaochenyang20 @yinfan98 <3 |
will merge it, no need to rebase |
nice work @kyle-pena-kuzco |
@kyle-pena-kuzco May you help resolve the conflicts? Thanks. cc @CatherineSue @ispobock @Qiaolin-Yu |
Yes. Starting on that now. I'll comment when completed.
@zhyncs - I've fixed the merge conflicts. |
Motivation
The native API supports returning hidden states from the model. This PR implements the same feature for the OpenAI-compatible API.
Returning hidden states is important for model verification and diagnostics. Inference providers that use SGLang as a backend route requests in the OpenAI format, so if a provider would like to do internal diagnostics and verification, it is much more straightforward to simply include `return_hidden_states`.

This PR also fixes #5761 and adds appropriate test coverage for that bug.
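As a sketch of what an OpenAI-format request with the new flag might look like (the payload shape here is an assumption based on the PR description, not the exact schema; the model name is a placeholder):

```python
import json

# Hypothetical /v1/completions payload including the new flag; exactly
# where `return_hidden_states` sits in the request schema is an assumption.
payload = {
    "model": "my-model",           # placeholder model name
    "prompt": "Hello, world",
    "max_tokens": 8,
    "return_hidden_states": True,  # flag added by this PR
}

body = json.dumps(payload)
```

A provider could then inspect the `hidden_states` field on each choice in the response.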
Modifications
Changes were made to `protocol.py` to support the `return_hidden_states` flag, as well as to return `hidden_states` on `/v1/completions` and `/v1/chat/completions` for both streaming and non-streaming responses. If no hidden states are requested, the `hidden_states` property is omitted from the response instead of being included as a null field. That way, the responses are completely backwards compatible.
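The omit-when-absent behavior could look roughly like this (the function and field names are illustrative, not the actual adapter code):

```python
from typing import Any, Optional


def build_choice(text: str, hidden_states: Optional[list] = None) -> dict:
    """Illustrative sketch: attach `hidden_states` to a response choice only
    when the caller requested them, so clients that never ask for hidden
    states see an unchanged (backwards-compatible) response shape."""
    choice: dict = {"index": 0, "text": text}
    if hidden_states is not None:
        choice["hidden_states"] = hidden_states
    return choice
```

Serializing the dict only when the key is present avoids emitting a `"hidden_states": null` field for existing clients.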
The adapter was changed to include hidden states in responses when requested.
The dictionary `n_prev_tokens` was not being updated in `v1_chat_completions` for streaming responses, causing the same top logprobs to be repeated in every chunk. This has been fixed as well.

Checklist
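The streaming-logprobs fix mentioned under Modifications can be sketched as follows (names are illustrative; this is not the actual adapter code):

```python
def take_new_logprobs(all_top_logprobs: list, n_prev_tokens: dict, index: int) -> list:
    """Illustrative sketch of the fix: emit only the logprobs for tokens
    that have not been streamed yet, then advance the per-request counter.
    Without updating `n_prev_tokens`, every chunk would re-send the top
    logprobs from the start of the sequence."""
    start = n_prev_tokens.get(index, 0)
    new = all_top_logprobs[start:]
    n_prev_tokens[index] = len(all_top_logprobs)
    return new
```

Each streamed chunk then carries only the logprobs for its newly generated tokens.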