Conversation
LGTM
btw, I have one small question: why switch max_num_tokens from 1 to 0?
Did you mean switch from 0 to 1? This is because our default logic for generation models assumes the model will generate 1 token after prefill. Although embedding models do not generate tokens, I added one dummy token to satisfy the shared part of the code.
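As a rough sketch of the dummy-token workaround described above (all names here are hypothetical and simplified, not the project's actual code):

```python
from dataclasses import dataclass, field

@dataclass
class Request:
    prompt_ids: list[int]
    max_num_tokens: int = 1      # shared code assumes >= 1 generated token
    output_ids: list[int] = field(default_factory=list)

def run_prefill(req: Request, is_embedding: bool) -> None:
    # ... model forward pass over prompt_ids would happen here ...
    if is_embedding:
        # Embedding models generate nothing, but appending one dummy token
        # lets the shared finish-check below work unchanged.
        req.output_ids.append(0)
    else:
        req.output_ids.append(42)  # stand-in for the first sampled token

def is_finished(req: Request) -> bool:
    # Shared logic: a request finishes once max_num_tokens are produced.
    return len(req.output_ids) >= req.max_num_tokens

req = Request(prompt_ids=[1, 2, 3])
run_prefill(req, is_embedding=True)
assert is_finished(req)  # the dummy token lets shared logic mark it done
```

With max_num_tokens=0 the finish-check would pass before prefill even ran, which is why keeping it at 1 with a dummy token is simpler than special-casing embedding requests everywhere.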
I see, thanks for answering
This is a follow-up PR for
For the embedding API, input as a list will be covered in the next PR.