
Add CFG to vllm serving #517

Merged
rlouf merged 1 commit into dottxt-ai:main from mory91:vllm-cfg on Jan 12, 2024

Conversation

mory91 (Contributor) commented Jan 10, 2024

Hi,
This pull request adds support for CFG (context-free grammar guided generation) in vLLM serving.
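
For context, a request against the serving endpoint might look like the sketch below. The /generate route follows outlines' existing serve script; the "cfg" field name and the toy Lark grammar are assumptions made for illustration, not details confirmed in this PR.

import requests

# Hypothetical request: ask the server to constrain generation with a grammar.
# The "cfg" key is an assumed name for the field this PR introduces.
arithmetic_grammar = """
?start: expression
?expression: NUMBER (("+" | "-") NUMBER)*
%import common.NUMBER
"""

response = requests.post(
    "http://127.0.0.1:8000/generate",
    json={"prompt": "Write an addition of two numbers: ", "cfg": arithmetic_grammar},
)
print(response.json())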

@rlouf rlouf linked an issue Jan 10, 2024 that may be closed by this pull request
@rlouf rlouf added the structured generation (Linked to structured generation) and vLLM (Things involving vLLM support) labels Jan 10, 2024
Review thread on outlines/serve/serve.py (outdated):
# Sets default for the model (`facebook/opt-125m`)
engine = AsyncLLMEngine.from_engine_args(engine_args)

_adapt_tokenizer(engine.engine.tokenizer)
rlouf (Member)

Why are you calling this function here? The result is not used.

mory91 (Contributor, Author)

The tokenizer is modified in place inside the function anyway. I've now assigned the result back to the tokenizer, though.
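
For readers following the thread, here is a rough sketch of what an in-place adapter like _adapt_tokenizer does. The attribute names (vocabulary, special_tokens, convert_token_to_string) match the tokenizer interface outlines' FSMs expect, but the body below is an illustration, not the exact code under review.

def _adapt_tokenizer(tokenizer):
    # The tokenizer is mutated in place; returning it is only a convenience,
    # which is why a caller does not strictly need to use the result.
    tokenizer.vocabulary = tokenizer.get_vocab()
    tokenizer.special_tokens = set(tokenizer.all_special_tokens)

    def convert_token_to_string(token):
        # Map a raw vocabulary token back to its surface string.
        return tokenizer.convert_tokens_to_string([token])

    tokenizer.convert_token_to_string = convert_token_to_string
    return tokenizer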

rlouf (Member)

Ah, that makes sense. It's not needed, however, as vLLM handles tokenisation on its end during encoding/decoding.

mory91 (Contributor, Author) commented Jan 12, 2024

It is needed: here, https://github.com/outlines-dev/outlines/blob/fde61a80a58de0401fdecdee7408db53e17ca4f4/outlines/fsm/fsm.py#L345, outlines expects the tokenizer to return a list, but vLLM's tokenizer returns a string.
I also just realized that this introduces a breaking change to the library. If that is acceptable to the project maintainers, we can keep it as is; if not, we need a different approach. For example, we could call _adapt_tokenizer inside the __init__ methods of the logits processors.
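
A minimal sketch of that alternative, assuming a CFG logits processor along the lines of outlines' existing regex processor: adapt a copy of the tokenizer inside __init__, and wrap decode so it returns a list as the CFG FSM expects, instead of changing the engine's tokenizer globally. The class name, constructor signature, and CFGFSM usage below are illustrative assumptions, not the code that was merged.

from copy import copy

from outlines.fsm.fsm import CFGFSM

class CFGLogitsProcessor:
    def __init__(self, cfg_string, tokenizer):
        # Adapt a copy so the engine's own tokenizer keeps its original decode,
        # avoiding the breaking change mentioned above.
        adapted = _adapt_tokenizer(copy(tokenizer))

        original_decode = adapted.decode

        def decode_to_list(token_ids):
            # outlines' CFG FSM expects a list of strings here, while vLLM's
            # tokenizer returns a single string.
            return [original_decode(token_ids)]

        adapted.decode = decode_to_list
        self.fsm = CFGFSM(cfg_string, adapted)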

@mory91 mory91 force-pushed the vllm-cfg branch 2 times, most recently from 012a0e9 to 281c0af on January 11, 2024 at 23:20
rlouf (Member) commented Jan 12, 2024

Thank you for your contributions! I added some documentation before merging.


Labels

structured generation (Linked to structured generation), vLLM (Things involving vLLM support)

Development

Successfully merging this pull request may close these issues.

Add CFG guided generation to vLLM integration
