Get token streaming working #2

@simonw

This proved a bit tricky, because the MLC library is built around a callback mechanism:

from mlc_chat import ChatModule
from mlc_chat.callback import StreamToStdout

cm = ChatModule(model="Llama-2-7b-chat-hf-q4f16_1")
cm.generate(
    prompt="A poem about a bunny eating lunch",
    progress_callback=StreamToStdout(callback_interval=1),
)

But... LLM expects to be able to do something like this:

for chunk in cm.generate(...):
    yield chunk
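
One possible way to bridge the two styles (a rough sketch, not necessarily how the plugin should do it): run cm.generate() in a background thread and have a custom callback push each new chunk onto a queue.Queue that a generator drains. The callback interface below is an assumption modelled on StreamToStdout: a callable with a callback_interval attribute that MLC invokes with each new piece of text, and with stopped=True when generation finishes. The real DeltaCallback contract may differ.

import queue
import threading

from mlc_chat import ChatModule

cm = ChatModule(model="Llama-2-7b-chat-hf-q4f16_1")


class StreamToQueue:
    # Assumed callback shape, modelled on StreamToStdout: MLC calls this
    # object with each new delta of text, and with stopped=True at the end.
    def __init__(self, chunks, callback_interval=1):
        self.chunks = chunks
        self.callback_interval = callback_interval

    def __call__(self, message="", stopped=False):
        if stopped:
            self.chunks.put(None)  # sentinel: generation finished
        else:
            self.chunks.put(message)


def stream_generate(prompt):
    chunks = queue.Queue()

    def run():
        cm.generate(prompt=prompt, progress_callback=StreamToQueue(chunks))
        chunks.put(None)  # in case stopped=True is never delivered

    threading.Thread(target=run, daemon=True).start()

    while True:
        chunk = chunks.get()
        if chunk is None:
            break
        yield chunk


for chunk in stream_generate("A poem about a bunny eating lunch"):
    print(chunk, end="", flush=True)

If something along those lines works, the plugin's streaming code could then just yield from stream_generate(prompt).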

Labels: enhancement, help wanted
