[Bug] Server crashes when processing image requests with Qwen2.5-VL-7B-Instruct

### Checklist

- [x] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
- [x] 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
- [x] 4. If the issue you raised is not a bug but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- [x] 5. Please use English, otherwise it will be closed.

### Describe the bug

If I run the Qwen2.5-VL-7B-Instruct model with dynamic quantization using SGLang (--quantization w8a8_fp8 or fp8), the server crashes with an error when receiving a request containing an image.

Logs:

![Image](https://github.com/user-attachments/assets/83ea7506-efb5-4c31-a83c-76809edfa521)

![Image](https://github.com/user-attachments/assets/508be119-6b3f-44a3-9ea0-27e4cef61812)

### Reproduction

![Image](https://github.com/user-attachments/assets/068ebf9c-fec8-45e2-8dd5-acfdf04ea0ea)

### Environment

sglang docker v.0.4.6.post5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Server crashes when processing image requests with Qwen2.5-VL-7B-Instruct #6828

Checklist

Describe the bug

Reproduction

Environment

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Bug] Server crashes when processing image requests with Qwen2.5-VL-7B-Instruct #6828

Description

Checklist

Describe the bug

Reproduction

Environment

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions