Add video processing utils to vision_utils by mmathew23 · Pull Request #279 · unslothai/unsloth-zoo

mmathew23 · 2025-09-13T03:47:24Z

This is a cleaned up and merged version of #240.

This adds video processing utilities for VLM finetuning.

It's based on the qwen-vl-utils package from the Qwen2.5-VL repo: https://github.com/QwenLM/Qwen2.5-VL/tree/main/qwen-vl-utils
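
For readers unfamiliar with what video preprocessing for a VLM involves, here is a rough sketch of one common step: picking an evenly spaced subset of frames at a target sampling rate. It is an illustration only and is not taken from this PR or from qwen-vl-utils. The function name `sample_frame_indices`, its parameters, and the default values are all hypothetical.

```python
from typing import List


def sample_frame_indices(
    total_frames: int,
    video_fps: float,
    target_fps: float = 2.0,   # hypothetical default sampling rate
    min_frames: int = 4,       # hypothetical lower bound on sampled frames
    max_frames: int = 768,     # hypothetical upper bound on sampled frames
    frame_factor: int = 2,     # hypothetical: keep the count a multiple of this
) -> List[int]:
    """Return evenly spaced frame indices for a video (illustrative sketch)."""
    if total_frames <= 0 or video_fps <= 0 or target_fps <= 0:
        raise ValueError("total_frames, video_fps and target_fps must be positive")

    # Number of frames we would get by sampling the clip at target_fps.
    wanted = total_frames / video_fps * target_fps

    # Clamp to [min_frames, max_frames], never exceeding what the video has.
    upper = min(max_frames, total_frames)
    wanted = max(min_frames, min(wanted, upper))

    # Round down to a multiple of frame_factor, then cap at total_frames.
    n = max(frame_factor, int(wanted) // frame_factor * frame_factor)
    n = min(n, total_frames)

    if n == 1:
        return [0]

    # Spread n indices evenly from the first frame to the last frame.
    step = (total_frames - 1) / (n - 1)
    return [round(i * step) for i in range(n)]


if __name__ == "__main__":
    # A 10 second clip at 30 fps, sampled at 2 fps.
    print(sample_frame_indices(total_frames=300, video_fps=30.0))
```

The returned indices would then be passed to whatever video reader is in use to decode only those frames before resizing and handing them to the processor.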

I tested all the Vision notebooks to confirm everything works. A video finetuning notebook is on the way.

Gemma3: https://colab.research.google.com/drive/1gzVgFvFou6dTE9UvMZR5DEiQTzqc3A4S?usp=sharing
Llama Vision: https://colab.research.google.com/drive/1E2xCsbh7-raFYMtkbFOwgvcfIyn64zLL?usp=sharing
Pixtral: https://colab.research.google.com/drive/1meeeZnE-IlV9IRuEUFqgbUNFpwxtrN5g?usp=sharing
Qwen 2.5VL: https://colab.research.google.com/drive/1RrRUnHBLMPSWfzHoc853iij4O4Yly7fr?usp=sharing

*Qwen 2.5VL was tested with transformers 4.56.1.

This is a cleaned up and merged version of unslothai#240. This adds video processing utilities for VLM finetuning. It's based on qwen-vl-utils repo https://github.com/QwenLM/Qwen2.5-VL/tree/main/qwen-vl-utils

Co-authored-by: autinn <au-yeung@uni.minerva.edu>
Co-authored-by: Neenu Antony <Neenu.antony@sjsu.edu>
Co-authored-by: Suchith Gali <sgali@ucmerced.edu>

madhav1ag · 2025-10-14T16:31:20Z

@mmathew23 Thanks for the video support code. May I know when we can expect video finetuning notebooks for Qwen2.5-VL?

mmathew23 mentioned this pull request Sep 13, 2025

feat: Added Video inference feature into unsloth_zoo/vision_utils.py from qwen-vl-utils #240

Closed

danielhanchen merged commit 627895b into unslothai:main Sep 30, 2025

danielhanchen merged 1 commit into unslothai:main from mmathew23:video-inference-feature
