DebuggerCafe - Deep Learning, Machine Learning, Artificial Intelligence

Image to 3D Mesh Generation with Detection Grounding

In this article, we create a simple yet robust pipeline for image to 3D mesh generation with detection grounding using the Qwen3-VL, BiRefNet, and Hunyuan3D 2.0 models. ...

Grounding Qwen3-VL Detection with SAM2

Sovit Ranjan Rath January 5, 2026 0 Comment

In this article, we are grounding the Qwen3-VL object detection capabilities with SAM2 segmentation. The pipeline uses Qwen3-VL to detect objects from a natural language prompt, and the resulting coordinates are then fed to the SAM2 model for segmentation. ...

Fine-Tuning Qwen3-VL

Sovit Ranjan Rath December 29, 2025 0 Comment

In this article, we are fine-tuning the Qwen3-VL 2B model for sketch and image to HTML. After fine-tuning, we will be able to feed an image of a website to the model and get the HTML code for it. ...

Creating a Sketch to HTML Application with Qwen3-VL

Sovit Ranjan Rath December 22, 2025 0 Comment

In this article, we explore creating a simple sketch to HTML application using Qwen3-VL, where users can upload an image or screenshot of a potential website and the model returns the HTML for it. ...

Introduction to Qwen3-VL

Sovit Ranjan Rath December 15, 2025 2 Comments

In this article, we explore the Qwen3-VL model, the latest iteration of the Qwen-VL series. We start with model architecture and benchmarks, and then move to hands-on inference for object detection, OCR, video understanding, and sketch-to-HTML using Qwen3-VL. ...