DebuggerCafe

Machine Learning and Deep Learning


Image to 3D Mesh Generation with Detection Grounding

Sovit Ranjan Rath January 12, 2026 2 Comments

In this article, we create a simple yet robust pipeline for image-to-3D mesh generation with detection grounding using the Qwen3-VL, BiRefNet, and Hunyuan3D 2.0 models. ...


Grounding Qwen3-VL Detection with SAM2

Sovit Ranjan Rath January 5, 2026 0 Comments

In this article, we ground the Qwen3-VL object detection capabilities with SAM2 segmentation. The pipeline uses Qwen3-VL to detect objects from natural language prompts, and the detected coordinates are then fed to the SAM2 model for segmentation. ...


Fine-Tuning Qwen3-VL

Sovit Ranjan Rath December 29, 2025 0 Comments

In this article, we fine-tune the Qwen3-VL 2B model for sketch-to-HTML and image-to-HTML generation. After fine-tuning, we will be able to feed an image of a website to the model and get the HTML code for it. ...


Creating a Sketch to HTML Application with Qwen3-VL

Sovit Ranjan Rath December 22, 2025 0 Comments

In this article, we create a simple sketch-to-HTML application using Qwen3-VL. Users upload an image or screenshot of a prospective website, and the Qwen3-VL model returns the HTML for it. ...


Introduction to Qwen3-VL

Sovit Ranjan Rath December 15, 2025 2 Comments

In this article, we explore the Qwen3-VL model, the latest iteration of the Qwen-VL series. We start with the model architecture and benchmarks, then move to hands-on inference for object detection, OCR, video understanding, and sketch-to-HTML using Qwen3-VL. ...



Recent Posts

  • Multi-Turn Tool Call with gpt-oss-chat
  • RAG Tool Call for gpt-oss-chat
  • Web Search Tool with Streaming in gpt-oss-chat
  • gpt-oss-chat Local RAG and Web Search
  • SAM 3 UI – Image, Video, and Multi-Object Inference
