DebuggerCafe

Machine Learning and Deep Learning


JEPA Series Part-3: Image Classification using I-JEPA

Sovit Ranjan Rath | August 25, 2025 | 0 Comments

In this article, we use transfer learning with the I-JEPA model for image classification. We add a simple linear classifier layer on top of the frozen backbone. ...


JEPA Series Part 2: Image Similarity with I-JEPA

Sovit Ranjan Rath | August 18, 2025 | 0 Comments

In this article, we use a pretrained I-JEPA model for image similarity. We specifically use the ViT-H I-JEPA trained with 14x14 patches. ...


JEPA Series Part 1: Introduction to I-JEPA

Sovit Ranjan Rath | August 11, 2025 | 2 Comments

In this article, we introduce I-JEPA. We start with what I-JEPA is and why we need it, then cover its architecture, evaluation results, and comparison with other similar methods. ...


Video Summarizer Using Qwen2.5-Omni

Sovit Ranjan Rath | August 4, 2025 | 0 Comments

In this article, we build a simple video summarizer application using the Qwen2.5-Omni 3B model, with a UI powered by Gradio. ...


Introduction to BAGEL: An Unified Multimodal Model

Sovit Ranjan Rath | July 28, 2025 | 0 Comments

In this article, we introduce BAGEL, a unified multimodal model for image generation, image editing, and free-form image manipulation with non-thinking and thinking capabilities. ...



