𝗗𝗮𝘆-𝟯𝟯𝟯 𝗖𝗼𝗺𝗽𝘂𝘁𝗲𝗿 𝗩𝗶𝘀𝗶𝗼𝗻 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 Google Research Open-Sources ‘SAVi’: An Object-Centric Architecture That Extends The Slot Attention Mechanism To Videos Follow me for a similar post: 🇮🇳 Ashish Patel ------------------------------------------------------------------- 𝗜𝗻𝘁𝗲𝗿𝗲𝘀𝘁𝗶𝗻𝗴 𝗙𝗮𝗰𝘁𝘀 : 🔸 Paper: 𝗖𝗼𝗻𝗱𝗶𝘁𝗶𝗼𝗻𝗮𝗹 𝗢𝗯𝗷𝗲𝗰𝘁-𝗖𝗲𝗻𝘁𝗿𝗶𝗰 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 𝗳𝗿𝗼𝗺 𝗩𝗶𝗱𝗲𝗼 🔸 This paper is published arxiv2021. 🔸 Multiple distinct things act as compositional building blocks that can be processed independently and recombined in humans’ understanding of the world. The foundation for high-level cognitive abilities like language, causal reasoning, arithmetic, planning, and so on is a compositional model of the universe. Therefore, it’s essential for generalizing in predictable and systematic ways. Machine learning algorithms with object-centric representations have the potential to dramatically improve sampling efficiency, resilience, generalization to new problems, and interpretability. ------------------------------------------------------------------- 𝗜𝗠𝗣𝗢𝗥𝗧𝗔𝗡𝗖𝗘 🔹Unsupervised multi-object representation learning is widely used in various applications. These algorithms learn to separate and represent objects from the statistical structure of the data alone, without the requirement for supervision, by using object-centric inductive biases. Despite their promising outcomes, these approaches are currently constrained by two major issues: 1️⃣ They are limited to toy data such as moving 2D sprites or extremely rudimentary 3D scenes, and they struggle with more realistic data with complex textures. 2️⃣Both during training and inference, it is not clear how to interact with these models. The concept of an object is imprecise and task-dependent, and these models’ segmentation does not always correspond to the tasks of interest. 🔹To overcome the problem of unsupervised / weakly-supervised multi-object segmentation and tracking in video data, a new Google research introduces a sequential extension of Slot Attention called Slot Attention for Video (SAVi). ------------------------------------------------------------------- #computervision #artificialintelligence #innovation -------------------------------------------------------------------
Ford Credit•88K followers
4yAbsolutely gem of a work 👍👍