Multi-Part Object Representations via Graph Structures and Co-Part Discovery
By: Alex Foo, Wynne Hsu, Mong Li Lee
Discovering object-centric representations from images can significantly enhance the robustness, sample efficiency and generalizability of vision models. Works on images with multi-part objects typically follow an implicit object representation approach, which fail to recognize these learned objects in occluded or out-of-distribution contexts. This is due to the assumption that object part-whole relations are implicitly encoded into the representations through indirect training objectives. We address this limitation by proposing a novel method that leverages on explicit graph representations for parts and present a co-part object discovery algorithm. We then introduce three benchmarks to evaluate the robustness of object-centric methods in recognizing multi-part objects within occluded and out-of-distribution settings. Experimental results on simulated, realistic, and real-world images show marked improvements in the quality of discovered objects compared to state-of-the-art methods, as well as the accurate recognition of multi-part objects in occluded and out-of-distribution contexts. We also show that the discovered object-centric representations can more accurately predict key object properties in a downstream task, highlighting the potential of our method to advance the field of object-centric representations.
Similar Papers
Disentangled Object-Centric Image Representation for Robotic Manipulation
CV and Pattern Recognition
Robots learn to grab things better, even with many objects.
Object-Centric Data Synthesis for Category-level Object Detection
CV and Pattern Recognition
Teaches computers to spot new things with less data.
UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents
CV and Pattern Recognition
Builds 3D objects from parts, like building blocks.