Towards Open World Detection: A Survey
By: Andrei-Stefan Bulzan, Cosmin Cernazanu-Glavan
Potential Business Impact:
Lets computers see and understand anything.
For decades, Computer Vision has aimed at enabling machines to perceive the external world. Initial limitations led to the development of highly specialized niches. As success in each task accrued and research progressed, increasingly complex perception tasks emerged. This survey charts the convergence of these tasks and, in doing so, introduces Open World Detection (OWD), an umbrella term we propose to unify class-agnostic and generally applicable detection models in the vision domain. We start from the history of foundational vision subdomains and cover key concepts, methodologies and datasets making up today's state-of-the-art landscape. The survey spans topics from early saliency detection, foreground/background separation, and out-of-distribution detection through to open world object detection, zero-shot detection and Vision Large Language Models (VLLMs). We explore the overlap between these subdomains, their increasing convergence, and their potential to unify in the future into a single domain: perception.