Active Semantic Perception
By: Huayi Tang, Pratik Chaudhari
Potential Business Impact:
Helps robots understand and explore new places.
We develop an approach for active semantic perception which refers to using the semantics of the scene for tasks such as exploration. We build a compact, hierarchical multi-layer scene graph that can represent large, complex indoor environments at various levels of abstraction, e.g., nodes corresponding to rooms, objects, walls, windows etc. as well as fine-grained details of their geometry. We develop a procedure based on large language models (LLMs) to sample plausible scene graphs of unobserved regions that are consistent with partial observations of the scene. These samples are used to compute an information gain of a potential waypoint for sophisticated spatial reasoning, e.g., the two doors in the living room can lead to either a kitchen or a bedroom. We evaluate this approach in complex, realistic 3D indoor environments in simulation. We show using qualitative and quantitative experiments that our approach can pin down the semantics of the environment quicker and more accurately than baseline approaches.
Similar Papers
Understanding while Exploring: Semantics-driven Active Mapping
Robotics
Robots learn to explore better by choosing what to see.
Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs
Robotics
Helps robots understand buildings to move around better.
DSM: Building A Diverse Semantic Map for 3D Visual Grounding
CV and Pattern Recognition
Helps robots understand and interact with their surroundings.