Enhancing Video-Based Robot Failure Detection Using Task Knowledge
By: Santosh Thoduka , Sebastian Houben , Juergen Gall and more
Potential Business Impact:
Helps robots know when they mess up.
Robust robotic task execution hinges on the reliable detection of execution failures in order to trigger safe operation modes, recovery strategies, or task replanning. However, many failure detection methods struggle to provide meaningful performance when applied to a variety of real-world scenarios. In this paper, we propose a video-based failure detection approach that uses spatio-temporal knowledge in the form of the actions the robot performs and task-relevant objects within the field of view. Both pieces of information are available in most robotic scenarios and can thus be readily obtained. We demonstrate the effectiveness of our approach on three datasets that we amend, in part, with additional annotations of the aforementioned task-relevant knowledge. In light of the results, we also propose a data augmentation method that improves performance by applying variable frame rates to different parts of the video. We observe an improvement from 77.9 to 80.0 in F1 score on the ARMBench dataset without additional computational expense and an additional increase to 81.4 with test-time augmentation. The results emphasize the importance of spatio-temporal information during failure detection and suggest further investigation of suitable heuristics in future implementations. Code and annotations are available.
Similar Papers
Reliable Robotic Task Execution in the Face of Anomalies
Robotics
Robot learns to fix its own mistakes.
KRAST: Knowledge-Augmented Robotic Action Recognition with Structured Text for Vision-Language Models
CV and Pattern Recognition
Helps robots see and understand what people do.
Real-Time Detection of Robot Failures Using Gaze Dynamics in Collaborative Tasks
Human-Computer Interaction
Watches your eyes to spot robot mistakes.