ASkDAgger: Active Skill-level Data Aggregation for Interactive Imitation Learning
By: Jelle Luijkx , Zlatan Ajanović , Laura Ferranti and more
Potential Business Impact:
Teaches robots faster by learning from their mistakes.
Human teaching effort is a significant bottleneck for the broader applicability of interactive imitation learning. To reduce the number of required queries, existing methods employ active learning to query the human teacher only in uncertain, risky, or novel situations. However, during these queries, the novice's planned actions are not utilized despite containing valuable information, such as the novice's capabilities, as well as corresponding uncertainty levels. To this end, we allow the novice to say: "I plan to do this, but I am uncertain." We introduce the Active Skill-level Data Aggregation (ASkDAgger) framework, which leverages teacher feedback on the novice plan in three key ways: (1) S-Aware Gating (SAG): Adjusts the gating threshold to track sensitivity, specificity, or a minimum success rate; (2) Foresight Interactive Experience Replay (FIER), which recasts valid and relabeled novice action plans into demonstrations; and (3) Prioritized Interactive Experience Replay (PIER), which prioritizes replay based on uncertainty, novice success, and demonstration age. Together, these components balance query frequency with failure incidence, reduce the number of required demonstration annotations, improve generalization, and speed up adaptation to changing domains. We validate the effectiveness of ASkDAgger through language-conditioned manipulation tasks in both simulation and real-world environments. Code, data, and videos are available at https://askdagger.github.io.
Similar Papers
TubeDAgger: Reducing the Number of Expert Interventions with Stochastic Reach-Tubes
Systems and Control
Teaches robots to learn from mistakes better.
CubeDAgger: Improved Robustness of Interactive Imitation Learning without Violation of Dynamic Stability
Robotics
Teaches robots to move smoothly and safely.
Active Query Selection for Crowd-Based Reinforcement Learning
Machine Learning (CS)
Teaches robots to learn faster from people.