Training People to Reward Robots
By: Endong Sun, Yuqing Zhu, Matthew Howard
Potential Business Impact:
Trains people to give robots better demonstrations to learn from.
Learning from demonstration (LfD) is a technique that allows expert teachers to teach task-oriented skills to robotic systems. However, how best to guide novice teachers so that their demonstrations approach expert level, in quantitative terms, for a specific teaching task remains an open question. To this end, this paper investigates the use of machine teaching (MT) to guide novice teachers in improving their teaching skills, based on reinforcement learning from demonstration (RLfD). The paper reports an experiment in which novices receive MT-derived guidance to train their ability to teach a given motor skill with only 8 demonstrations and to generalise this ability to previously unseen skills. Results indicate that the MT guidance not only enhances robot learning performance by 89% on the training skill but also yields a 70% improvement in robot learning performance on skills not seen by subjects during training. These findings highlight the effectiveness of MT guidance in upskilling human teaching behaviours, ultimately improving demonstration quality in RLfD.
Similar Papers
Active Robot Curriculum Learning from Online Human Demonstrations
Robotics
Teaches robots better by asking for help smartly.
Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach
Robotics
Robots learn new jobs faster with live help.
Learning and generalization of robotic dual-arm manipulation of boxes from demonstrations via Gaussian Mixture Models (GMMs)
Robotics
Robots learn new tasks from few examples.