TY - CHAP
T1 - Learning probabilistic decision making by a service robot with generalization of user demonstrations and interactive refinement
AU - Schmidt-Rohr, Sven R.
AU - Romahn, Fabian
AU - Meissner, Pascal
AU - Jäkel, Rainer
AU - Dillmann, Rüdiger
PY - 2013
Y1 - 2013
N2 - When learning abstract probabilistic decision making models for multi-modal service robots from human demonstrations, alternative courses of events may be missed by human teachers during demonstrations. We present an active model space exploration approach with generalization of observed action effect knowledge leading to interactive requests of new demonstrations to verify generalizations.At first, the robot observes several user demonstrations of interacting humans, including dialog, object poses and human body movement. Discretization and analysis then lead to a symbolic-causal model of a demonstrated task in the form of a preliminary Partially observable Markov decision process. Based on the transition model generated from demonstrations, new hypotheses of unobserved action effects, generalized transitions, can be derived along with a generalization confidence estimate. To validate generalized transitions which have a strong impact on a decision policy, a request generator proposes further demonstrations to human teachers, used in turn to implicitly verify hypotheses.The system has been evaluated on a multi-modal service robot with realistic tasks, including furniture manipulation and execution-time interacting humans.
AB - When learning abstract probabilistic decision making models for multi-modal service robots from human demonstrations, alternative courses of events may be missed by human teachers during demonstrations. We present an active model space exploration approach with generalization of observed action effect knowledge leading to interactive requests of new demonstrations to verify generalizations.At first, the robot observes several user demonstrations of interacting humans, including dialog, object poses and human body movement. Discretization and analysis then lead to a symbolic-causal model of a demonstrated task in the form of a preliminary Partially observable Markov decision process. Based on the transition model generated from demonstrations, new hypotheses of unobserved action effects, generalized transitions, can be derived along with a generalization confidence estimate. To validate generalized transitions which have a strong impact on a decision policy, a request generator proposes further demonstrations to human teachers, used in turn to implicitly verify hypotheses.The system has been evaluated on a multi-modal service robot with realistic tasks, including furniture manipulation and execution-time interacting humans.
UR - http://www.scopus.com/inward/record.url?scp=84893045871&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-35485-4_26
DO - 10.1007/978-3-642-35485-4_26
M3 - Chapter
SN - 9783642354847
VL - 466
T3 - Advances in Intelligent Systems and Computing
SP - 309
EP - 322
BT - Frontiers of Intelligent Autonomous Systems
A2 - Lee, S
A2 - Yoon, KJ
A2 - Lee, J.
PB - Springer-Verlag Berlin Heidelberg
T2 - 12th International Conference of Intelligent Autonomous Systems
Y2 - 26 June 2012 through 29 June 2012
ER -