TY - JOUR
T1 - Self-Supervised Learning for Precise Pick-and-Place Without Object Model
AU - Berscheid, Lars
AU - Meissner, Pascal
AU - Kröger, Torsten
N1 - Acknowledgement: We would like to thank Tamim Asfour for his helpful suggestions and discussions.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - Flexible pick-and-place is a fundamental yet challenging task in robotics, in particular because defining even a simple target pose usually requires an object model. In this work, the robot instead learns to pick and place objects using planar manipulation according to a single demonstrated goal state. Our primary contribution lies in combining robot learning of manipulation primitives, commonly estimated by fully convolutional neural networks, with one-shot imitation learning. To this end, we define the place reward as a contrastive loss between real-world measurements and a task-specific noise distribution. Furthermore, we design our system to learn in a self-supervised manner, enabling real-world experiments with up to 25,000 pick-and-place actions. The trained robot places known objects with an average placement error of 2.7 (0.2) mm and 2.6 (0.8)°. Because our approach does not require an object model, the robot generalizes to unknown objects while keeping a precision of 5.9 (1.1) mm and 4.1 (1.2)°. We further show a range of emerging behaviors: the robot naturally learns to select the correct object in the presence of multiple object types, precisely inserts objects within a peg game, picks screws out of dense clutter, and infers multiple pick-and-place actions from a single goal state.
KW - reinforcement learning
KW - deep learning in grasping and manipulation
KW - imitation learning
UR - http://www.scopus.com/inward/record.url?scp=85089472027&partnerID=8YFLogxK
U2 - 10.1109/LRA.2020.3003865
DO - 10.1109/LRA.2020.3003865
M3 - Letter
VL - 5
SP - 4828
EP - 4835
JO - IEEE Robotics and Automation Letters
JF - IEEE Robotics and Automation Letters
SN - 2377-3766
IS - 3
ER -