TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive Midpoint Adaptations in Cartesian Space

Jonas C. Kiemel, Pascal Meißner, Torsten Kröger

Research output: Contribution to journalConference articlepeer-review

1 Citation (Scopus)

Abstract

We present TrueRMA, a data-efficient, model-free method to learn cost-optimized robot trajectories over a wide range of starting points and endpoints. The key idea is to calculate trajectory waypoints in Cartesian space by recursively predicting orthogonal adaptations relative to the midpoints of straight lines. We generate a differentiable path by adding circular blends around the waypoints, calculate the corresponding joint positions with an inverse kinematics solver and calculate a time-optimal parameterization considering velocity and acceleration limits. During training, the trajectory is executed in a physics simulator and costs are assigned according to a user-specified cost function which is not required to be differentiable. Given a starting point and an endpoint as input, a neural network is trained to predict midpoint adaptations that minimize the cost of the resulting trajectory via reinforcement learning. We successfully train a KUKA iiwa robot to keep a ball on a plate while moving between specified points and compare the performance of TrueRMA against two baselines. The results show that our method requires less training data to learn the task while generating shorter and faster trajectories.
Original languageEnglish
Pages (from-to)4225-4231
Number of pages7
JournalProceedings - IEEE International Conference on Robotics and Automation
Volume2020
Early online date15 Sep 2020
DOIs
Publication statusPublished - 15 Sep 2020
EventIEEE International Conference on Robotics and Automation -
Duration: 31 May 202031 Aug 2020
https://www.icra2020.org/

Keywords

  • trajectory
  • robots
  • kinematics
  • task analysis
  • training
  • neural networks

Fingerprint Dive into the research topics of 'TrueRMA: Learning Fast and Smooth Robot Trajectories with Recursive Midpoint Adaptations in Cartesian Space'. Together they form a unique fingerprint.

Cite this