TY - JOUR
T1 - Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation
AU - Dai, Tianhong
AU - Arulkumaran, Kai
AU - Gerbert, Tamara
AU - Tukra, Samyakh
AU - Behbahani, Feryal
AU - Bharath, Anil Anthony
PY - 2022/7/7
Y1 - 2022/7/7
N2 - Deep reinforcement learning (DRL) has the potential to train robots to perform complex tasks in the real world without requiring accurate models of the robot or its environment. However, agents trained with these algorithms typically lack the explainability of more traditional control methods. In this work, we use a combination of out-of-distribution generalisation tests and post hoc interpretability methods in order to understand what strategies DRL-trained agents use to perform a reaching task. To do so, we train agents under different conditions, using comparison to better interpret both quantitative and qualitative results; this allows us to not only provide local explanations, but also broad categorisations of behaviour. A key aim of our work is to understand how agents trained with visual domain randomisation (DR)—a technique which allows agents to generalise from simulation-based training to the real world—differ from agents trained without. Our results show that the primary outcome of DR is more robust, entangled representations, accompanied by greater spatial structure in convolutional filters. Furthermore, even with an improved saliency method introduced in this work, we show that qualitative studies may not always correspond with quantitative measures, necessitating the combination of inspection tools in order to provide sufficient insights into the behaviour of trained agents. We conclude with recommendations for applying interpretability methods to DRL agents.
KW - Deep reinforcement learning
KW - Generalisation
KW - Interpretability
KW - Saliency
UR - http://dx.doi.org/10.1016/j.neucom.2022.04.005
DO - 10.1016/j.neucom.2022.04.005
M3 - Article
SN - 0925-2312
VL - 493
SP - 143
EP - 165
JO - Neurocomputing
JF - Neurocomputing
ER -