An Unsupervised Autonomous Learning Framework for Goal-directed Behaviours in Dynamic Contexts

Chinedu Ezenkwu; Andrew Starkey

doi:10.1007/s43674-022-00037-9

An Unsupervised Autonomous Learning Framework for Goal-directed Behaviours in Dynamic Contexts

Chinedu Ezenkwu^* (Corresponding Author), Andrew Starkey

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

10 Downloads (Pure)

Abstract

Due to their dependence on a task-specific reward function, reinforcement learning agents are ineffective at responding to a dynamic goal or environment. This paper seeks to overcome this limitation of traditional reinforcement learning through a task-agnostic, self-organising autonomous agent framework. The proposed algorithm is a hybrid of TMGWR for self-adaptive learning of sensorimotor maps and value iteration for goal directed planning. TMGWR has been previously demonstrated to overcome the problems associated with competing sensorimotor techniques such SOM, GNG, and GWR; these problems include: difficulty in setting a suitable number of neurons for a task, inflexibility, the inability to cope with non-markovian environments, challenges with noise, and inappropriate representation of sensory observations and actions together.
However, the binary sensorimotor-link implementation in the original TMGWR enables catastrophic forgetting when the agent experiences changes in the task and it is therefore not suitable for self-adaptive learning. A new sensorimotor-link update rule is presented in this paper to enable the adaptation of the sensorimotor map to new experiences. This paper has demonstrated that the TMGWR-based algorithm has better sample efficiency than model-free reinforcement learning and better self-adaptivity than both the model free and the traditional model-based reinforcement learning algorithms. Moreover, the
algorithm has been demonstrated to give the lowest overall computational cost when compared to traditional reinforcement learning algorithms.

Original language	English
Article number	26
Number of pages	14
Journal	Advances in Computational Intelligence
Volume	2
DOIs	https://doi.org/10.1007/s43674-022-00037-9
Publication status	Published - 2 Jun 2022

Bibliographical note

This work is funded by the Tertiary Education Trust Fund (TETFund) scheme of
the Federal Republic of Nigeria.

Data Availability Statement

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Keywords

autonomous agent
planning
unsupervised learning
sensorimotor
Artificial Intelligence

Access to Document

10.1007/s43674-022-00037-9Licence: CC BY

Ezenkwu_etal_ACI_Unsupervised_autonomous_learning_VOR
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Final published version, 1.54 MBLicence: CC BY

Cite this

@article{af7ace251b804ae78760e1cc4702322a,

title = "An Unsupervised Autonomous Learning Framework for Goal-directed Behaviours in Dynamic Contexts",

abstract = "Due to their dependence on a task-specific reward function, reinforcement learning agents are ineffective at responding to a dynamic goal or environment. This paper seeks to overcome this limitation of traditional reinforcement learning through a task-agnostic, self-organising autonomous agent framework. The proposed algorithm is a hybrid of TMGWR for self-adaptive learning of sensorimotor maps and value iteration for goal directed planning. TMGWR has been previously demonstrated to overcome the problems associated with competing sensorimotor techniques such SOM, GNG, and GWR; these problems include: difficulty in setting a suitable number of neurons for a task, inflexibility, the inability to cope with non-markovian environments, challenges with noise, and inappropriate representation of sensory observations and actions together.However, the binary sensorimotor-link implementation in the original TMGWR enables catastrophic forgetting when the agent experiences changes in the task and it is therefore not suitable for self-adaptive learning. A new sensorimotor-link update rule is presented in this paper to enable the adaptation of the sensorimotor map to new experiences. This paper has demonstrated that the TMGWR-based algorithm has better sample efficiency than model-free reinforcement learning and better self-adaptivity than both the model free and the traditional model-based reinforcement learning algorithms. Moreover, thealgorithm has been demonstrated to give the lowest overall computational cost when compared to traditional reinforcement learning algorithms.",

keywords = "autonomous agent, planning, unsupervised learning, sensorimotor, Artificial Intelligence",

author = "Chinedu Ezenkwu and Andrew Starkey",

note = "This work is funded by the Tertiary Education Trust Fund (TETFund) scheme of the Federal Republic of Nigeria.",

year = "2022",

month = jun,

day = "2",

doi = "10.1007/s43674-022-00037-9",

language = "English",

volume = "2",

journal = "Advances in Computational Intelligence",

issn = "2730-7808",

publisher = "Springer ",

}

TY - JOUR

T1 - An Unsupervised Autonomous Learning Framework for Goal-directed Behaviours in Dynamic Contexts

AU - Ezenkwu, Chinedu

AU - Starkey, Andrew

N1 - This work is funded by the Tertiary Education Trust Fund (TETFund) scheme of the Federal Republic of Nigeria.

PY - 2022/6/2

Y1 - 2022/6/2

N2 - Due to their dependence on a task-specific reward function, reinforcement learning agents are ineffective at responding to a dynamic goal or environment. This paper seeks to overcome this limitation of traditional reinforcement learning through a task-agnostic, self-organising autonomous agent framework. The proposed algorithm is a hybrid of TMGWR for self-adaptive learning of sensorimotor maps and value iteration for goal directed planning. TMGWR has been previously demonstrated to overcome the problems associated with competing sensorimotor techniques such SOM, GNG, and GWR; these problems include: difficulty in setting a suitable number of neurons for a task, inflexibility, the inability to cope with non-markovian environments, challenges with noise, and inappropriate representation of sensory observations and actions together.However, the binary sensorimotor-link implementation in the original TMGWR enables catastrophic forgetting when the agent experiences changes in the task and it is therefore not suitable for self-adaptive learning. A new sensorimotor-link update rule is presented in this paper to enable the adaptation of the sensorimotor map to new experiences. This paper has demonstrated that the TMGWR-based algorithm has better sample efficiency than model-free reinforcement learning and better self-adaptivity than both the model free and the traditional model-based reinforcement learning algorithms. Moreover, thealgorithm has been demonstrated to give the lowest overall computational cost when compared to traditional reinforcement learning algorithms.

AB - Due to their dependence on a task-specific reward function, reinforcement learning agents are ineffective at responding to a dynamic goal or environment. This paper seeks to overcome this limitation of traditional reinforcement learning through a task-agnostic, self-organising autonomous agent framework. The proposed algorithm is a hybrid of TMGWR for self-adaptive learning of sensorimotor maps and value iteration for goal directed planning. TMGWR has been previously demonstrated to overcome the problems associated with competing sensorimotor techniques such SOM, GNG, and GWR; these problems include: difficulty in setting a suitable number of neurons for a task, inflexibility, the inability to cope with non-markovian environments, challenges with noise, and inappropriate representation of sensory observations and actions together.However, the binary sensorimotor-link implementation in the original TMGWR enables catastrophic forgetting when the agent experiences changes in the task and it is therefore not suitable for self-adaptive learning. A new sensorimotor-link update rule is presented in this paper to enable the adaptation of the sensorimotor map to new experiences. This paper has demonstrated that the TMGWR-based algorithm has better sample efficiency than model-free reinforcement learning and better self-adaptivity than both the model free and the traditional model-based reinforcement learning algorithms. Moreover, thealgorithm has been demonstrated to give the lowest overall computational cost when compared to traditional reinforcement learning algorithms.

KW - autonomous agent

KW - planning

KW - unsupervised learning

KW - sensorimotor

KW - Artificial Intelligence

U2 - 10.1007/s43674-022-00037-9

DO - 10.1007/s43674-022-00037-9

M3 - Article

SN - 2730-7808

VL - 2

JO - Advances in Computational Intelligence

JF - Advances in Computational Intelligence

M1 - 26

ER -

An Unsupervised Autonomous Learning Framework for Goal-directed Behaviours in Dynamic Contexts

Abstract

Bibliographical note

Data Availability Statement

Keywords

Access to Document

Fingerprint

Cite this