## Abstract

Abstract—A developing agent learns a model of the world

by observing regularities occurring in its sensory inputs. In a

continuous domain where the model is represented by a set of

rules, a significant part of the task of learning such a model is to

find appropriate intervals within the continuous state variables,

such that these intervals can be used to define rules whose

predictions are reliable. We propose a technique to find such

intervals (or regions) by means of finding clusters on approximate

probability distributions of sensory variables. We compare

this cluster-based method with an alternative landmark-based

algorithm. We evaluate both techniques on a data log recorded

in a simulation based on OpenArena, a three-dimensional firstperson-

perspective computer game, and demonstrate the results

of how the techniques can learn rules which describe walking

behaviour. While both techniques work reasonably well, the

clustering approach seems to give more “natural” regions which

correspond more closely to what a human would expect; we

speculate that such regions should be more useful if they are to

form a basis for further learning of higher order rules.

by observing regularities occurring in its sensory inputs. In a

continuous domain where the model is represented by a set of

rules, a significant part of the task of learning such a model is to

find appropriate intervals within the continuous state variables,

such that these intervals can be used to define rules whose

predictions are reliable. We propose a technique to find such

intervals (or regions) by means of finding clusters on approximate

probability distributions of sensory variables. We compare

this cluster-based method with an alternative landmark-based

algorithm. We evaluate both techniques on a data log recorded

in a simulation based on OpenArena, a three-dimensional firstperson-

perspective computer game, and demonstrate the results

of how the techniques can learn rules which describe walking

behaviour. While both techniques work reasonably well, the

clustering approach seems to give more “natural” regions which

correspond more closely to what a human would expect; we

speculate that such regions should be more useful if they are to

form a basis for further learning of higher order rules.

Original language | English |
---|---|

Title of host publication | 2011 IEEE International Conference on Development and Learning (ICDL 2011) |

Publisher | Institute of Electrical and Electronics Engineers (IEEE) |

Pages | n/a |

Number of pages | 8 |

Volume | 2 |

ISBN (Print) | 978-1-61284-989-8 |

DOIs | |

Publication status | Published - 24 Aug 2011 |

Event | International Conference on Development and Learning and Epigenetic Robotics - Frankfurt, Germany Duration: 24 Aug 2011 → … |

### Conference

Conference | International Conference on Development and Learning and Epigenetic Robotics |
---|---|

Country/Territory | Germany |

City | Frankfurt |

Period | 24/08/11 → … |