Reinforcement and shaping in learning action sequences with neural dynamics

Matthew Luciw, Yulia Sandamirskaya, Sohrob Kazerounian, Jurgen Schmidhuber, Gregor Schoner

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

Neural dynamics offer a theoretical and computational framework, in which cognitive architectures may be developed, which are suitable both to model psychophysics of human behaviour and to control robotic behaviour. Recently, we have introduced reinforcement learning in this framework, which allows an agent to learn goal-directed sequences of behaviours based on a reward signal, perceived at the end of a sequence. Although stability of the dynamic neural fields and behavioural organisation allowed to demonstrate autonomous learning in the robotic system, learning of longer sequences was taking prohibitedly long time. Here, we combine the neural dynamic reinforcement learning with shaping, which consists in providing intermediate rewards and accelerates learning.We have implemented the new learning algorithm on a simulated Kuka YouBot robot and evaluated robustness and efficacy of learning in a pick-and-place task.
Original languageEnglish (US)
Title of host publicationIEEE ICDL-EPIROB 2014 - 4th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages48-55
Number of pages8
ISBN (Print)9781479975402
DOIs
StatePublished - Dec 11 2014
Externally publishedYes

Bibliographical note

Generated from Scopus record by KAUST IRTS on 2022-09-14

Fingerprint

Dive into the research topics of 'Reinforcement and shaping in learning action sequences with neural dynamics'. Together they form a unique fingerprint.

Cite this