TY - GEN
T1 - Reinforcement and shaping in learning action sequences with neural dynamics
AU - Luciw, Matthew
AU - Sandamirskaya, Yulia
AU - Kazerounian, Sohrob
AU - Schmidhuber, Jurgen
AU - Schoner, Gregor
N1 - Generated from Scopus record by KAUST IRTS on 2022-09-14
PY - 2014/12/11
Y1 - 2014/12/11
N2 - Neural dynamics offer a theoretical and computational framework, in which cognitive architectures may be developed, which are suitable both to model psychophysics of human behaviour and to control robotic behaviour. Recently, we have introduced reinforcement learning in this framework, which allows an agent to learn goal-directed sequences of behaviours based on a reward signal, perceived at the end of a sequence. Although stability of the dynamic neural fields and behavioural organisation allowed to demonstrate autonomous learning in the robotic system, learning of longer sequences was taking prohibitedly long time. Here, we combine the neural dynamic reinforcement learning with shaping, which consists in providing intermediate rewards and accelerates learning.We have implemented the new learning algorithm on a simulated Kuka YouBot robot and evaluated robustness and efficacy of learning in a pick-and-place task.
AB - Neural dynamics offer a theoretical and computational framework, in which cognitive architectures may be developed, which are suitable both to model psychophysics of human behaviour and to control robotic behaviour. Recently, we have introduced reinforcement learning in this framework, which allows an agent to learn goal-directed sequences of behaviours based on a reward signal, perceived at the end of a sequence. Although stability of the dynamic neural fields and behavioural organisation allowed to demonstrate autonomous learning in the robotic system, learning of longer sequences was taking prohibitedly long time. Here, we combine the neural dynamic reinforcement learning with shaping, which consists in providing intermediate rewards and accelerates learning.We have implemented the new learning algorithm on a simulated Kuka YouBot robot and evaluated robustness and efficacy of learning in a pick-and-place task.
UR - https://ieeexplore.ieee.org/document/6982953
UR - http://www.scopus.com/inward/record.url?scp=84920913382&partnerID=8YFLogxK
U2 - 10.1109/DEVLRN.2014.6982953
DO - 10.1109/DEVLRN.2014.6982953
M3 - Conference contribution
SN - 9781479975402
SP - 48
EP - 55
BT - IEEE ICDL-EPIROB 2014 - 4th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics
PB - Institute of Electrical and Electronics Engineers Inc.
ER -