TY - GEN
T1 - Quasi-online reinforcement learning for robots
AU - Bakker, Bram
AU - Zhumatiy, Viktor
AU - Gruener, Gabriel
AU - Schmidhuber, Jürgen
N1 - Generated from Scopus record by KAUST IRTS on 2022-09-14
PY - 2006/12/27
Y1 - 2006/12/27
N2 - This paper describes quasi-online reinforcement learning: while a robot is exploring its environment, in the background a probabilistic model of the environment is built on the fly as new experiences arrive; the policy is trained concurrently based on this model using an anytime algorithm. Prioritized sweeping, directed exploration, and transformed reward functions provide additional speed-ups. The robot quickly learns goal-directed policies from scratch, requiring few interactions with the environment and making efficient use of available computation time. From an outside perspective it learns the behavior online and in real time. We describe comparisons with standard methods and show the individual utility of each of the proposed techniques. © 2006 IEEE.
AB - This paper describes quasi-online reinforcement learning: while a robot is exploring its environment, in the background a probabilistic model of the environment is built on the fly as new experiences arrive; the policy is trained concurrently based on this model using an anytime algorithm. Prioritized sweeping, directed exploration, and transformed reward functions provide additional speed-ups. The robot quickly learns goal-directed policies from scratch, requiring few interactions with the environment and making efficient use of available computation time. From an outside perspective it learns the behavior online and in real time. We describe comparisons with standard methods and show the individual utility of each of the proposed techniques. © 2006 IEEE.
UR - http://ieeexplore.ieee.org/document/1642157/
UR - http://www.scopus.com/inward/record.url?scp=33845607326&partnerID=8YFLogxK
U2 - 10.1109/ROBOT.2006.1642157
DO - 10.1109/ROBOT.2006.1642157
M3 - Conference contribution
SN - 0780395069
SP - 2997
EP - 3002
BT - Proceedings - IEEE International Conference on Robotics and Automation
ER -