Integrating state representation learning into deep reinforcement learning
Abstract
Most deep reinforcement learning techniques are unsuitable for robotics, as they require too much interaction time to learn useful, general control policies. This problem can be largely attributed to the fact that a state representation needs to be learned as part of learning control policies, which can only be done through fitting expected returns based on observed rewards. While the reward function provides information on the desirability of the state of the world, it does not necessarily provide information on how to distill a good, general representation of that state from the sensory observations. State representation learning objectives can be used to help learn such a representation. While many of these objectives have been proposed, they are typically not directly combined with reinforcement learning algorithms. We investigate several methods for integrating state representation learning into reinforcement learning. In these methods, the state representation learning objectives help regularize the state representation during reinforcement learning, and the reinforcement learning itself is viewed as a crucial state representation learning objective and allowed to help shape the representation. Using autonomous racing tests in the TORCS simulator, we show how the integrated methods quickly learn policies that generalize to new environments much better than deep reinforcement learning without state representation learning.
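To make the integration concrete, the following is a minimal sketch of the general idea described in the abstract: a shared encoder whose features are shaped jointly by a reinforcement learning loss (a DQN-style TD error) and an auxiliary state representation learning loss (here, observation reconstruction). This is an illustrative assumption, not the paper's exact architecture or objectives; the network sizes, the choice of reconstruction as the SRL objective, and the weight `srl_weight` are placeholders chosen for brevity.

```python
# Sketch: RL loss and an auxiliary SRL loss both backpropagate into one encoder,
# so the SRL objective regularizes the representation while RL also shapes it.
import torch
import torch.nn as nn
import torch.nn.functional as F

OBS_DIM, STATE_DIM, N_ACTIONS = 32, 8, 4  # assumed toy dimensions

encoder = nn.Sequential(nn.Linear(OBS_DIM, 64), nn.ReLU(), nn.Linear(64, STATE_DIM))
decoder = nn.Sequential(nn.Linear(STATE_DIM, 64), nn.ReLU(), nn.Linear(64, OBS_DIM))
q_head = nn.Linear(STATE_DIM, N_ACTIONS)

params = list(encoder.parameters()) + list(decoder.parameters()) + list(q_head.parameters())
optimizer = torch.optim.Adam(params, lr=1e-3)

def update(obs, action, reward, next_obs, done, gamma=0.99, srl_weight=1.0):
    """One gradient step on the combined RL + SRL objective."""
    state = encoder(obs)                       # learned state representation
    q_pred = q_head(state).gather(1, action)   # Q(s, a) for the taken actions

    with torch.no_grad():                      # bootstrapped TD target
        next_q = q_head(encoder(next_obs)).max(dim=1, keepdim=True).values
        target = reward + gamma * (1.0 - done) * next_q

    rl_loss = F.smooth_l1_loss(q_pred, target)   # RL objective shapes the representation
    srl_loss = F.mse_loss(decoder(state), obs)   # auxiliary SRL objective regularizes it

    loss = rl_loss + srl_weight * srl_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return rl_loss.item(), srl_loss.item()

# Example call on a random batch of transitions.
batch = 16
obs = torch.randn(batch, OBS_DIM)
action = torch.randint(0, N_ACTIONS, (batch, 1))
reward = torch.randn(batch, 1)
next_obs = torch.randn(batch, OBS_DIM)
done = torch.zeros(batch, 1)
print(update(obs, action, reward, next_obs, done))
```

In practice, other SRL objectives (e.g., forward or inverse dynamics prediction) could replace or accompany the reconstruction term; the key point illustrated here is that both gradients flow into the same encoder.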