Non-deterministic policy improvement stabilizes approximated reinforcement learning

Conference paper (2016)

Authors

J.W. Böhmer Technical University of Berlin

Rong Guo

Klaus Obermayer

Affiliation

External organisation

To reference this document use:

http://resolver.tudelft.nl/uuid:05fb68cf-d6e8-4612-be99-fe2cc62492c7

More Info

expand_more

Published Date

2016

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Affiliation

External organisation