Learning to Control Multi-Dimensional Autonomous Agents using Hebbian Learning

A Global Reward Approach

Abstract

The novelty-RAAHN algorithm has been shown to effectively learn a desired behavior from raw inputs by connecting an autoencoder with a Hebbian network. Hebbian learning is compelling for its biological plausibility and simplicity: it changes the weight of a connection based only on the activations of the neurons it connects, and, when combined with neuromodulation, it can effectively reinforce good behaviors. These low-level synaptic weight changes allow the three learning tasks of perception, prediction, and action to be merged more tightly. However, the state-of-the-art algorithm requires a highly detailed modulation scheme designed for a specific system, which is disconnected from the overall objective it optimizes. In this thesis, we propose that similar learning behavior can be achieved by making the autonomous agent react to longer-term rewards, thereby implicitly introducing prediction capabilities. In doing so, the required modulation scheme becomes connected to the global optimization objective.
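To make the mechanism described above concrete, the following is a minimal sketch of a neuromodulated Hebbian weight update: each weight changes in proportion to the product of the activations of the two neurons it connects, scaled by a modulation signal. The function name, array shapes, and learning rate are illustrative assumptions, not taken from the thesis.

```python
import numpy as np

def hebbian_update(w, pre, post, modulation, lr=0.1):
    """Modulated Hebbian rule (illustrative sketch).

    Each weight w[i, j] changes in proportion to the product of the
    post-synaptic activation post[i] and pre-synaptic activation pre[j],
    scaled by a modulation signal (e.g. derived from reward) and a
    learning rate. A positive modulation reinforces the current
    input-output association; a negative one weakens it.
    """
    return w + lr * modulation * np.outer(post, pre)

# Toy example: two input neurons feeding one output neuron.
w = np.zeros((1, 2))
pre = np.array([1.0, 0.5])
post = np.array([0.8])
w = hebbian_update(w, pre, post, modulation=1.0)  # reward reinforces
```

In the global-reward view proposed here, the `modulation` argument would be tied to a longer-term reward signal rather than hand-designed per system.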
