Person | TU Delft Repository

M. Peschl

2 records found

Authored

Training for Implicit Norms in Deep Reinforcement Learning Agents through Adversarial Multi-Objective Reward Optimization

Conference paper (2021) - M. Peschl

We propose a deep reinforcement learning algorithm that employs an adversarial training strategy for adhering to implicit human norms alongside optimizing for a narrow goal objective. Previous methods which incorporate human values into reinforcement learning algorithms either sc ...

Aligning AI with Human Norms

Multi-Objective Deep Reinforcement Learning with Active Preference Elicitation

Master thesis (2021) - M. Peschl, L. Cavalcante Siebert, A. Zgonnikov, F.A. Oliehoek, D. Kurowicka

The field of deep reinforcement learning has seen major successes recently, achieving superhuman performance in discrete games such as Go and the Atari domain, as well as astounding results in continuous robot locomotion tasks. However, the correct specification of human intentio ...