MP

M. Peschl

2 records found

Authored

We propose a deep reinforcement learning algorithm that employs an adversarial training strategy for adhering to implicit human norms alongside optimizing for a narrow goal objective. Previous methods which incorporate human values into reinforcement learning algorithms either sc ...

Aligning AI with Human Norms

Multi-Objective Deep Reinforcement Learning with Active Preference Elicitation

The field of deep reinforcement learning has seen major successes recently, achieving superhuman performance in discrete games such as Go and the Atari domain, as well as astounding results in continuous robot locomotion tasks. However, the correct specification of human intentio ...