Max Welling | TU Delft Repository

Multi-agent MDP homomorphic networks

Conference paper (2022) - Elise van der Pol (author), Herke van Hoof (author), Frans A Oliehoek (author), Max Welling (author)

This paper introduces Multi-Agent MDP Homomorphic Networks, a class of networks that allows distributed execution using only local information, yet is able to share experience between global symmetries in the joint state-action space of cooperative multi-agent systems. In coopera ...

Plannable Approximations to MDP Homomorphisms: Equivariance under Actions

Conference paper (2020) - Elise van der Pol (author), Thomas Kipf (author), Frans A Oliehoek (author), Max Welling (author)

This work exploits action equivariance for representation learning in reinforcement learning. Equivariance under actions states that transitions in the input space are mirrored by equivalent transitions in latent space, while the map and transition functions should also commute. ...

MDP homomorphic networks: Group symmetries in reinforcement learning

Journal article (2020) - Elise van der Pol (author), Daniel E. Worrall (author), Herke van Hoof (author), Frans A Oliehoek (author), Max Welling (author)

This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit k ...