Sam Devlin

Conference paper (1)

Journal article (1)

2 records found

Difference rewards policy gradients

Journal article (2022) - Jacopo Castellini (author), Sam Devlin (author), Frans Oliehoek (author), Frans A Oliehoek (author), Frans A. Oliehoek (author), F.A. Oliehoek (author), Rahul Savani (author)

Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent’s contribution to the overall pe ...

Difference Rewards Policy Gradients

Conference paper (2021) - Jacopo Castellini (author), Frans Oliehoek (author), Frans A Oliehoek (author), Frans A. Oliehoek (author), F.A. Oliehoek (author), Sam Devlin (author), Rahul Savani (author)