A.V. Mandersloot

Conference paper (1)

1 record found

Exploring the Effects of Conditioning Independent Q-Learners on the Sufficient Statistic for Dec-POMDPs

Conference paper (2020) - A.V. Mandersloot (author), F.A. Oliehoek (author), A.T. Czechowski (author)

In this study, we investigate the effects of conditioning Independent Q-Learners (IQL) not only on the individual action-observation history but also on the sufficient plan-time statistic for Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs). In doing so, we ...