M.D.I. Museur

Bachelor thesis (1)

1 records found

One-Shot Generalization in Offline Reinforcement Learning with WSAC-N

Bachelor thesis (2024) - M.D.I. Museur (author), M.R. Weltevrede (mentor), M.T.J. Spaan (mentor), Matthijs Spaan (mentor), Matthijs T.J. Spaan (mentor), Matthijs T. J. Spaan (mentor), E. Congeduti (graduation committee member)

Recent work has shown that offline reinforcement learning (RL) does not generalize well to new environments compared to behavioral cloning (BC). We propose WSAC-N, an ensemble model of soft actor-critics with weights to de-emphasize actions with high variance. We compare the zero ...