M.R. Weltevrede | TU Delft Repository

Performance of Decision Transformer in multi-task offline reinforcement learning

How does the introduction of sub-optimal data affect the performance of the model?

Bachelor thesis (2024) - P.Z. Bieszczad (author), Matthijs Spaan (mentor), Matthijs T.J. Spaan (mentor), Matthijs T. J. Spaan (mentor), M.T.J. Spaan (mentor), M. T.J. Spaan (mentor), M.R. Weltevrede (mentor), E. Congeduti (graduation committee member)

In the field of Artificial Intelligence (AI), techniques like Reinforcement Learning (RL) and Decision Transformer (DT) are utilized by machines to learn from experiences and solve problems. The distinction between offline and online learning determines whether the machine learns ...

One-Shot Generalization in Offline Reinforcement Learning with WSAC-N

Bachelor thesis (2024) - M.D.I. Museur (author), M.R. Weltevrede (mentor), Matthijs Spaan (mentor), Matthijs T.J. Spaan (mentor), Matthijs T. J. Spaan (mentor), M.T.J. Spaan (mentor), M. T.J. Spaan (mentor), E. Congeduti (graduation committee member)

Recent work has shown that offline reinforcement learning (RL) does not generalize well to new environments compared to behavioral cloning (BC). We propose WSAC-N, an ensemble model of soft actor-critics with weights to de-emphasize actions with high variance. We compare the zero ...

Multi-Task Offline Reinforcement Learning

Experimental Evaluation of the Generalizability of the Soft Actor-Critic + Behavioral Cloning Algorithm

Bachelor thesis (2024) - A.O. Geist (author), Matthijs Spaan (mentor), Matthijs T.J. Spaan (mentor), Matthijs T. J. Spaan (mentor), M.T.J. Spaan (mentor), M. T.J. Spaan (mentor), M.R. Weltevrede (mentor), E. Congeduti (graduation committee member)

This paper examines the generalization capabilities of the Soft Actor-Critic (SAC) algorithm when combined with Behavioral Cloning (BC) in a MiniGrid Four-Room Environment. Reinforcement learning (RL), particularly offline, is important for tasks where interactions with the envir ...

Multi-task Offline Reinforcement Learning with CQL

A study on how dataset size and diversity increase generalization performance

Bachelor thesis (2024) - L. Lipinskas (author), Matthijs Spaan (mentor), Matthijs T.J. Spaan (mentor), Matthijs T. J. Spaan (mentor), M.T.J. Spaan (mentor), M. T.J. Spaan (mentor), M.R. Weltevrede (mentor), E. Congeduti (graduation committee member)

Reinforcement learning (RL) is a type of machine learning where a model learns by
making an observation of the current state it is in, picking out an action to execute, and
observing the reward of said action, after which it receives the next state and repeats the
...