Over the last decade, there have been significant advances in model-based deep reinforcement learning. One of the most successful such algorithms is AlphaZero, which combines Monte Carlo Tree Search with deep learning. AlphaZero and its successors commonly describe a unified framework for tree construction and acting: for instance, the tree is built with PUCT and actions are taken according to visitation counts. Policies based on visitation counts inherently make assumptions about how the tree was constructed, which is problematic because it constrains the construction algorithm; breadth-first tree construction, for example, yields a uniform visitation policy. To address this, we investigate the goals of extracting policies from decision trees and propose novel construction-decoupled policies. Furthermore, we use these policies to modify how decision nodes are evaluated and exploit this during tree construction. We provide theoretical analysis and empirical evidence that our novel policies can benefit AlphaZero. Our results on classical Gym environments show that the benefits are especially prominent for limited simulation budgets. The code is available on GitHub.
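For reference, the coupling described above can be seen in the standard AlphaZero-style formulation (shown here as a common convention, not a definition from this work): during tree construction, PUCT selects the action maximizing an upper-confidence score, and the acting policy is extracted from root visitation counts, where $N$ denotes visit counts, $P$ the network prior, $Q$ the value estimate, $c_{\mathrm{puct}}$ the exploration constant, and $\tau$ a temperature:
\[
a^{*} = \arg\max_{a} \Big[ Q(s,a) + c_{\mathrm{puct}} \, P(s,a) \, \frac{\sqrt{\sum_{b} N(s,b)}}{1 + N(s,a)} \Big],
\qquad
\pi(a \mid s) = \frac{N(s,a)^{1/\tau}}{\sum_{b} N(s,b)^{1/\tau}}.
\]
Under breadth-first construction all root counts $N(s,a)$ are equal, so this $\pi$ is uniform regardless of the value estimates, which illustrates why visit-count policies constrain the choice of construction algorithm.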