Using Decision Trees produced by Generative Adversarial Imitation Learning to give insight into black box Reinforcement Learning models


Abstract

Machine learning models are increasingly being used in fields that directly impact human lives. These models are often black boxes: they lack transparency, which undermines trust and holds back their adoption. To increase transparency and trust, this research investigates whether imitation learning, specifically Generative Adversarial Imitation Learning (GAIL), can be used to give insight into black-box models by extracting decision trees. To achieve this, GAIL was extended so that it produces decision trees as surrogate policies. The extracted trees were then evaluated in terms of performance, fidelity, behavior, and interpretability in three different environments. We find that GAIL is able to extract decision trees with high fidelity that give insightful information about the expert models. Further research could address more complex environments and black-box models, other surrogate models, and possibilities for more specific local insights.
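
To make the fidelity metric concrete, below is a minimal sketch of extracting a decision-tree surrogate from a black-box policy and measuring how often the tree agrees with it. This is a simplified behavioral-cloning-style illustration, not the GAIL extension the thesis describes: the hand-coded `expert_policy`, the `CartPole-v1` environment, and scikit-learn's `DecisionTreeClassifier` are all assumptions chosen for the example, standing in for the thesis's expert models, environments, and extraction procedure.

```python
# Sketch: extract a decision-tree surrogate from a black-box policy and
# measure its fidelity (action agreement with the expert on fresh states).
# NOTE: a simplified stand-in, not the GAIL-based extraction from the thesis.
import numpy as np
import gymnasium as gym
from sklearn.tree import DecisionTreeClassifier, export_text

env = gym.make("CartPole-v1")

def expert_policy(obs):
    # Placeholder black-box expert: a hand-coded heuristic standing in for
    # e.g. a trained deep RL policy (push right when the pole tilts right).
    return int(obs[2] + 0.5 * obs[3] > 0.0)

def rollout(policy, episodes=50):
    """Collect (state, action) pairs by running `policy` in the environment."""
    states, actions = [], []
    for _ in range(episodes):
        obs, _ = env.reset()
        done = False
        while not done:
            a = policy(obs)
            states.append(obs)
            actions.append(a)
            obs, _, terminated, truncated, _ = env.step(a)
            done = terminated or truncated
    return np.array(states), np.array(actions)

# 1. Sample demonstrations from the black-box expert.
X, y = rollout(expert_policy)

# 2. Fit a shallow decision tree as an interpretable surrogate policy.
tree = DecisionTreeClassifier(max_depth=3).fit(X, y)

# 3. Fidelity: fraction of fresh expert states where the tree picks
#    the same action as the expert.
X_test, y_test = rollout(expert_policy, episodes=10)
fidelity = (tree.predict(X_test) == y_test).mean()
print(f"fidelity: {fidelity:.3f}")

# 4. The interpretability payoff: the tree's decision rules are readable.
print(export_text(tree, feature_names=["cart_pos", "cart_vel",
                                       "pole_angle", "pole_vel"]))
```

The printed rules (e.g. thresholds on `pole_angle`) are the kind of insight into the expert's behavior the abstract refers to; in the thesis this is obtained via the GAIL extension rather than direct supervised fitting.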