Interpretability and performance comparisons of decision tree surrogate models produced by AggreVaTe


Abstract

Imitation learning algorithms, such as AggreVaTe, have proven successful in solving many challenging tasks accurately and efficiently. In practice, however, their adoption has been limited: the black-box policies produced by imitation learning algorithms cannot provide the safety guarantees needed for real-world applications. This paper extends this field by producing a decision tree surrogate model from AggreVaTe and comparing it to other imitation learning algorithms (behavioral cloning, GAIL, DAgger, Viper) in terms of both interpretability and performance. A modification to AggreVaTe is proposed to train decision tree policies that can be used to explain the model's individual decisions. Three simple OpenAI Gym environments have been used to compare the different imitation learning algorithms. The experiments reveal that, in terms of performance, AggreVaTe generally outperforms the behavioral cloning baseline but performs slightly worse than GAIL, DAgger, and Viper. In terms of interpretability, AggreVaTe performs slightly better on these simple environments. Both results may be explained by the smaller number of data points AggreVaTe uses. Future work could examine the subjective interpretability of AggreVaTe, as well as more difficult environments where AggreVaTe's additional exploration should help it find better solutions.
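To illustrate the kind of modification the abstract describes, the sketch below trains a decision tree policy with an AggreVaTe-style loop: roll in with the current policy to a random switch time, query the expert's cost-to-go for each action at the reached state, and aggregate regret-weighted examples into the tree's training set. The toy chain environment, the expert, and the weighting scheme are all illustrative assumptions, not the thesis's actual implementation.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Toy chain environment (hypothetical, for illustration only): states 0..N-1,
# action 1 moves right, action 0 moves left; the goal is state N-1.
N = 10

def step(s, a):
    return min(s + 1, N - 1) if a == 1 else max(s - 1, 0)

def expert_action(s):
    return 1  # the expert always moves toward the goal

def expert_cost_to_go(s, a):
    # steps remaining to reach the goal after taking action a, then acting optimally
    return (N - 1) - step(s, a)

rng = np.random.default_rng(0)
X, y, w = [], [], []  # aggregated states, expert-best actions, regret weights
policy = None

for it in range(5):                 # AggreVaTe-style iterations
    for _ in range(20):
        s, t = 0, rng.integers(0, N)
        for _ in range(t):          # roll in with the current policy (expert on iter 0)
            a = expert_action(s) if policy is None else int(policy.predict([[s]])[0])
            s = step(s, a)
        # query the expert cost-to-go of each action at the switch state
        costs = [expert_cost_to_go(s, a) for a in (0, 1)]
        X.append([s])
        y.append(int(np.argmin(costs)))
        # weight by the regret of picking the worse action (plus a small floor)
        w.append(max(costs) - min(costs) + 1e-3)
    # refit an interpretable, shallow decision tree on the aggregated data
    policy = DecisionTreeClassifier(max_depth=3).fit(X, y, sample_weight=w)
```

Because each training example is labeled with the action of lowest expert cost-to-go, the final shallow tree can be read directly as a set of state-threshold rules, which is the interpretability property the comparison relies on.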
