Sample-Efficient Reinforcement Learning for Flight Control

Chan, W.Y.

Sample-Efficient Reinforcement Learning for Flight Control

Advancing Fault-Tolerant Control

Master thesis (2024)

Authors

W.Y. Chan Aerospace Engineering

Contributors

E. van Kampen (mentor)

Faculty

Aerospace Engineering

Reinforcement Learning (RL) Adaptive control Fault Tolerance Flight Control Incremental Dual Heuristic Programming (IDHP) Actor Critic Designs (ACD)

To reference this document use:

http://resolver.tudelft.nl/uuid:2e6b0eda-aceb-44c7-88de-7322cf8e8958

More Info

expand_more

Published Date

30-08-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Aerospace Engineering

Abstract

Incremental Dual Heuristic Programming (IDHP) is a successor to the Dual Heuristic Programming (DHP) algorithm that uses an online identified incremental system model, this algorithm showed promising flight control performance and tolerance of faults in simulation experiments. This paper studies the potential for extending IDHP through augmenting the computation of agent updates and returns, more specifically, by using eligibility trace updates and multi-step temporal difference error. This results in the IDHP(𝜆), MIDHP, and MIDHP(𝜆) algorithms, which are compared against IDHP in several simulated flight control scenarios with faults introduced mid-flight. The results demonstrate that the proposed algorithms have improved flight control performance and fault tolerance in terms of tracking errors when controlling a nominal aircraft and an aircraft with faults introduced, with the most improvement observed in MIDHP(𝜆)

Files

Wing MSc Thesis

.pdf | 9.29 Mb)