Fast Dynamic Programming

Max, G.F.

Fast Dynamic Programming

A Numerical Method for Solving Dynamic Programming Problems

Master thesis (2019)

Authors

G.F. Max Electrical Engineering, Mathematics and Computer Science

Contributors

Tamas Keviczky Team Tamas Keviczky (mentor)

Peyman Mohajerin Mohajerin Esfahani Team Tamas Keviczky (mentor)

Mohamad Amin Sharifi Sharifi Kolarijani Team Tamas Keviczky (graduation committee member)

AJ van der Veen Signal Processing Systems (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

Optimization Dynamic Programming Numerical method Optimal control Convex optimization Fast Fourier Transformation Legendre-Fenchel Transform Convex conjugate

To reference this document use:

http://resolver.tudelft.nl/uuid:424dc384-92b5-4cb6-8392-1f9739d5075d

More Info

expand_more

Published Date

27-11-2019

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

A well-established method for finding the optimal control policy for a given dynamical system is to solve the problem iteratively going from its terminal state "backwards" in time, known as Dynamic Programming Algorithm. For a generic problem with discrete state/action space, the algorithm has computational complexity of O(NM) for N states and M actions. In this thesis, we propose a novel numerical algorithm that approaches this problem in the conjugate domain, using the so-called Legendre-Fenchel Transform. In essence, the proposed approach is analogous to, and was inspired by Fast Fourier Transform, and how it can be beneficial to do computations/analysis in the frequency domain. In particular, this approach allows us to exploit the structure of the problem (e.g., in LQ control) to drastically reduce the computational complexity to O(N+M). Of course, this computational gain comes with a cost of introducing error.

Files

MSc_thesis_Fast_Dynamic_Progra... (pdf)

(pdf | 1.64 Mb)

License info not available

MSc_thesis_Fast_Dynamic_Progra... (pdf)

(pdf | 1.64 Mb)

License info not available