Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration

Athrey, Archith; Mazhar, Othmane; Guo, Meichen; De Schutter, B.H.K.; Shi, Shengling

Conference paper (2024)

Authors

Archith Athrey Student

Othmane Mazhar Université Paris Cité Grands

Meichen Guo Team Meichen Guo

B.H.K. De Schutter Delft Center for Systems and Control

Shengling Shi Team Bart De Schutter

Research Group

Team Bart De Schutter

To reference this document use:

http://resolver.tudelft.nl/uuid:9d7488a1-1715-467a-8761-9c60a0eb2bcd

Published Date

2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this paper, we analyze the regret incurred by a computationally efficient exploration strategy, known as naive exploration, for controlling unknown partially observable systems within the Linear Quadratic Gaussian (LQG) framework. We introduce a two-phase control algorithm called LQG-NAIVE, which involves an initial phase of injecting Gaussian input signals to obtain a system model, followed by a second phase of an interplay between naive exploration and control in an episodic fashion. We show that LQG-NAIVE achieves a regret growth rate of Õ(√T), i.e., O(√T) up to logarithmic factors after T time steps, and we validate its performance through numerical simulations. Additionally, we propose LQG-IF2E, which extends the exploration signal to a 'closed-loop' setting by incorporating the Fisher Information Matrix (FIM). We provide compelling numerical evidence of the competitive performance of LQG-IF2E compared to LQG-NAIVE.
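The two-phase structure described in the abstract can be illustrated with a minimal sketch. It is not the paper's LQG-NAIVE algorithm: for brevity it assumes a fully observed scalar system (so ordinary least squares and a certainty-equivalent LQR gain stand in for the partially observable identification and LQG control steps), and the noise level, doubling episode lengths, and the exploration schedule sigma = length^(-1/4) are illustrative assumptions. All function names are hypothetical.

```python
import numpy as np

def lqr_gain(a, b, q=1.0, r=1.0, iters=500):
    """Scalar discrete-time LQR gain via fixed-point iteration of the Riccati equation."""
    p = q
    for _ in range(iters):
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    return (a * b * p) / (r + b * b * p)

def least_squares(xs, us, xn):
    """Estimate (a, b) in x_next = a*x + b*u + noise from logged data."""
    Z = np.column_stack([xs, us])
    theta, *_ = np.linalg.lstsq(Z, np.asarray(xn), rcond=None)
    return theta[0], theta[1]

def run(a_true=0.9, b_true=0.5, warmup=200, episodes=6, seed=0):
    rng = np.random.default_rng(seed)
    xs, us, xn = [], [], []
    x = 0.0

    def step(x, u):
        # Simulate the (unknown to the learner) system and log the transition.
        x_next = a_true * x + b_true * u + 0.1 * rng.standard_normal()
        xs.append(x)
        us.append(u)
        xn.append(x_next)
        return x_next

    # Phase 1: inject purely Gaussian inputs to obtain an initial model.
    for _ in range(warmup):
        x = step(x, rng.standard_normal())
    a_hat, b_hat = least_squares(xs, us, xn)

    # Phase 2: episodes of control plus additive Gaussian exploration.
    length = warmup
    for _ in range(episodes):
        gain = lqr_gain(a_hat, b_hat)      # certainty-equivalent controller
        sigma = length ** -0.25            # assumed decaying exploration std
        for _ in range(length):
            u = -gain * x + sigma * rng.standard_normal()
            x = step(x, u)
        a_hat, b_hat = least_squares(xs, us, xn)  # refit at episode end
        length *= 2                        # assumed doubling episode length
    return a_hat, b_hat

if __name__ == "__main__":
    print(run())
```

The exploration term is simply added to the control input, which is what makes the strategy cheap to compute: no optimization over exploration signals is needed, only a variance schedule that shrinks as the model improves.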

Files

Regret_Analysis_of_Learning-Ba... (pdf)

(pdf | 0.58 MB)

- Embargo expired on 24-01-2025

License info not available