DistFlow Safe Reinforcement Learning Algorithm for Voltage Magnitude Regulation in Distribution Networks

Shengren, H.; Fu, Aihui; Duque, Edgar Mauricio; Palensky, P.; Chen, Qixin; Barrios, Pedro

Journal article (2025)

Authors

H. Shengren (Intelligent Electrical Power Grids)

Aihui Fu (Intelligent Electrical Power Grids)

Edgar Mauricio Duque (Eindhoven University of Technology)

P. Palensky (Electrical Sustainable Energy)

Qixin Chen (Tsinghua University)

Pedro Barrios (Intelligent Electrical Power Grids)

Research Group

Intelligent Electrical Power Grids

Keywords

Energy management; Distribution network; Voltage regulation; Safe reinforcement learning

To reference this document use:

http://resolver.tudelft.nl/uuid:bad46128-17a3-4d64-819d-6b1dde12b2d7

Published Date

2025

Language

English

Abstract

The integration of distributed energy resources (DERs) has made voltage magnitude regulation in distribution networks more challenging. Model-based approaches, which rely on complex sequential mathematical formulations, cannot meet real-time requirements. Deep reinforcement learning (DRL) offers an alternative: the policy is trained offline with distribution network simulators and then executed online with minimal computation. However, DRL algorithms do not enforce voltage magnitude constraints during training or testing, which can lead to serious operational violations. To address these challenges, we introduce a safety-guaranteed reinforcement learning algorithm, DistFlow safe reinforcement learning (DF-SRL), designed specifically for real-time voltage magnitude regulation in distribution networks. DF-SRL uses a DistFlow linearization to construct an expert-knowledge-based safety layer. It then places this safety layer on top of the agent policy, mapping unsafe actions into the safe domain through a quadratic programming formulation. Simulation results show that DF-SRL consistently satisfies voltage magnitude constraints during both the training and real-time operation (test) phases, while converging faster and achieving higher performance than (safe) DRL benchmark algorithms.
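The safety-layer idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes a hypothetical three-bus single feeder, active-power-only actions (for example, storage injections), the standard linearized DistFlow relation for squared voltages, and a Euclidean projection of the agent's action onto the linear safe set. That projection is a quadratic program, solved here with Dykstra's alternating projections rather than a dedicated QP solver. All names, network data, and limits are illustrative assumptions.

```python
# Illustrative sketch only: network data, limits, and function names are
# assumptions for this example, not taken from the DF-SRL paper.

def path_resistance_matrix(r):
    """R[i][j] = resistance shared by the root-to-i and root-to-j paths
    on a single feeder whose k-th segment has resistance r[k]."""
    n = len(r)
    cum = [sum(r[:k + 1]) for k in range(n)]
    return [[cum[min(i, j)] for j in range(n)] for i in range(n)]

def squared_voltages(u0, R, p):
    """Linearized DistFlow, active power only:
    u_i = u0 + 2 * sum_j R[i][j] * p[j], with p the net injections (p.u.)."""
    return [u0 + 2.0 * sum(Rij * pj for Rij, pj in zip(row, p)) for row in R]

def build_constraints(u_base, R, u_min, u_max, a_max):
    """Return halfspaces (g, h), each meaning g . a <= h, that encode
    u_min <= u_base + 2 R a <= u_max and -a_max <= a_i <= a_max."""
    n = len(R)
    cons = []
    for i in range(n):
        g = [2.0 * R[i][j] for j in range(n)]
        cons.append((g, u_max - u_base[i]))                   # upper voltage limit
        cons.append(([-gj for gj in g], u_base[i] - u_min))   # lower voltage limit
    for i in range(n):
        e = [1.0 if j == i else 0.0 for j in range(n)]
        cons.append((e, a_max))                               # a_i <= a_max
        cons.append(([-ej for ej in e], a_max))               # a_i >= -a_max
    return cons

def project_to_safe_set(a_rl, cons, sweeps=500):
    """Dykstra's alternating projections onto the halfspaces in cons.
    Approximates argmin ||a - a_rl||^2 subject to g . a <= h for all (g, h)."""
    x = list(a_rl)
    corr = [[0.0] * len(x) for _ in cons]   # one correction vector per halfspace
    for _ in range(sweeps):
        for k, (g, h) in enumerate(cons):
            z = [xi + ci for xi, ci in zip(x, corr[k])]
            excess = sum(gi * zi for gi, zi in zip(g, z)) - h
            if excess > 0.0:
                # z violates this halfspace: move it back onto the boundary.
                step = excess / sum(gi * gi for gi in g)
                x = [zi - step * gi for zi, gi in zip(z, g)]
            else:
                x = z
            corr[k] = [zi - xi for zi, xi in zip(z, x)]
    return x

# Hypothetical feeder: three segments of 0.02 p.u. resistance each,
# 0.2 p.u. of surplus generation per bus, voltage limits 0.95 to 1.05 p.u.
R = path_resistance_matrix([0.02, 0.02, 0.02])
u_base = squared_voltages(1.0, R, [0.2, 0.2, 0.2])
cons = build_constraints(u_base, R, 0.95 ** 2, 1.05 ** 2, 1.0)

a_rl = [1.0, 1.0, 1.0]                     # unsafe action proposed by the agent
a_safe = project_to_safe_set(a_rl, cons)   # closest action inside the safe set
print(a_safe)
print(squared_voltages(1.0, R, [0.2 + a for a in a_safe]))
```

In this toy setting an action that already satisfies every constraint passes through unchanged, so the layer only intervenes when the proposed action would violate a limit under the linearized model. A production system would more likely hand the same problem to a dedicated QP solver. The guarantee is also only as good as the linearization, which ignores line losses and, in this sketch, reactive power.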

Files

DistFlow_Safe_Reinforcement_Le... (pdf)

(pdf | 1.03 MB)