Machine learning for real-time reservoir operation simulation: comparing input variables and algorithms for the Sirikit Reservoir, Thailand

Wannasin, C.; Brauer, C. C.; Uijlenhoet, R.; Torfs, P. J.J.F.; Weerts, A. H.

Machine learning for real-time reservoir operation simulation: comparing input variables and algorithms for the Sirikit Reservoir, Thailand

Journal article (2024)

Authors

C. Wannasin University of Twente, Wageningen University & Research, Deltares

C. C. Brauer Wageningen University & Research

R. Uijlenhoet Water Resources

P. J.J.F. Torfs Wageningen University & Research

A. H. Weerts Deltares, Wageningen University & Research

Research Group

Water Resources

Machine learning Input variable selection Input variable scaling Multi-purpose reservoir Real-time reservoir operation Upper Chao Phraya River basin

To reference this document use:

http://resolver.tudelft.nl/uuid:de869079-132a-44f6-9025-f61229a333af

More Info

expand_more

Published Date

2024

Language

English

Research Group

Water Resources

Abstract

Machine learning (ML) models offer advantages over process-based models for real-time reservoir operation modelling, yet the impact of input variable selection (IVS) and data pre-processing on model performance remains underexplored. This study investigates various input variables for simulating daily reservoir outflow, using the Sirikit reservoir in Thailand as a case study. The datasets include daily Sirikit storage and inflow, outflow of Bhumibol (neighbouring reservoir), downstream discharge, and temporal factors (month and day of the week). Time series decomposition and correlation analyses were used to assess data relationships. We tested seven ML models: multiple linear regression, support vector machine, K-nearest neighbour, classification and regression tree, random forest, multi-layer perceptron, and recurrent neural network (RNN). The optimal input set comprised the previous day’s storage, inflow from 2 days before to 2 days after, and month. With these inputs, all ML models simulated outflow adequately (KGEtraining = 0.42–1.0 and KGEtesting = 0.46–0.56), with RNN showing the most potential for improvement. Input scaling significantly enhanced model performance, reducing RMSEtraining by 44 m3 s-1 and RMSEtesting by 14 m3 s-1. This study’s novelty lies in its comprehensive insights of IVS and data scaling, highlighting their critical roles in enhancing ML model application for operational reservoir simulations.

Files

Jh2024153-1.pdf

(pdf | 1.53 Mb)