Deep learning methods for clinical workflow phase-based prediction of procedure duration

Frassini, E.; Vijfvinkel, Teddy S.; Butler, Rick M.; van der Elst, M.; Hendriks, B.H.W.; Van Den Dobbelsteen, JJ

Deep learning methods for clinical workflow phase-based prediction of procedure duration: a benchmark study

Journal article (2025)

Authors

E. Frassini (Medical Instruments & Bio-Inspired Technology)

Teddy S. Vijfvinkel (Medical Instruments & Bio-Inspired Technology; Reinier de Graaf Gasthuis)

Rick M. Butler (Medical Instruments & Bio-Inspired Technology)

M. van der Elst (Reinier de Graaf Gasthuis; Medical Instruments & Bio-Inspired Technology)

B.H.W. Hendriks (Philips Healthcare Nederland; Medical Instruments & Bio-Inspired Technology)

JJ Van Den Dobbelsteen (Medical Instruments & Bio-Inspired Technology)

Research Group

Medical Instruments & Bio-Inspired Technology

Deep learning; CNN; Time series; Regression

To reference this document use:

http://resolver.tudelft.nl/uuid:966381cb-4afb-41a8-8562-1e9c05c7c9df

More Info

Published Date

2025

Language

English

Abstract

This study evaluates the performance of deep learning models in predicting the end time of procedures performed in the cardiac catheterization laboratory (cath lab). Only the clinical phases derived from video analysis were used as input to the algorithms. InceptionTime and LSTM-FCN yielded the most accurate predictions: InceptionTime achieved Mean Absolute Error (MAE) values below 5 min and Symmetric Mean Absolute Percentage Error (SMAPE) under 6% at 60-s sampling intervals. In contrast, the LSTM with attention mechanism and the standard LSTM had higher error rates, indicating difficulty in handling both long-term and short-term dependencies. CNN-based models, especially InceptionTime, excel at feature extraction across different scales, making them effective for time-series prediction. We also analyzed training and testing times. CNN models, despite higher computational costs, significantly reduced prediction errors. The Transformer model had the fastest inference time, making it well suited to real-time applications. An ensemble model obtained by averaging the predictions of the two best-performing algorithms achieved low MAE and SMAPE, at the cost of longer training. Future research should validate these findings across different procedural contexts and explore ways to shorten training without losing accuracy. Integrating these models into clinical scheduling systems could improve efficiency in cath labs. Our research demonstrates that the implemented models can form the basis of an automated tool that predicts the optimal time to call the next patient with an average error of approximately 30 s. These findings show the effectiveness of deep learning models, especially CNN-based architectures, in accurately predicting procedure end times.
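For readers unfamiliar with the two error metrics and the averaging ensemble named in the abstract, the following is a minimal Python sketch of how they could be computed. It is not the authors' implementation. The SMAPE variant used here (absolute error divided by the mean of the absolute true and predicted values, expressed in percent) is an assumption, since the abstract does not state which definition the study uses. The function names and the example durations are hypothetical and chosen only for illustration.

```python
import numpy as np


def mae(y_true, y_pred):
    """Mean Absolute Error, in the same unit as the inputs (e.g. minutes)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(y_true - y_pred)))


def smape(y_true, y_pred):
    """Symmetric MAPE in percent.

    Assumed variant: mean(|y - yhat| / ((|y| + |yhat|) / 2)) * 100.
    Terms where both values are zero contribute 0 to the mean.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    denom = (np.abs(y_true) + np.abs(y_pred)) / 2.0
    ratio = np.divide(
        np.abs(y_true - y_pred),
        denom,
        out=np.zeros_like(denom),
        where=denom != 0,
    )
    return float(100.0 * np.mean(ratio))


def ensemble_average(pred_a, pred_b):
    """Element-wise mean of the predictions of two models."""
    pred_a = np.asarray(pred_a, dtype=float)
    pred_b = np.asarray(pred_b, dtype=float)
    return (pred_a + pred_b) / 2.0


# Hypothetical procedure durations in minutes (illustrative values only).
y_true = np.array([40.0, 60.0, 90.0])
pred_a = np.array([44.0, 57.0, 92.0])  # stand-in for one model's output
pred_b = np.array([38.0, 61.0, 94.0])  # stand-in for a second model's output

pred_ens = ensemble_average(pred_a, pred_b)
print("MAE (min):", mae(y_true, pred_ens))
print("SMAPE (%):", smape(y_true, pred_ens))
```

In this toy example the averaged prediction has a lower MAE than either individual prediction because the two sets of errors partly cancel. That cancellation depends on the example values and is not guaranteed in general.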
