Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy

Haberfehlner, Helga; van de Ven, Shankara S.; van der Burg, Sven A.; Huber, Florian; Georgievska, Sonja; Aleo, Ignazio; Harlaar, Jaap; Bonouvrié, Laura A.; Van Der Krogt, Marjolein Margaretha; Buizer, A. I.

doi:10.3389/frobt.2023.1108114

Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy

A novel approach using markerless motion tracking and machine learning

Journal article (2023)

Authors

Helga Haberfehlner Amsterdam Movement Sciences, Rehabilitation & Development, Amsterdam UMC, Katholieke Universiteit Leuven

Shankara S. van de Ven Amsterdam UMC

Sven A. van der Burg Netherlands eScience Center

Florian Huber Netherlands eScience Center, University of Applied Sciences, Düsseldorf

Sonja Georgievska Netherlands eScience Center

Ignazio Aleo Moveshelf Labs B.V.

Jaap Harlaar Biomechatronics & Human-Machine Control

Laura A. Bonouvrié Vrije Universiteit Amsterdam

Marjolein Margaretha Van Der Krogt Amsterdam UMC, Amsterdam Movement Sciences, Rehabilitation & Development

A. I. Buizer Amsterdam Movement Sciences, Rehabilitation & Development, Emma Children's Hospital Academic Medical Center, University of Amsterdam, Amsterdam UMC

Research Group

Biomechatronics & Human-Machine Control

Copyright: © 2023 Helga Haberfehlner, Shankara S. van de Ven, Sven A. van der Burg, Florian Huber, Sonja Georgievska, Ignazio Aleo, J. Harlaar, Laura A. Bonouvrié, Marjolein M. van der Krogt, Annemieke I. Buizer

DOI: https://doi.org/10.3389/frobt.2023.1108114

Machine learning Movement disorders Cerebral palsy Motion capture Human pose estimation Markerless skeleton tracking

To reference this document use:

http://resolver.tudelft.nl/uuid:e4931165-63d4-4137-9b38-1d19339b3de2

More Info

expand_more

Published Date

2023

Language

English

Research Group

Biomechatronics & Human-Machine Control

Abstract

Introduction: Video-based clinical rating plays an important role in assessing dystonia and monitoring the effect of treatment in dyskinetic cerebral palsy (CP). However, evaluation by clinicians is time-consuming, and the quality of rating is dependent on experience. The aim of the current study is to provide a proof-of-concept for a machine learning approach to automatically assess scoring of dystonia using 2D stick figures extracted from videos. Model performance was compared to human performance. Methods: A total of 187 video sequences of 34 individuals with dyskinetic CP (8–23 years, all non-ambulatory) were filmed at rest during lying and supported sitting. Videos were scored by three raters according to the Dyskinesia Impairment Scale (DIS) for arm and leg dystonia (normalized scores ranging from 0–1). Coordinates in pixels of the left and right wrist, elbow, shoulder, hip, knee and ankle were extracted using DeepLabCut, an open source toolbox that builds on a pose estimation algorithm. Within a subset, tracking accuracy was assessed for a pretrained human model and for models trained with an increasing number of manually labeled frames. The mean absolute error (MAE) between DeepLabCut’s prediction of the position of body points and manual labels was calculated. Subsequently, movement and position features were calculated from extracted body point coordinates. These features were fed into a Random Forest Regressor to train a model to predict the clinical scores. The model performance trained with data from one rater evaluated by MAEs (model-rater) was compared to inter-rater accuracy. Results: A tracking accuracy of 4.5 pixels (approximately 1.5 cm) could be achieved by adding 15–20 manually labeled frames per video. The MAEs for the trained models ranged from 0.21 ± 0.15 for arm dystonia to 0.14 ± 0.10 for leg dystonia (normalized DIS scores). The inter-rater MAEs were 0.21 ± 0.22 and 0.16 ± 0.20, respectively. Conclusion: This proof-of-concept study shows the potential of using stick figures extracted from common videos in a machine learning approach to automatically assess dystonia. Sufficient tracking accuracy can be reached by manually adding labels within 15–20 frames per video. With a relatively small data set, it is possible to train a model that can automatically assess dystonia with a performance comparable to human scoring.

Files

Frobt_10_1108114.pdf

(pdf | 1.3 Mb)