Towards growth-accommodating deep learning-based semantic segmentation of pediatric hand phalanges
More Info
expand_more
Abstract
Existing deep learning (DL) networks are primarily trained on adult datasets and may not always generalize to pediatric populations, where growth plays a major role. Here, we investigated improving semantic segmentation outcomes of pediatric hand phalanges from radiographs without relying on fully pediatric training datasets, which are scarce. First, alternative DL networks (FCN-8, FCN-32, U-Net, Inception U-Net, and DeepLabv3+) were trained with manually segmented radiographs of near-skeletally-mature (NSM) subjects and their performances were evaluated using mean intersection-over-union (Mean IoU) and multiclass Dice scores. DeepLabv3+ and Inception U-Net performed the best for NSM segmentation, with Mean IoU scores of 0.899 ± 0.035 and 0.887 ± 0.062, respectively. These networks were then used to investigate zero pediatric data (scaling-based data augmentation) and minimal pediatric data (incremental pediatric data substitution) approaches to improve age-domain generalizability. The minimal pediatric data approach proved effective, with a 20 % pediatric data inclusion leading to an up to 21.1 % increase in Mean IoU for pediatric subjects compared to networks trained exclusively on NSM subjects. Furthermore, no adverse effects of this approach were found when tested on NSM subjects, and there were even improvements in performance for Inception U-Net. To conclude, we highlight that networks utilizing multi-scale filters perform best for the semantic segmentation of hand phalanges. We further demonstrate that a minimal inclusion of pediatric training data can markedly improve age-domain generalizability for semantic segmentation tasks. This removes the difficult task of gathering large training datasets of pediatric subjects, which is often impractical, if not impossible.