Understanding the influence of DNA fragment lengths in detecting cancer

Detection of cancer using blood

Abstract

Detecting cancer at an early stage could change the course of the disease. Liquid biopsy of blood is a non-invasive examination that reveals biomarkers indicating whether a tumour is present in the organism. This research examines the relevance of DNA fragments, specifically fragment length, to cancer detection. It gives an in-depth interpretation of the fragment length distribution as a predictor of whether a patient is healthy or has cancer. The distribution was explored from four perspectives: the complete fragment length distribution, the size range from 90 to 150 bp, the important lengths selected by feature extraction methods, and the Fourier Transform of the initial data. Each representation was used as input to three machine learning models. Using only the fragment lengths between 93 and 98 bp produced accuracy and AUC scores above 0.85 for all supervised classification models. Processing the data with the Fourier Transform and using the amplitudes of the spectrum as features in the Random Forest model resulted in an AUC of 0.99.
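As a rough illustration of three of the four perspectives named in the abstract, the sketch below builds a fragment length distribution, restricts it to the 90 to 150 bp range, and computes the amplitude spectrum of its discrete Fourier transform. It is not the thesis code. The function names, the 1 to 400 bp histogram range, the normalisation to relative frequencies, and the toy sample are all assumptions made for the example; the resulting vectors would then be passed to a classifier such as a Random Forest, which is not shown here.

```python
import cmath
from collections import Counter


def length_distribution(lengths, min_len=1, max_len=400):
    """Normalised histogram of fragment lengths over [min_len, max_len] bp.

    The default range and the normalisation are assumptions for this sketch.
    """
    counts = Counter(l for l in lengths if min_len <= l <= max_len)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no fragments fall inside the requested range")
    return [counts.get(l, 0) / total for l in range(min_len, max_len + 1)]


def amplitude_spectrum(signal):
    """Amplitudes of the discrete Fourier transform, non-negative frequencies only."""
    n = len(signal)
    amplitudes = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        amplitudes.append(abs(coeff))
    return amplitudes


# Hypothetical toy sample of fragment lengths in bp (not real data).
toy_lengths = [167] * 50 + [145] * 20 + [120] * 20 + [95] * 10

full = length_distribution(toy_lengths)             # complete distribution
short = length_distribution(toy_lengths, 90, 150)   # 90 to 150 bp range only
spectrum = amplitude_spectrum(full)                 # Fourier amplitude features
```

Each of `full`, `short`, and `spectrum` is one feature vector for one sample; stacking such vectors across patients gives the feature matrix a supervised classifier would be trained on. The direct summation is quadratic in the number of length bins, which is fine at this size; a fast Fourier transform routine would be the usual choice for larger inputs.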
