Modified GNN-SubNet: leveraging local versus global Graph Neural Network explanations for disease subnetwork detection

Milchi, E.

Bachelor thesis (2024)

Authors

E. Milchi Electrical Engineering, Mathematics and Computer Science

Contributors

M. Khosla (mentor)

J.M. Weber Pattern Recognition and Bioinformatics (mentor)

Thomas Abeel Pattern Recognition and Bioinformatics (coach)

Faculty

Electrical Engineering, Mathematics and Computer Science

Explainable AI, Disease detection, Graph Neural Network

To reference this document use:

http://resolver.tudelft.nl/uuid:95952008-b82c-4ffe-8c26-edb623473a8f

Published Date

26-06-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

As graph neural networks (GNNs) become more widely used in the biomedical field, there is a growing need for insight into how their predictions are made. One algorithm that provides such insight is GNN-SubNet, developed to detect disease subnetworks in protein-protein interaction (PPI) networks. GNN-SubNet uses a sampling scheme to generate a global explanation in the form of a node mask, which indicates each node's importance across all of the GNN's predictions on a dataset. The aim of this study is to validate GNN-SubNet by comparing it with an alternative approach to obtaining global explanations. Instead of obtaining the node mask via a sampling scheme, multiple (local) explanations are optimized per dataset sample, and the resulting node masks are then aggregated by taking either the mean (Mean Aggregation) or the median (Median Aggregation) value per node.
GNN-SubNet is compared with its two modifications in two ways: first by analyzing which disease subnetworks each algorithm detects, and second by applying metrics devised to assess explainers for GNNs. The results show that all algorithms detect subnetworks associated with cancer. In terms of metric scores, Mean Aggregation obtains the explanations with the highest fidelity; however, no algorithm obtains sparse explanations. The study also indicates that GNN-SubNet produces varying outcomes over multiple runs, so its results may not be reproducible.
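
The aggregation step described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes that one local node mask per dataset sample has already been obtained, that every mask covers the same nodes in the same order, and the function name `aggregate_node_masks` and the toy values are hypothetical.

```python
from statistics import mean, median


def aggregate_node_masks(local_masks, how="mean"):
    """Combine per-sample node masks into one global mask, node by node.

    local_masks: list of equal-length lists, one importance value per node.
    how: "mean" (Mean Aggregation) or "median" (Median Aggregation).
    """
    if not local_masks:
        raise ValueError("need at least one local mask")
    n_nodes = len(local_masks[0])
    if any(len(mask) != n_nodes for mask in local_masks):
        raise ValueError("all masks must cover the same nodes")
    reducer = {"mean": mean, "median": median}[how]
    # zip(*local_masks) groups, for each node, its values across all samples.
    return [reducer(node_values) for node_values in zip(*local_masks)]


# Hypothetical toy data: 3 dataset samples, 4 nodes (illustrative values only).
local_masks = [
    [0.9, 0.1, 0.5, 0.0],
    [0.7, 0.2, 0.5, 0.0],
    [0.2, 0.3, 0.5, 1.0],
]

global_mean = aggregate_node_masks(local_masks, how="mean")
global_median = aggregate_node_masks(local_masks, how="median")
```

In this toy example the last node is marked important in only one of the three samples. The mean gives it a nonzero global importance, while the median leaves it at zero, which shows how the two aggregations can rank the same node differently.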

Files

Research_Paper_Elena_Oana_Milc... (pdf)

(pdf | 1.14 Mb)

Unknown license