Spectral Modularity: Conceptual Challenges, Algorithmic Enhancements and Systematic Benchmarking

Master thesis (2024)

Authors

V.R. Bockstael Electrical Engineering, Mathematics and Computer Science

Contributors

N. Yorke-Smith Algorithmics - (mentor)

A.I. Băbeanu Algorithmics - (mentor)

H. Wang (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:d2dffafd-8218-42ca-848e-51f9f02c8ad8

More Info

expand_more

Published Date

28-08-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Cluster analysis in high dimensional data is a difficult but desirable task. Many existing methods fail to cluster high dimensional data due to what is known as the curse of dimensionality. Therefore, sophisticated clustering methods are in wide development. Along these lines, spectral modularity maximization emerged from the theory of random matrices and graph modularity. The method is based on a filtering of the spectral decomposition of similarity matrices. Despite the recent success of this method, we uncover a fundamental challenge of spectral modularity: the spectral modularity breaks down as the number of groups in a data set grows. To mitigate this challenge, we propose two solutions: one solution based on a regularization and one solution based on a normalization. We perform a thorough empirical analysis of the clustering performance of the solutions and find that, not only do our methods resolve the breakdown of spectral modularity, but they also outperform existing clustering methods in
a variety of settings.

Files

Cs_thesis_final_vrbockstael.pd... (pdf)

(pdf | 10.5 Mb)

- Embargo expired in 28-08-2024

Unknown license