Graph Neural Networks Training Set Analysis

Effect of Training Data Size

Bachelor thesis (2024)

Authors

A.V. Păcurar Electrical Engineering, Mathematics and Computer Science

Contributors

E. Congeduti Computer Science & Engineering-Teaching Team - (mentor)

E.A. Markatou Cyber Security - (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

Traffic forecasting GNN Training data

To reference this document use:

http://resolver.tudelft.nl/uuid:71a1f92a-f99c-42af-9127-88df7c89d350

More Info

expand_more

Published Date

26-06-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

With the rapid increase in popularity of graph neural networks (GNNs) for the task of traffic forecasting, understanding the inner workings of these complex models becomes more important. This experiment aims to deepen our understanding of the importance that the training data has in regards to the ability of GNNs to accurately predict traffic. By repeatedly training the same GNN model with different training datasets spanning over various time frames and comparing standard performance metrics computed based on the predictions performed by the model, this paper concludes that while using less training data leads to a slight decrease in performance, this is heavily dependent on the quality of the dataset. If the data gathering process is short and the sensors are not properly maintained, GNNs are not able to accurately predict traffic. On the other hand, if the data gathering process goes well and there are few missing values, GNNs perform well even when trained with smaller amounts of historical data.

Files

RP_Paper_Final.pdf

(pdf | 0.533 Mb)