Neural network partitioning for resource-limited environments

Geel, P.

Neural network partitioning for resource-limited environments

Master thesis (2023)

Authors

P. Geel Electrical Engineering, Mathematics and Computer Science

Contributors

Z Al-Ars Computer Engineering (mentor)

N.P. van der Meijs Signal Processing Systems (coach)

J. Petri-König ExternalOrganization (graduation committee member)

Kevin McElligott (coach)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

Machine Learning Deep learning FPGA FINN Heterogeneous acceleration CPU Edge AI

To reference this document use:

http://resolver.tudelft.nl/uuid:e39ca6fc-cdd0-40c1-97ed-cca7ff7232c8

More Info

expand_more

Published Date

02-05-2023

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

The demand for implementing neural networks on edge devices has rapidly increased as they allow designers to move away from expensive server-grade hardware. However, due to the limited resources available on edge devices, it is challenging to implement complex neural networks. This study selected the Kria SoM KV260 hardware platform due to its affordability and sufficient hardware capabilities for creating a resource-constrained environment. By leveraging the hardware acceleration capabilities of the FPGA for specific nodes of the MobileNetv1 model and offloading other nodes to the onboard quad-core ARM cortex-A53 CPU, it was feasible to implement a neural network on a hybrid combination of CPU and FPGA. Results showed that when executing the MobileNetv1 model in a hybrid configuration, a total runtime improvement of 2.8x over a pure CPU implementation can be achieved. The study concludes that node-wise partitioning of the MobileNetv1 model is a practical solution. This approach offers a cost-effective solution for users who seek an accessible way to run neural networks without the need for expensive server-grade hardware.

Files

Patrick_Geel_Master_Thesis_fin... (pdf)

(pdf | 4.49 Mb)

Unknown license