Improving the Performance of Object Counting Using Training Images in the Frequency Domain

Rogmans, D.Z.

Improving the Performance of Object Counting Using Training Images in the Frequency Domain

Bachelor thesis (2021)

Authors

D.Z. Rogmans Electrical Engineering, Mathematics and Computer Science

Contributors

Yancong Lin Pattern Recognition and Bioinformatics (mentor)

Silvia Pintea Pattern Recognition and Bioinformatics (graduation committee member)

Elvin Isufi (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

Neural network Fft Object locator Image processing

To reference this document use:

http://resolver.tudelft.nl/uuid:866fe0a9-f8f6-4037-863b-890827eaaa34

More Info

expand_more

Published Date

01-07-2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Convolutional Neural Networks (CNNs) have made significant strides in the field of image processing over the last decade. Different approaches have been taken and improvements have been suggested. This paper looks at a newer novelty to neural networks for image counting, which is based on single-pixel center localization instead of the traditional bounding boxes. This neural network’s loss function is the weighted average Hausdorff distance, which does not only take into account the number of misclassified points but also the distance between predicted points and ground truth values. The paper aims to compare the accuracy of the single-pixel center neural network on original training images of wheat heads as compared to filtered images. The filtered images have had a band pass filter applied to them, that is constructed by looking at the average frequency of wheat heads. It filters out certain lower and higher frequencies up to a threshold, and its aim is to reduce background noise and accentuate the wheat heads. Results showed that there was no significant and attributable improvement in the performance of the object counter when trained on images with filtered frequency information. A discussion of the unexpected results then carries out, with the aim of rationalizing the insignificant improvement in performance of the neural network on filtered images. As part of the discussion and conclusion, a recommendation is also made, giving insights into determining if this single-pixel center neural network is appropriate for a given dataset of images.

Files

Danirogmans_researchproject_fi... (pdf)

(pdf | 1.08 Mb)

Unknown license