Evaluation of Video Summarization using DSNet and Action Localization Datasets

Groenewegen, D.H.E.

Evaluation of Video Summarization using DSNet and Action Localization Datasets

Bachelor thesis (2021)

Authors

D.H.E. Groenewegen Electrical Engineering, Mathematics and Computer Science

Contributors

O. Strafforello Electrical Engineering, Mathematics and Computer Science (mentor)

Seyran Khademi History, Form & Aesthetics (graduation committee member)

Thomas Höllt Computer Graphics and Visualisation (coach)

Faculty

Electrical Engineering, Mathematics and Computer Science, Electrical Engineering, Mathematics and Computer Science

Deep learning Action localization dataset Video summarization DSNet Supervised learning

To reference this document use:

http://resolver.tudelft.nl/uuid:f463d54d-de06-4ae3-8106-59d2d4e9353d

More Info

expand_more

Published Date

01-07-2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

In this paper, the DSNet framework used for automatic video summarization gets reviewed when using action localization datasets. The problem facing video summarizations using deep learning techniques is that datasets can be subjective depending on preferences of human annotators, making for noise in the labeling. This paper will look at a anchor-based approach and anchor-free approach which were introduced by the DSNet framework. More specific it will evaluate in experiments using different hyper-parameters if these approaches gain an increased performances when using action localization datasets instead. These results will show the increase in accuracy when using action localization datasets. Moreover it will compare the different approaches, meaning anchor-based and anchor-free, and see if they still have comparable performance with the method.

Files

Final_paper.pdf

(pdf | 0.375 Mb)

Unknown license