Exploiting visual-based intent classification for diverse social image retrieval
Abstract
In the 2017 MediaEval Retrieving Diverse Social Images task, we (TUD-MMC team) propose a novel, intent-based approach to social image search result diversification. The underlying assumption is that the visual appearance of social images is shaped by the underlying photographic act, i.e., why the images were taken. Better understanding the rationale behind the photographic act could therefore benefit social image search result diversification. To investigate this idea, we employ a manual content analysis approach to create a taxonomy of intent classes. Our experiments show that a convolutional neural network (CNN) classifier is able to capture the visual differences between the classes in the intent taxonomy. We cluster the images in the Flickr baseline ranking according to their predicted intent class and generate a re-ranked list by alternating images from different clusters. Our results reveal that, compared to conventional diversification strategies, intent-based search result diversification yields a considerable improvement in cluster recall, along with several additional benefits.
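The re-ranking step described above (clustering by predicted intent and alternating between clusters) can be illustrated with a minimal sketch. The function and variable names below are hypothetical, not taken from the paper, and the sketch assumes the baseline ranking and the per-image intent predictions from the CNN classifier are already available.

```python
from collections import defaultdict, deque

def diversify_by_intent(ranked_images, predicted_intent):
    """Re-rank a relevance-ordered list by round-robin over intent clusters.

    ranked_images: list of image ids, ordered by the Flickr baseline relevance.
    predicted_intent: dict mapping image id -> intent class label
                      (e.g., the output of an intent classifier).
    """
    # Group images into clusters by predicted intent, preserving baseline order.
    clusters = defaultdict(deque)
    for img in ranked_images:
        clusters[predicted_intent[img]].append(img)

    # Alternate between clusters: in each round, take the highest-ranked
    # remaining image from every non-empty cluster until all images are used.
    cluster_queues = list(clusters.values())
    reranked = []
    while cluster_queues:
        next_round = []
        for queue in cluster_queues:
            reranked.append(queue.popleft())
            if queue:
                next_round.append(queue)
        cluster_queues = next_round
    return reranked
```

In this sketch, diversity is promoted because consecutive positions in the re-ranked list tend to come from different intent clusters, while relevance order is preserved within each cluster.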