AmsterTime

Yildiz, B.; Khademi, S.; Siebes, R.M.; van Gemert, J.C.

doi:10.1109/ICPR56361.2022.9956049

AmsterTime

A Visual Place Recognition Benchmark Dataset for Severe Domain Shift

Conference paper (2022)

Authors

B. Yildiz (Pattern Recognition and Bioinformatics)

S. Khademi (History, Form & Aesthetics, Architecture and the Built Environment)

R.M. Siebes (Vrije Universiteit Amsterdam)

J.C. van Gemert (Pattern Recognition and Bioinformatics)

Research Group

Pattern Recognition and Bioinformatics (TU Delft)

DOI: https://doi.org/10.1109/ICPR56361.2022.9956049

To reference this document use:

http://resolver.tudelft.nl/uuid:c659750b-9fe1-4a42-b9b1-e18e367bdb4a

Published Date

2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Pattern Recognition and Bioinformatics

Abstract

We introduce AmsterTime: a challenging dataset to benchmark visual place recognition (VPR) in the presence of a severe domain shift. AmsterTime offers a collection of 2,500 well-curated images in which street-view images are matched to historical archival images of the same scene in the city of Amsterdam. The image pairs capture the same place with different cameras, viewpoints, and appearances. Unlike existing benchmark datasets, AmsterTime is directly crowdsourced in a GIS navigation platform (Mapillary). We evaluate various baselines, including non-learning, supervised, and self-supervised methods, pre-trained on different relevant datasets, for both verification and retrieval tasks. In our results, the ResNet-101 model pre-trained on the Landmarks dataset achieves the best accuracy on both the verification and retrieval tasks, at 84% and 24%, respectively. Additionally, a subset of Amsterdam landmarks is collected for feature evaluation in a classification task. The classification labels are further used to extract visual explanations with Grad-CAM, which allow inspection of the visual similarities learned by deep metric learning models.
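The two evaluation tasks named in the abstract can be illustrated with a small sketch. This is not the authors' evaluation code: the abstract does not state the similarity measure, the decision threshold, or the exact retrieval metric, so the sketch assumes cosine similarity between feature vectors, a fixed threshold for verification, and top-1 accuracy for retrieval. The function names and the toy two-dimensional "features" are hypothetical and stand in for embeddings from a model such as ResNet-101.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def verification_accuracy(pairs, labels, threshold):
    """Verification (assumed setup): predict 'same place' when the
    similarity of a pair reaches the threshold, then score accuracy."""
    correct = 0
    for (u, v), same_place in zip(pairs, labels):
        predicted_same = cosine(u, v) >= threshold
        correct += int(predicted_same == same_place)
    return correct / len(pairs)


def retrieval_top1(queries, gallery):
    """Retrieval (assumed setup): query i is correct when its most
    similar gallery item is gallery[i], its ground-truth match."""
    hits = 0
    for i, query in enumerate(queries):
        best = max(range(len(gallery)), key=lambda j: cosine(query, gallery[j]))
        hits += int(best == i)
    return hits / len(queries)


# Toy features (hypothetical): street[i] and archive[i] show the same place.
street = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
archive = [[0.9, 0.1], [0.2, 0.8], [1.0, -1.0]]

pairs = [
    (street[0], archive[0]),
    (street[1], archive[1]),
    (street[0], archive[1]),
    (street[2], archive[2]),
]
labels = [True, True, False, True]

print(verification_accuracy(pairs, labels, threshold=0.5))
print(retrieval_top1(street, archive))
```

In this toy setup the third place is deliberately hard: its archival feature points in a very different direction from its street-view feature, which mimics the domain shift the dataset is built to expose. It is missed in both tasks.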

Files

AmsterTime_A_Visual_Place_Reco... (pdf)

(pdf | 10.6 Mb)

- Embargo expired on 01-07-2023

Unknown license