Danny Merkx

Master thesis (1)

1 records found

Word recognition in a model of visually grounded speech

An analysis using techniques inspired by human speech processing research

Master thesis (2020) - J.S.M. Scholten (author), Odette Scharenborg (mentor), O.E. Scharenborg (mentor), Danny Merkx (mentor), N. Tintarev (graduation committee member), Nava Tintarev (graduation committee member), Catherine Oertel (graduation committee member), Catharine Oertel (graduation committee member)

A Visually Grounded Speech model is a neural model which is trained to embed image caption pairs closely together in a common embedding space. As a result, such a model can retrieve semantically related images given a speech caption and vice versa. The purpose of this research is ...