Sebastiaan Scholten | TU Delft Repository

Modelling Human Word Learning and Recognition Using Visually Grounded Speech

Journal article (2022) - D.G.M. Merkx (author), D.G.M. Merkx (author), D.G.M. Merkx (author), Sebastiaan Scholten (author), Stefan L. Frank (author), Mirjam Ernestus (author), O.E. Scharenborg (author), Odette Scharenborg (author)

Many computational models of speech recognition assume that the set of target words is already given. This implies that these models learn to recognise speech in a biologically unrealistic manner, i.e. with prior lexical knowledge and explicit supervision. In contrast, visually g ...

Learning to recognise words using visually grounded speech

Conference paper (2021) - Sebastiaan Scholten (author), Danny Merkx (author), O.E. Scharenborg (author), Odette Scharenborg (author)

We investigated word recognition in a Visually Grounded Speech model. The model has been trained on pairs of images and spoken captions to create visually grounded embeddings which can be used for speech to image retrieval and vice versa. We investigate whether such a model can b ...

Towards creating a non-synthetic group recommendation dataset

Conference paper (2019) - Matthijs Rijlaarsdam (author), Sebastiaan Scholten (author), Cynthia C.S. Liem (author), Cynthia Liem (author), Cynthia C. S. Liem (author), C.C.S. Liem (author)

Recommender systems can be useful in group settings, e.g. when choosing a movie to watch with a group. However, while considerable research in group recommendation has been performed, we still lack truly ecological datasets on group recommendations in real life consumption scenar ...