Tian Tian

Conference paper (1)

1 records found

Learning fine-grained semantics in spoken language using visual grounding

Conference paper (2021) - X. Wang (author), Tian Tian (author), Jihua Zhu (author), O.E. Scharenborg (author)

In the case of unwritten languages, acoustic models cannot be trained in the standard way, i.e., using speech and textual transcriptions. Recently, several methods have been proposed to learn speech representations using images, i.e., using visual grounding. Existing studies have ...