HZ
Hengshun Zhou
2 records found
1
The First Multimodal Information Based Speech Processing (Misp) Challenge
Data, Tasks, Baselines And Results
In this paper we discuss the rational of the Multi-model Information based Speech Processing (MISP) Challenge, and provide a detailed description of the data recorded, the two evaluation tasks and the corresponding baselines, followed by a summary of submitted systems and evaluat
...
Audio-Visual Wake Word Spotting in MISP2021 Challenge
Dataset Release and Deep Analysis
In this paper, we describe and release publicly the audio-visual wake word spotting (WWS) database in the MISP2021 Challenge, which covers a range of scenarios of audio and video data collected by near-, mid-, and far-field microphone arrays, and cameras, to create a shared and p
...