An Evaluation of Intrusive Instrumental Intelligibility Metrics

van Kuyk, Steven; Kleijn, W.B.; Hendriks, R.C.

doi:10.1109/TASLP.2018.2856374

An Evaluation of Intrusive Instrumental Intelligibility Metrics

Journal article (2018)

Authors

Steven van Kuyk Victoria University of Wellington

W.B. Kleijn Victoria University of Wellington, Signal Processing Systems -

R.C. Hendriks Signal Processing Systems -

Research Group

Signal Processing Systems () (TU Delft)

DOI: https://doi.org/10.1109/TASLP.2018.2856374

Speech enhancement Instrumental measures Intelligibility prediction

To reference this document use:

http://resolver.tudelft.nl/uuid:2b452430-054c-43af-9544-bcf0b042996c

More Info

expand_more

Published Date

2018

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Microelectronics

Research Group

Signal Processing Systems

Abstract

Instrumental intelligibility metrics are commonly used as an alternative to listening tests. This paper evaluates 12 monaural intrusive intelligibility metrics: SII, HEGP, CSII, HASPI, NCM, QSTI, STOI, ESTOI, MIKNN, SIMI, SIIB, and sEPSM^corr. In addition, this paper investigates the ability of intelligibility metrics to generalize to new types of distortions and analyzes why the top performing metrics have high performance. The intelligibility data were obtained from 11 listening tests described in the literature. The stimuli included Dutch, Danish, and English speech that was distorted by additive noise, reverberation, competing talkers, preprocessing enhancement, and postprocessing enhancement. SIIB and HASPI had the highest performance achieving a correlation with listening test scores on average of ρ =0.92 and ρ =0.89, respectively. The high performance of SIIB may, in part, be the result of SIIBs developers having access to all the intelligibility data considered in the evaluation. The results show that intelligibility metrics tend to perform poorly on datasets that were not used during their development. By modifying the original implementations of SIIB and STOI, the advantage of reducing statistical dependencies between input features is demonstrated. Additionally, this paper presents a new version of SIIB called SIIB^Gauss, which has similar performance to SIIB and HASPI, but takes less time to compute by two orders of magnitude.

Files

An_Evaluation_of_Intrusive_Ins... (pdf)

(pdf | 2.35 Mb)

Unknown license