Loading…

Predicting the intrusiveness of noise through sparse coding with auditory kernels

•We propose a perceptual model using sparse sound representations with auditory kernels.•We show that the number of kernels models perceptual properties of background noise.•The achieved average correlation to subjective noise intrusiveness scores exceeds 95%. This paper presents a novel approach to...

Full description

Saved in:
Bibliographic Details
Published in:Speech communication 2016-02, Vol.76, p.186-200
Main Authors: Ullmann, Raphael, Bourlard, Hervé
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•We propose a perceptual model using sparse sound representations with auditory kernels.•We show that the number of kernels models perceptual properties of background noise.•The achieved average correlation to subjective noise intrusiveness scores exceeds 95%. This paper presents a novel approach to predicting the intrusiveness of background noises in speech signals as it is perceived by human listeners. This problem is of particular interest in telephony, where the recently widened range of transmitted audio frequencies has increased the importance of appropriate background noise reduction strategies. Current approaches predict the average noise intrusiveness score that would be obtained in a subjective listening test by combining different signal features related to physical properties (e.g., signal energy, spectral distribution) or psychoacoustic estimations (e.g., loudness) of noise. The combination and/or implementation of such features requires expert knowledge or the availability of training data. We present a novel approach that is based on a model of efficient sound coding, using a sparse spike coding representation of noise. We show that the sparsity of these representations implicitly models several factors in the perception of noise, and yields predictions of noise intrusiveness scores that compare to or outperform traditional features, without the use of training data. Our evaluation datasets and used performance metrics are based on standardized methods for the evaluation of quality prediction models.
ISSN:0167-6393
1872-7182
DOI:10.1016/j.specom.2015.07.005