Loading…

Deep Neural Network Based Multi-Channel Speech Enhancement for Real-Time Voice Communication Using Smartphones

Recently, the performance of speech enhancement has been improved via deep neural networks. However, most of them are too heavy for voice communication using smartphones, and some are non-causal systems. In this paper, we introduce some effective techniques improving the performance even with light-...

Full description

Saved in:
Bibliographic Details
Main Authors: Baek, Soonho, Lee, Myungho, Moon, Han-gil
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recently, the performance of speech enhancement has been improved via deep neural networks. However, most of them are too heavy for voice communication using smartphones, and some are non-causal systems. In this paper, we introduce some effective techniques improving the performance even with light-weight models at causal system. We extract the input features by incorporating two kinds of beamformers. Furthermore, a normalization scheme is proposed to diminish the inter-channel variance between two beamformer outputs. The experimental results show the superiority of the proposed features. Moreover, the proposed method is extendable to any number of microphone systems without additional model training.
ISSN:2158-4001
DOI:10.1109/ICCE53296.2022.9730152