Loading…
Single Channel speech separation based on empirical mode decomposition and Hilbert Transform
In this study, the authors discuss unsupervised separation of two speakers from single microphone recording using empirical mode decomposition (EMD) and Hilbert transform (HT) generally known as Hilbert–Huang transform. A two-stage separation procedure is proposed for single-channel (SC) speech sepa...
Saved in:
Published in: | IET signal processing 2017-07, Vol.11 (5), p.579-586 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In this study, the authors discuss unsupervised separation of two speakers from single microphone recording using empirical mode decomposition (EMD) and Hilbert transform (HT) generally known as Hilbert–Huang transform. A two-stage separation procedure is proposed for single-channel (SC) speech separation. Initial stage of separation is done using EMD, HT and instantaneous frequencies. EMD decomposes the mixed signal into oscillatory functions known as intrinsic mode functions (IMFs). Suitable IMFs are selected using successive EMD decomposition and HT is applied to extract the instantaneous frequencies. The speech frames are grouped into two speakers using correlation of instantaneous frequencies between mixed signal and selected IMFs. Second-stage separation is done by further decomposing the estimated speakers into IMFs and finding the instantaneous amplitudes using HT. A ratio of instantaneous amplitudes of mixed speech and stage 1 recovered speech signal is computed for both speakers. Histogram of the ratio obtained can be used to estimate the ideal binary mask for each speaker. These masks are applied to the speech mixture and the underlying speakers are estimated. The proposed method was compared with the existing unsupervised SC source separation algorithms. The results show significant improvement in objective measures. |
---|---|
ISSN: | 1751-9675 1751-9683 1751-9683 |
DOI: | 10.1049/iet-spr.2016.0450 |