Loading…

Deepspace: Dynamic Spatial and Source CUE Based Source Separation for Dialog Enhancement

Dialog Enhancement (DE) is a feature which allows a user to increase the level of dialog in TV or movie content relative to nondialog sounds. When only the original mix is available, DE is "unguided," and requires source separation. In this paper, we describe the DeepSpace system, which pe...

Full description

Saved in:
Bibliographic Details
Main Authors: Master, Aaron, Lu, Lie, Samuelsson, Jonas, Lehtonen, Heidi-Maria, Norcross, Scott, Swedlow, Nathan, Howard, Audrey
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Dialog Enhancement (DE) is a feature which allows a user to increase the level of dialog in TV or movie content relative to nondialog sounds. When only the original mix is available, DE is "unguided," and requires source separation. In this paper, we describe the DeepSpace system, which performs source separation using both dynamic spatial cues and source cues to support unguided DE. Its technologies include spatio-level filtering (SLF) and deep-learning based dialog classification and denoising. Using subjective listening tests, we show that DeepSpace demonstrates significantly improved overall performance relative to state-of-the-art systems available for testing. We explore the feasibility of using existing automated metrics to evaluate unguided DE systems.
ISSN:2379-190X
DOI:10.1109/ICASSP49357.2023.10095497