Automatic speech recognition fusion approach to unsupervised speaker clustering and labeling
Main Authors: | , , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Summary: | This paper describes a fully unsupervised approach to speaker clustering and labeling that uses automatic speech recognition (ASR) to bootstrap speaker identification (SID). An algorithm combining the two technologies correctly clustered and labeled 299 NATO ship-to-ship transmissions with an accuracy of 89% in an on-line scenario (no a priori training). This fusion approach outperformed ASR alone by 23.6%, manually trained VQ-SID by 12.7%, and GMM/UBM-SID by 8.6%. The paper demonstrates that, under certain circumstances, unsupervised, self-organizing systems can be more effective than manually trained ones. |
---|---|
ISSN: | 1095-323X, 2996-2358 |
DOI: | 10.1109/AERO.2006.1656042 |