Loading…

Development of context dependent sequential K-nearest neighbor classifier for usable speech classification

The accuracy of speech processing applications degrades when operating in co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally d...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America 2004-05, Vol.115 (5_Supplement), p.2427-2427
Main Authors: Shah, J. K., Iyer, A. N., Smolenski, B. Y., Yantorno, R. E.
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 2427
container_issue 5_Supplement
container_start_page 2427
container_title The Journal of the Acoustical Society of America
container_volume 115
creator Shah, J. K.
Iyer, A. N.
Smolenski, B. Y.
Yantorno, R. E.
description The accuracy of speech processing applications degrades when operating in co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded but still useful for speech processing applications (such as speaker identification or speech recognition) which do not work in co-channel environments. Usable speech measures are features that are extracted from the co-channel signal to distinguish between usable and unusable speech. Several usable speech extraction methods have recently been developed based on a single feature of the speech signal being considered. In this paper, however, a new usable speech extraction technique, which sequentially and contextually selects several features of the given signal using the K-nearest neighbor classifier, is being investigated. This new approach considers periodicity and structure based features simultaneously in order to achieve the maximum classification rate, and by observing all the incoming frames, avoids the problem of deciding the amount of data needed to make accurate decisions. A 100% accuracy can be achieved in speech processing applications by using this extracted usable speech segment.
doi_str_mv 10.1121/1.1669313
format article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1121_1_1669313</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1121_1_1669313</sourcerecordid><originalsourceid>FETCH-crossref_primary_10_1121_1_16693133</originalsourceid><addsrcrecordid>eNqVj72OwjAQhC0EEuGn4A3cUoTzJiQiNdwJifb6yJg1GBk7eA2CtycIRI-mGH0z0wxjExAzgAx-YAZlWeWQd1gCRSbSRZHNuywRQkA6r8qyzwZExxaLRV4l7LjCK1rfnNBF7jVX3kW8Rb7DBt3uGRKeL60bafkmdSgDUuQOzf6w9YErK4mMNhi4bvFCcmuRU4OoDp9SyWi8G7GelpZw_PYhm_79_i_XqQqeKKCum2BOMtxrEPXzTN3qdSb_ZvsAscVRIg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Development of context dependent sequential K-nearest neighbor classifier for usable speech classification</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>Shah, J. K. ; Iyer, A. N. ; Smolenski, B. Y. ; Yantorno, R. E.</creator><creatorcontrib>Shah, J. K. ; Iyer, A. N. ; Smolenski, B. Y. ; Yantorno, R. E.</creatorcontrib><description>The accuracy of speech processing applications degrades when operating in co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded but still useful for speech processing applications (such as speaker identification or speech recognition) which do not work in co-channel environments. Usable speech measures are features that are extracted from the co-channel signal to distinguish between usable and unusable speech. Several usable speech extraction methods have recently been developed based on a single feature of the speech signal being considered. In this paper, however, a new usable speech extraction technique, which sequentially and contextually selects several features of the given signal using the K-nearest neighbor classifier, is being investigated. This new approach considers periodicity and structure based features simultaneously in order to achieve the maximum classification rate, and by observing all the incoming frames, avoids the problem of deciding the amount of data needed to make accurate decisions. A 100% accuracy can be achieved in speech processing applications by using this extracted usable speech segment.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.1669313</identifier><language>eng</language><ispartof>The Journal of the Acoustical Society of America, 2004-05, Vol.115 (5_Supplement), p.2427-2427</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Shah, J. K.</creatorcontrib><creatorcontrib>Iyer, A. N.</creatorcontrib><creatorcontrib>Smolenski, B. Y.</creatorcontrib><creatorcontrib>Yantorno, R. E.</creatorcontrib><title>Development of context dependent sequential K-nearest neighbor classifier for usable speech classification</title><title>The Journal of the Acoustical Society of America</title><description>The accuracy of speech processing applications degrades when operating in co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded but still useful for speech processing applications (such as speaker identification or speech recognition) which do not work in co-channel environments. Usable speech measures are features that are extracted from the co-channel signal to distinguish between usable and unusable speech. Several usable speech extraction methods have recently been developed based on a single feature of the speech signal being considered. In this paper, however, a new usable speech extraction technique, which sequentially and contextually selects several features of the given signal using the K-nearest neighbor classifier, is being investigated. This new approach considers periodicity and structure based features simultaneously in order to achieve the maximum classification rate, and by observing all the incoming frames, avoids the problem of deciding the amount of data needed to make accurate decisions. A 100% accuracy can be achieved in speech processing applications by using this extracted usable speech segment.</description><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNqVj72OwjAQhC0EEuGn4A3cUoTzJiQiNdwJifb6yJg1GBk7eA2CtycIRI-mGH0z0wxjExAzgAx-YAZlWeWQd1gCRSbSRZHNuywRQkA6r8qyzwZExxaLRV4l7LjCK1rfnNBF7jVX3kW8Rb7DBt3uGRKeL60bafkmdSgDUuQOzf6w9YErK4mMNhi4bvFCcmuRU4OoDp9SyWi8G7GelpZw_PYhm_79_i_XqQqeKKCum2BOMtxrEPXzTN3qdSb_ZvsAscVRIg</recordid><startdate>20040501</startdate><enddate>20040501</enddate><creator>Shah, J. K.</creator><creator>Iyer, A. N.</creator><creator>Smolenski, B. Y.</creator><creator>Yantorno, R. E.</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20040501</creationdate><title>Development of context dependent sequential K-nearest neighbor classifier for usable speech classification</title><author>Shah, J. K. ; Iyer, A. N. ; Smolenski, B. Y. ; Yantorno, R. E.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-crossref_primary_10_1121_1_16693133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shah, J. K.</creatorcontrib><creatorcontrib>Iyer, A. N.</creatorcontrib><creatorcontrib>Smolenski, B. Y.</creatorcontrib><creatorcontrib>Yantorno, R. E.</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shah, J. K.</au><au>Iyer, A. N.</au><au>Smolenski, B. Y.</au><au>Yantorno, R. E.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Development of context dependent sequential K-nearest neighbor classifier for usable speech classification</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><date>2004-05-01</date><risdate>2004</risdate><volume>115</volume><issue>5_Supplement</issue><spage>2427</spage><epage>2427</epage><pages>2427-2427</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><abstract>The accuracy of speech processing applications degrades when operating in co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded but still useful for speech processing applications (such as speaker identification or speech recognition) which do not work in co-channel environments. Usable speech measures are features that are extracted from the co-channel signal to distinguish between usable and unusable speech. Several usable speech extraction methods have recently been developed based on a single feature of the speech signal being considered. In this paper, however, a new usable speech extraction technique, which sequentially and contextually selects several features of the given signal using the K-nearest neighbor classifier, is being investigated. This new approach considers periodicity and structure based features simultaneously in order to achieve the maximum classification rate, and by observing all the incoming frames, avoids the problem of deciding the amount of data needed to make accurate decisions. A 100% accuracy can be achieved in speech processing applications by using this extracted usable speech segment.</abstract><doi>10.1121/1.1669313</doi></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2004-05, Vol.115 (5_Supplement), p.2427-2427
issn 0001-4966
1520-8524
language eng
recordid cdi_crossref_primary_10_1121_1_1669313
source American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)
title Development of context dependent sequential K-nearest neighbor classifier for usable speech classification
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T22%3A21%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Development%20of%20context%20dependent%20sequential%20K-nearest%20neighbor%20classifier%20for%20usable%20speech%20classification&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Shah,%20J.%20K.&rft.date=2004-05-01&rft.volume=115&rft.issue=5_Supplement&rft.spage=2427&rft.epage=2427&rft.pages=2427-2427&rft.issn=0001-4966&rft.eissn=1520-8524&rft_id=info:doi/10.1121/1.1669313&rft_dat=%3Ccrossref%3E10_1121_1_1669313%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-crossref_primary_10_1121_1_16693133%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true