Loading…
Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator
A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relat...
Saved in:
Published in: | The Journal of the Acoustical Society of America 2007-11, Vol.122 (5_Supplement), p.2960-2961 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Citations: | Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643 |
---|---|
cites | |
container_end_page | 2961 |
container_issue | 5_Supplement |
container_start_page | 2960 |
container_title | The Journal of the Acoustical Society of America |
container_volume | 122 |
creator | Camacho, Arturo Harris, John G. |
description | A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms. |
doi_str_mv | 10.1121/1.2942550 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_85663397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>85663397</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643</originalsourceid><addsrcrecordid>eNotUMtOwzAQtBBIlMKBP_AJiUOKN37miCpeUiUuwNVybYcaJXGwEyr-HqP2sq-ZnV0NQtdAVgA13MGqbljNOTlBC-A1qRSv2SlaEEKgYo0Q5-gi56_SckWbBfpYx36cpzB84jFMdodji_PofanM4HA_52BxCQU3OJv9FOO0w3vz49uYehyGPIbk3XHZ5yn0ZorpEp21psv-6piX6P3x4W39XG1en17W95vKAuGyMkw5IahrvSOyUY0zSgqxZR4sF8YzRbgxQLmTdZlSJQsA0kIrC7YVjC7RzUF3TPF7Lud1H7L1XWcGH-esFS_ytJGFeHsg2hRzTr7VYyqvpl8NRP87p0EfnaN_1clgiw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>85663397</pqid></control><display><type>article</type><title>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><source>Linguistics and Language Behavior Abstracts (LLBA)</source><creator>Camacho, Arturo ; Harris, John G.</creator><creatorcontrib>Camacho, Arturo ; Harris, John G.</creatorcontrib><description>A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.2942550</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><ispartof>The Journal of the Acoustical Society of America, 2007-11, Vol.122 (5_Supplement), p.2960-2961</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,31270</link.rule.ids></links><search><creatorcontrib>Camacho, Arturo</creatorcontrib><creatorcontrib>Harris, John G.</creatorcontrib><title>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</title><title>The Journal of the Acoustical Society of America</title><description>A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.</description><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>7T9</sourceid><recordid>eNotUMtOwzAQtBBIlMKBP_AJiUOKN37miCpeUiUuwNVybYcaJXGwEyr-HqP2sq-ZnV0NQtdAVgA13MGqbljNOTlBC-A1qRSv2SlaEEKgYo0Q5-gi56_SckWbBfpYx36cpzB84jFMdodji_PofanM4HA_52BxCQU3OJv9FOO0w3vz49uYehyGPIbk3XHZ5yn0ZorpEp21psv-6piX6P3x4W39XG1en17W95vKAuGyMkw5IahrvSOyUY0zSgqxZR4sF8YzRbgxQLmTdZlSJQsA0kIrC7YVjC7RzUF3TPF7Lud1H7L1XWcGH-esFS_ytJGFeHsg2hRzTr7VYyqvpl8NRP87p0EfnaN_1clgiw</recordid><startdate>20071101</startdate><enddate>20071101</enddate><creator>Camacho, Arturo</creator><creator>Harris, John G.</creator><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>20071101</creationdate><title>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</title><author>Camacho, Arturo ; Harris, John G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2007</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Camacho, Arturo</creatorcontrib><creatorcontrib>Harris, John G.</creatorcontrib><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Camacho, Arturo</au><au>Harris, John G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><date>2007-11-01</date><risdate>2007</risdate><volume>122</volume><issue>5_Supplement</issue><spage>2960</spage><epage>2961</epage><pages>2960-2961</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.</abstract><doi>10.1121/1.2942550</doi><tpages>2</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0001-4966 |
ispartof | The Journal of the Acoustical Society of America, 2007-11, Vol.122 (5_Supplement), p.2960-2961 |
issn | 0001-4966 1520-8524 |
language | eng |
recordid | cdi_proquest_miscellaneous_85663397 |
source | American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list); Linguistics and Language Behavior Abstracts (LLBA) |
title | Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T19%3A43%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Computing%20pitch%20of%20speech%20and%20music%20using%20a%20sawtooth%20waveform%20inspired%20pitch%20estimator&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Camacho,%20Arturo&rft.date=2007-11-01&rft.volume=122&rft.issue=5_Supplement&rft.spage=2960&rft.epage=2961&rft.pages=2960-2961&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.2942550&rft_dat=%3Cproquest_cross%3E85663397%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=85663397&rft_id=info:pmid/&rfr_iscdi=true |