Loading…

Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator

A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relat...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America 2007-11, Vol.122 (5_Supplement), p.2960-2961
Main Authors: Camacho, Arturo, Harris, John G.
Format: Article
Language:English
Citations: Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643
cites
container_end_page 2961
container_issue 5_Supplement
container_start_page 2960
container_title The Journal of the Acoustical Society of America
container_volume 122
creator Camacho, Arturo
Harris, John G.
description A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.
doi_str_mv 10.1121/1.2942550
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_85663397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>85663397</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643</originalsourceid><addsrcrecordid>eNotUMtOwzAQtBBIlMKBP_AJiUOKN37miCpeUiUuwNVybYcaJXGwEyr-HqP2sq-ZnV0NQtdAVgA13MGqbljNOTlBC-A1qRSv2SlaEEKgYo0Q5-gi56_SckWbBfpYx36cpzB84jFMdodji_PofanM4HA_52BxCQU3OJv9FOO0w3vz49uYehyGPIbk3XHZ5yn0ZorpEp21psv-6piX6P3x4W39XG1en17W95vKAuGyMkw5IahrvSOyUY0zSgqxZR4sF8YzRbgxQLmTdZlSJQsA0kIrC7YVjC7RzUF3TPF7Lud1H7L1XWcGH-esFS_ytJGFeHsg2hRzTr7VYyqvpl8NRP87p0EfnaN_1clgiw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>85663397</pqid></control><display><type>article</type><title>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><source>Linguistics and Language Behavior Abstracts (LLBA)</source><creator>Camacho, Arturo ; Harris, John G.</creator><creatorcontrib>Camacho, Arturo ; Harris, John G.</creatorcontrib><description>A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.2942550</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><ispartof>The Journal of the Acoustical Society of America, 2007-11, Vol.122 (5_Supplement), p.2960-2961</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,31270</link.rule.ids></links><search><creatorcontrib>Camacho, Arturo</creatorcontrib><creatorcontrib>Harris, John G.</creatorcontrib><title>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</title><title>The Journal of the Acoustical Society of America</title><description>A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.</description><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>7T9</sourceid><recordid>eNotUMtOwzAQtBBIlMKBP_AJiUOKN37miCpeUiUuwNVybYcaJXGwEyr-HqP2sq-ZnV0NQtdAVgA13MGqbljNOTlBC-A1qRSv2SlaEEKgYo0Q5-gi56_SckWbBfpYx36cpzB84jFMdodji_PofanM4HA_52BxCQU3OJv9FOO0w3vz49uYehyGPIbk3XHZ5yn0ZorpEp21psv-6piX6P3x4W39XG1en17W95vKAuGyMkw5IahrvSOyUY0zSgqxZR4sF8YzRbgxQLmTdZlSJQsA0kIrC7YVjC7RzUF3TPF7Lud1H7L1XWcGH-esFS_ytJGFeHsg2hRzTr7VYyqvpl8NRP87p0EfnaN_1clgiw</recordid><startdate>20071101</startdate><enddate>20071101</enddate><creator>Camacho, Arturo</creator><creator>Harris, John G.</creator><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>20071101</creationdate><title>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</title><author>Camacho, Arturo ; Harris, John G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2007</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Camacho, Arturo</creatorcontrib><creatorcontrib>Harris, John G.</creatorcontrib><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Camacho, Arturo</au><au>Harris, John G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><date>2007-11-01</date><risdate>2007</risdate><volume>122</volume><issue>5_Supplement</issue><spage>2960</spage><epage>2961</epage><pages>2960-2961</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.</abstract><doi>10.1121/1.2942550</doi><tpages>2</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2007-11, Vol.122 (5_Supplement), p.2960-2961
issn 0001-4966
1520-8524
language eng
recordid cdi_proquest_miscellaneous_85663397
source American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list); Linguistics and Language Behavior Abstracts (LLBA)
title Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T19%3A43%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Computing%20pitch%20of%20speech%20and%20music%20using%20a%20sawtooth%20waveform%20inspired%20pitch%20estimator&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Camacho,%20Arturo&rft.date=2007-11-01&rft.volume=122&rft.issue=5_Supplement&rft.spage=2960&rft.epage=2961&rft.pages=2960-2961&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.2942550&rft_dat=%3Cproquest_cross%3E85663397%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1057-a48d663dfed07989da8766b4e1c56ae4805aa135d726b4387e1c17c1f7e48b643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=85663397&rft_id=info:pmid/&rfr_iscdi=true