Loading…

Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator

A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relat...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America 2007-11, Vol.122 (5_Supplement), p.2960-2961
Main Authors: Camacho, Arturo, Harris, John G.
Format: Article
Language:English
Citations: Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A powerful pitch estimation algorithm called SWIPE has been developed for processing speech and music. SWIPE is shown to outperform existing algorithms on several publicly available speech and musical instrument databases, and a disordered speech database, reducing the gross error rate by 40%, relative to the best competing algorithm. In short, SWIPE estimates the pitch as the fundamental frequency of a sawtooth waveform, whose spectrum best matches the spectrum of the input signal. The short-time Fourier transform of the sawtooth waveform provides an extension to older frequency-based, sieve-type estimation algorithms by providing smooth peaks with decaying amplitudes to correlate with the fundamental frequency (if present) and its harmonics. An improvement on the algorithm is achieved by using only the first and prime harmonics, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms.
ISSN:0001-4966
1520-8524
DOI:10.1121/1.2942550