Loading…

An incremental technique for real-time bioacoustic signal segmentation

•An incremental transformation of ZCR and energy without using temporal windows.•With our method is possible to save memory and transmission costs.•Solution to process large amounts of data by resource-constrained devices as WSN. A bioacoustical animal recognition system is composed of two parts: (1...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2015-11, Vol.42 (21), p.7367-7374
Main Authors: Colonna, Juan Gabriel, Cristo, Marco, Salvatierra, Mario, Nakamura, Eduardo Freire
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•An incremental transformation of ZCR and energy without using temporal windows.•With our method is possible to save memory and transmission costs.•Solution to process large amounts of data by resource-constrained devices as WSN. A bioacoustical animal recognition system is composed of two parts: (1) the segmenter, responsible for detecting syllables (animal vocalization) in the audio; and (2) the classifier, which determines the species/animal whose the syllables belong to. In this work, we first present a novel technique for automatic segmentation of anuran calls in real time; then, we present a method to assess the performance of the whole system. The proposed segmentation method performs an unsupervised binary classification of time series (audio) that incrementally computes two exponentially-weighted features (Energy and Zero Crossing Rate). In our proposal, classical sliding temporal windows are replaced with counters that give higher weights to new data, allowing us to distinguish between a syllable and ambient noise (considered as silences). Compared to sliding-window approaches, the associated memory cost of our proposal is lower, and processing speed is higher. Our evaluation of the segmentation component considers three metrics: (1) the Matthews Correlation Coefficient for point-to-point comparison; (2) the WinPR to quantify the precision of boundaries; and (3) the AEER for event-to-event counting. The experiments were carried out in a dataset with 896 syllables of seven different species of anurans. To evaluate the whole system, we derived four equations that helps understand the impact that the precision and recall of the segmentation component has on the classification task. Finally, our experiments show a segmentation/recognition improvement of 37%, while reducing memory and data communication. Therefore, results suggest that our proposal is suitable for resource-constrained systems, such as Wireless Sensor Networks (WSNs).
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2015.05.030