Loading…

Music Tempo Estimation via Neural Networks -- A Comparative Analysis

This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2021-07
Main Authors: Mila Soares de Oliveira de Souza, Pedro Nuno de Souza Moura, Briot, Jean-Pierre
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Mila Soares de Oliveira de Souza
Pedro Nuno de Souza Moura
Briot, Jean-Pierre
description This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo in bpm (beats per minutes) of musical pieces, without using external auxiliary modules. An extensive database (12,550 pieces in total) was curated to conduct a quantitative and qualitative analysis over the experiment. Percussion-only tracks were also included in the dataset. The performance of the B-RNN is compared to that of state-of-the-art models. For further comparison, a state-of-the-art CNN was also retrained with the same datasets used for the B-RNN training. Evaluation results for each model and datasets are presented and discussed, as well as observations and ideas for future research. Tempo estimation was more accurate for the percussion only dataset, suggesting that the estimation can be more accurate for percussion-only tracks, although further experiments (with more of such datasets) should be made to gather stronger evidence.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2553630938</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2553630938</sourcerecordid><originalsourceid>FETCH-proquest_journals_25536309383</originalsourceid><addsrcrecordid>eNqNzUELgjAYgOERBEn5Hz7oPFhbmh3FjC518i5DFszUrX2b0b9vQT-g03t54F2QhAuxo8We8xVJEXvGGM8PPMtEQk7XgLqDRo3WQI1ej9JrM8GsJdxUcHKI8S_jHgiUQgmVGa10Ec0KykkOb9S4Icu7HFClv67J9lw31YVaZ55BoW97E1zE2H6vuWBHUYj_1AfXFznC</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2553630938</pqid></control><display><type>article</type><title>Music Tempo Estimation via Neural Networks -- A Comparative Analysis</title><source>Publicly Available Content Database</source><creator>Mila Soares de Oliveira de Souza ; Pedro Nuno de Souza Moura ; Briot, Jean-Pierre</creator><creatorcontrib>Mila Soares de Oliveira de Souza ; Pedro Nuno de Souza Moura ; Briot, Jean-Pierre</creatorcontrib><description>This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo in bpm (beats per minutes) of musical pieces, without using external auxiliary modules. An extensive database (12,550 pieces in total) was curated to conduct a quantitative and qualitative analysis over the experiment. Percussion-only tracks were also included in the dataset. The performance of the B-RNN is compared to that of state-of-the-art models. For further comparison, a state-of-the-art CNN was also retrained with the same datasets used for the B-RNN training. Evaluation results for each model and datasets are presented and discussed, as well as observations and ideas for future research. Tempo estimation was more accurate for the percussion only dataset, suggesting that the estimation can be more accurate for percussion-only tracks, although further experiments (with more of such datasets) should be made to gather stronger evidence.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Comparative analysis ; Datasets ; Neural networks ; Percussion ; Qualitative analysis ; Recurrent neural networks ; Training</subject><ispartof>arXiv.org, 2021-07</ispartof><rights>2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2553630938?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Mila Soares de Oliveira de Souza</creatorcontrib><creatorcontrib>Pedro Nuno de Souza Moura</creatorcontrib><creatorcontrib>Briot, Jean-Pierre</creatorcontrib><title>Music Tempo Estimation via Neural Networks -- A Comparative Analysis</title><title>arXiv.org</title><description>This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo in bpm (beats per minutes) of musical pieces, without using external auxiliary modules. An extensive database (12,550 pieces in total) was curated to conduct a quantitative and qualitative analysis over the experiment. Percussion-only tracks were also included in the dataset. The performance of the B-RNN is compared to that of state-of-the-art models. For further comparison, a state-of-the-art CNN was also retrained with the same datasets used for the B-RNN training. Evaluation results for each model and datasets are presented and discussed, as well as observations and ideas for future research. Tempo estimation was more accurate for the percussion only dataset, suggesting that the estimation can be more accurate for percussion-only tracks, although further experiments (with more of such datasets) should be made to gather stronger evidence.</description><subject>Artificial neural networks</subject><subject>Comparative analysis</subject><subject>Datasets</subject><subject>Neural networks</subject><subject>Percussion</subject><subject>Qualitative analysis</subject><subject>Recurrent neural networks</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNzUELgjAYgOERBEn5Hz7oPFhbmh3FjC518i5DFszUrX2b0b9vQT-g03t54F2QhAuxo8We8xVJEXvGGM8PPMtEQk7XgLqDRo3WQI1ej9JrM8GsJdxUcHKI8S_jHgiUQgmVGa10Ec0KykkOb9S4Icu7HFClv67J9lw31YVaZ55BoW97E1zE2H6vuWBHUYj_1AfXFznC</recordid><startdate>20210720</startdate><enddate>20210720</enddate><creator>Mila Soares de Oliveira de Souza</creator><creator>Pedro Nuno de Souza Moura</creator><creator>Briot, Jean-Pierre</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PIMPY</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210720</creationdate><title>Music Tempo Estimation via Neural Networks -- A Comparative Analysis</title><author>Mila Soares de Oliveira de Souza ; Pedro Nuno de Souza Moura ; Briot, Jean-Pierre</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25536309383</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Artificial neural networks</topic><topic>Comparative analysis</topic><topic>Datasets</topic><topic>Neural networks</topic><topic>Percussion</topic><topic>Qualitative analysis</topic><topic>Recurrent neural networks</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Mila Soares de Oliveira de Souza</creatorcontrib><creatorcontrib>Pedro Nuno de Souza Moura</creatorcontrib><creatorcontrib>Briot, Jean-Pierre</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied &amp; Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mila Soares de Oliveira de Souza</au><au>Pedro Nuno de Souza Moura</au><au>Briot, Jean-Pierre</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Music Tempo Estimation via Neural Networks -- A Comparative Analysis</atitle><jtitle>arXiv.org</jtitle><date>2021-07-20</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo in bpm (beats per minutes) of musical pieces, without using external auxiliary modules. An extensive database (12,550 pieces in total) was curated to conduct a quantitative and qualitative analysis over the experiment. Percussion-only tracks were also included in the dataset. The performance of the B-RNN is compared to that of state-of-the-art models. For further comparison, a state-of-the-art CNN was also retrained with the same datasets used for the B-RNN training. Evaluation results for each model and datasets are presented and discussed, as well as observations and ideas for future research. Tempo estimation was more accurate for the percussion only dataset, suggesting that the estimation can be more accurate for percussion-only tracks, although further experiments (with more of such datasets) should be made to gather stronger evidence.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-07
issn 2331-8422
language eng
recordid cdi_proquest_journals_2553630938
source Publicly Available Content Database
subjects Artificial neural networks
Comparative analysis
Datasets
Neural networks
Percussion
Qualitative analysis
Recurrent neural networks
Training
title Music Tempo Estimation via Neural Networks -- A Comparative Analysis
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T15%3A50%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Music%20Tempo%20Estimation%20via%20Neural%20Networks%20--%20A%20Comparative%20Analysis&rft.jtitle=arXiv.org&rft.au=Mila%20Soares%20de%20Oliveira%20de%20Souza&rft.date=2021-07-20&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2553630938%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_25536309383%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2553630938&rft_id=info:pmid/&rfr_iscdi=true