
Music structure analysis using self-similarity matrix and two-stage categorization

Bibliographic Details
Published in: Multimedia tools and applications, 2015-01, Vol. 74 (1), p. 287-302
Main Authors: Jun, Sanghoon; Rho, Seungmin; Hwang, Eenjun
Format: Article
Language: English
container_end_page 302
container_issue 1
container_start_page 287
container_title Multimedia tools and applications
container_volume 74
creator Jun, Sanghoon
Rho, Seungmin
Hwang, Eenjun
description Music tends to have a distinct structure built from the repetition and variation of components such as verse and chorus. Understanding this structure and its patterns has become increasingly important for music information retrieval (MIR). Many methods for music segmentation and structure analysis have been proposed, each with its own advantages and disadvantages. Given the significant variations in timbre, articulation, and tempo found in music, the task remains challenging. In this paper, we propose a novel method for music segmentation and structure analysis. We first extract the timbre feature from the acoustic music signal and construct a self-similarity matrix that shows the similarities among the features within the music clip. We then determine the candidate boundaries for music segmentation by tracking the standard deviation in the matrix. Finally, we perform two-stage categorization: (i) categorization of the segments in a music clip on the basis of the timbre feature and (ii) categorization of segments in the same category on the basis of the successive chromagram features. In this way, each music clip is represented by a sequence of states, where each state represents a category defined by the two-stage categorization. We evaluate the performance of the proposed method through experiments.
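The first two steps named in the description (a self-similarity matrix over per-frame features, then candidate boundaries from the standard deviation in that matrix) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which timbre feature, similarity measure, or window is used, so the cosine similarity, the sliding square window along the diagonal, the `half_window` parameter, the mean-plus-one-standard-deviation threshold, and the synthetic two-section "clip" are all assumptions made for illustration.

```python
import numpy as np


def self_similarity(features):
    """Cosine self-similarity matrix for a (frames x dims) feature array.

    Cosine similarity is an assumption; the record does not name the measure.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)  # guard against zero vectors
    return unit @ unit.T


def candidate_boundaries(ssm, half_window=4):
    """Return (peaks, score): frames where the local std of the SSM peaks.

    A square window slides along the main diagonal. Inside a homogeneous
    section the window is nearly constant (low std); when it straddles two
    sections it mixes high and low similarities (high std).
    """
    n = ssm.shape[0]
    score = np.zeros(n)
    for t in range(half_window, n - half_window):
        lo, hi = t - half_window, t + half_window
        score[t] = ssm[lo:hi, lo:hi].std()
    # Hypothetical threshold: one standard deviation above the mean score.
    threshold = score.mean() + score.std()
    peaks = [
        t for t in range(1, n - 1)
        if score[t] > threshold
        and score[t] >= score[t - 1]
        and score[t] > score[t + 1]
    ]
    return peaks, score


# Synthetic stand-in for timbre features: 20 frames of one "section"
# followed by 20 frames of a different one.
section_a = np.tile([1.0, 0.0], (20, 1))
section_b = np.tile([0.0, 1.0], (20, 1))
features = np.vstack([section_a, section_b])

ssm = self_similarity(features)
peaks, score = candidate_boundaries(ssm)
print(peaks)
```

With real audio, the synthetic `features` array would be replaced by per-frame timbre descriptors, and the segments between boundaries would then feed the two-stage categorization (timbre first, chromagram second), which this sketch does not attempt.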
doi_str_mv 10.1007/s11042-013-1761-9
format article
publisher Boston: Springer US
rights Springer Science+Business Media New York 2013; Springer Science+Business Media New York 2015
pages 287-302 (16 pages)
fulltext fulltext
identifier ISSN: 1380-7501
ispartof Multimedia tools and applications, 2015-01, Vol.74 (1), p.287-302
issn 1380-7501
1573-7721
language eng
recordid cdi_proquest_miscellaneous_1669857825
source ABI/INFORM Global; Springer Nature
subjects Acoustic music
Analysis
Boundaries
Categories
Clips
Clustering
Computer Communication Networks
Computer engineering
Computer Science
Data Structures and Information Theory
Digital music
Feature extraction
Information retrieval
Methods
Multimedia Information Systems
Music
Popular music
Segmentation
Segments
Self-similarity
Signal processing
Special Purpose and Application-Based Systems
Standard deviation
Studies
Tracking
title Music structure analysis using self-similarity matrix and two-stage categorization
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T17%3A18%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Music%20structure%20analysis%20using%20self-similarity%20matrix%20and%20two-stage%20categorization&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Jun,%20Sanghoon&rft.date=2015-01-01&rft.volume=74&rft.issue=1&rft.spage=287&rft.epage=302&rft.pages=287-302&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-013-1761-9&rft_dat=%3Cproquest_cross%3E1669857825%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1646982979&rft_id=info:pmid/&rfr_iscdi=true