Loading…
Music structure analysis using self-similarity matrix and two-stage categorization
Music tends to have a distinct structure consisting of repetition and variation of components such as verse and chorus. Understanding such a music structure and its pattern has become increasingly important for music information retrieval (MIR). Thus far, many different methods for music segmentatio...
Saved in:
Published in: | Multimedia tools and applications 2015-01, Vol.74 (1), p.287-302 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233 |
---|---|
cites | cdi_FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233 |
container_end_page | 302 |
container_issue | 1 |
container_start_page | 287 |
container_title | Multimedia tools and applications |
container_volume | 74 |
creator | Jun, Sanghoon Rho, Seungmin Hwang, Eenjun |
description | Music tends to have a distinct structure consisting of repetition and variation of components such as verse and chorus. Understanding such a music structure and its pattern has become increasingly important for music information retrieval (MIR). Thus far, many different methods for music segmentation and structure analysis have been proposed; however, each method has its advantages and disadvantages. By considering the significant variations in timbre, articulation and tempo of music, this is still a challenging task. In this paper, we propose a novel method for music segmentation and its structure analysis. For this, we first extract the timbre feature from the acoustic music signal and construct a self-similarity matrix that shows the similarities among the features within the music clip. Further, we determine the candidate boundaries for music segmentation by tracking the standard deviation in the matrix. Furthermore, we perform two-stage categorization: (i) categorization of the segments in a music clip on the basis of the timbre feature and (ii) categorization of segments in the same category on the basis of the successive chromagram features. In this way, each music clip is represented by a sequence of states where each state represents a certain category defined by two-stage categorization. We show the performance of our proposed method through experiments. |
doi_str_mv | 10.1007/s11042-013-1761-9 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1669857825</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1669857825</sourcerecordid><originalsourceid>FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233</originalsourceid><addsrcrecordid>eNp10MtKxDAUBuAgCo6jD-Cu4MZNNNemWcrgDUYE0XXItGnJ0MuYk6Lj05tSFyK4Sgjff8j5ETqn5IoSoq6BUiIYJpRjqnKK9QFaUKk4VorRw3TnBcFKEnqMTgC2hNBcMrFAL08j-DKDGMYyjsFltrftHjxk6b1vMnBtjcF3vrXBx33W2Rj8Z1JVFj8GDNE2LittdM0Q_JeNfuhP0VFtW3BnP-cSvd3dvq4e8Pr5_nF1s8aloDriiinLq3qjRV1UOq85sUVeKiFzTaSlghVKpT86YWsuCmGllcS5TaErqQrBOF-iy3nuLgzvo4NoOg-la1vbu2EEQ_NcF8kymejFH7odxpA2nZRIjGmlk6KzKsMAEFxtdsF3NuwNJWZq2cwtm9SymVo2U4bNGUi2b1z4Nfnf0Dedw39Z</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1646982979</pqid></control><display><type>article</type><title>Music structure analysis using self-similarity matrix and two-stage categorization</title><source>ABI/INFORM Global</source><source>Springer Nature</source><creator>Jun, Sanghoon ; Rho, Seungmin ; Hwang, Eenjun</creator><creatorcontrib>Jun, Sanghoon ; Rho, Seungmin ; Hwang, Eenjun</creatorcontrib><description>Music tends to have a distinct structure consisting of repetition and variation of components such as verse and chorus. Understanding such a music structure and its pattern has become increasingly important for music information retrieval (MIR). Thus far, many different methods for music segmentation and structure analysis have been proposed; however, each method has its advantages and disadvantages. By considering the significant variations in timbre, articulation and tempo of music, this is still a challenging task. In this paper, we propose a novel method for music segmentation and its structure analysis. For this, we first extract the timbre feature from the acoustic music signal and construct a self-similarity matrix that shows the similarities among the features within the music clip. Further, we determine the candidate boundaries for music segmentation by tracking the standard deviation in the matrix. Furthermore, we perform two-stage categorization: (i) categorization of the segments in a music clip on the basis of the timbre feature and (ii) categorization of segments in the same category on the basis of the successive chromagram features. In this way, each music clip is represented by a sequence of states where each state represents a certain category defined by two-stage categorization. We show the performance of our proposed method through experiments.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-013-1761-9</identifier><language>eng</language><publisher>Boston: Springer US</publisher><subject>Acoustic music ; Analysis ; Boundaries ; Categories ; Clips ; Clustering ; Computer Communication Networks ; Computer engineering ; Computer Science ; Data Structures and Information Theory ; Digital music ; Feature extraction ; Information retrieval ; Methods ; Multimedia Information Systems ; Music ; Popular music ; Segmentation ; Segments ; Self-similarity ; Signal processing ; Special Purpose and Application-Based Systems ; Standard deviation ; Studies ; Tracking</subject><ispartof>Multimedia tools and applications, 2015-01, Vol.74 (1), p.287-302</ispartof><rights>Springer Science+Business Media New York 2013</rights><rights>Springer Science+Business Media New York 2015</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233</citedby><cites>FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1646982979/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1646982979?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,776,780,11668,27903,27904,36039,36040,44342,74642</link.rule.ids></links><search><creatorcontrib>Jun, Sanghoon</creatorcontrib><creatorcontrib>Rho, Seungmin</creatorcontrib><creatorcontrib>Hwang, Eenjun</creatorcontrib><title>Music structure analysis using self-similarity matrix and two-stage categorization</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Music tends to have a distinct structure consisting of repetition and variation of components such as verse and chorus. Understanding such a music structure and its pattern has become increasingly important for music information retrieval (MIR). Thus far, many different methods for music segmentation and structure analysis have been proposed; however, each method has its advantages and disadvantages. By considering the significant variations in timbre, articulation and tempo of music, this is still a challenging task. In this paper, we propose a novel method for music segmentation and its structure analysis. For this, we first extract the timbre feature from the acoustic music signal and construct a self-similarity matrix that shows the similarities among the features within the music clip. Further, we determine the candidate boundaries for music segmentation by tracking the standard deviation in the matrix. Furthermore, we perform two-stage categorization: (i) categorization of the segments in a music clip on the basis of the timbre feature and (ii) categorization of segments in the same category on the basis of the successive chromagram features. In this way, each music clip is represented by a sequence of states where each state represents a certain category defined by two-stage categorization. We show the performance of our proposed method through experiments.</description><subject>Acoustic music</subject><subject>Analysis</subject><subject>Boundaries</subject><subject>Categories</subject><subject>Clips</subject><subject>Clustering</subject><subject>Computer Communication Networks</subject><subject>Computer engineering</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Digital music</subject><subject>Feature extraction</subject><subject>Information retrieval</subject><subject>Methods</subject><subject>Multimedia Information Systems</subject><subject>Music</subject><subject>Popular music</subject><subject>Segmentation</subject><subject>Segments</subject><subject>Self-similarity</subject><subject>Signal processing</subject><subject>Special Purpose and Application-Based Systems</subject><subject>Standard deviation</subject><subject>Studies</subject><subject>Tracking</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp10MtKxDAUBuAgCo6jD-Cu4MZNNNemWcrgDUYE0XXItGnJ0MuYk6Lj05tSFyK4Sgjff8j5ETqn5IoSoq6BUiIYJpRjqnKK9QFaUKk4VorRw3TnBcFKEnqMTgC2hNBcMrFAL08j-DKDGMYyjsFltrftHjxk6b1vMnBtjcF3vrXBx33W2Rj8Z1JVFj8GDNE2LittdM0Q_JeNfuhP0VFtW3BnP-cSvd3dvq4e8Pr5_nF1s8aloDriiinLq3qjRV1UOq85sUVeKiFzTaSlghVKpT86YWsuCmGllcS5TaErqQrBOF-iy3nuLgzvo4NoOg-la1vbu2EEQ_NcF8kymejFH7odxpA2nZRIjGmlk6KzKsMAEFxtdsF3NuwNJWZq2cwtm9SymVo2U4bNGUi2b1z4Nfnf0Dedw39Z</recordid><startdate>20150101</startdate><enddate>20150101</enddate><creator>Jun, Sanghoon</creator><creator>Rho, Seungmin</creator><creator>Hwang, Eenjun</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20150101</creationdate><title>Music structure analysis using self-similarity matrix and two-stage categorization</title><author>Jun, Sanghoon ; Rho, Seungmin ; Hwang, Eenjun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Acoustic music</topic><topic>Analysis</topic><topic>Boundaries</topic><topic>Categories</topic><topic>Clips</topic><topic>Clustering</topic><topic>Computer Communication Networks</topic><topic>Computer engineering</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Digital music</topic><topic>Feature extraction</topic><topic>Information retrieval</topic><topic>Methods</topic><topic>Multimedia Information Systems</topic><topic>Music</topic><topic>Popular music</topic><topic>Segmentation</topic><topic>Segments</topic><topic>Self-similarity</topic><topic>Signal processing</topic><topic>Special Purpose and Application-Based Systems</topic><topic>Standard deviation</topic><topic>Studies</topic><topic>Tracking</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jun, Sanghoon</creatorcontrib><creatorcontrib>Rho, Seungmin</creatorcontrib><creatorcontrib>Hwang, Eenjun</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jun, Sanghoon</au><au>Rho, Seungmin</au><au>Hwang, Eenjun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Music structure analysis using self-similarity matrix and two-stage categorization</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2015-01-01</date><risdate>2015</risdate><volume>74</volume><issue>1</issue><spage>287</spage><epage>302</epage><pages>287-302</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Music tends to have a distinct structure consisting of repetition and variation of components such as verse and chorus. Understanding such a music structure and its pattern has become increasingly important for music information retrieval (MIR). Thus far, many different methods for music segmentation and structure analysis have been proposed; however, each method has its advantages and disadvantages. By considering the significant variations in timbre, articulation and tempo of music, this is still a challenging task. In this paper, we propose a novel method for music segmentation and its structure analysis. For this, we first extract the timbre feature from the acoustic music signal and construct a self-similarity matrix that shows the similarities among the features within the music clip. Further, we determine the candidate boundaries for music segmentation by tracking the standard deviation in the matrix. Furthermore, we perform two-stage categorization: (i) categorization of the segments in a music clip on the basis of the timbre feature and (ii) categorization of segments in the same category on the basis of the successive chromagram features. In this way, each music clip is represented by a sequence of states where each state represents a certain category defined by two-stage categorization. We show the performance of our proposed method through experiments.</abstract><cop>Boston</cop><pub>Springer US</pub><doi>10.1007/s11042-013-1761-9</doi><tpages>16</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1380-7501 |
ispartof | Multimedia tools and applications, 2015-01, Vol.74 (1), p.287-302 |
issn | 1380-7501 1573-7721 |
language | eng |
recordid | cdi_proquest_miscellaneous_1669857825 |
source | ABI/INFORM Global; Springer Nature |
subjects | Acoustic music Analysis Boundaries Categories Clips Clustering Computer Communication Networks Computer engineering Computer Science Data Structures and Information Theory Digital music Feature extraction Information retrieval Methods Multimedia Information Systems Music Popular music Segmentation Segments Self-similarity Signal processing Special Purpose and Application-Based Systems Standard deviation Studies Tracking |
title | Music structure analysis using self-similarity matrix and two-stage categorization |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T17%3A18%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Music%20structure%20analysis%20using%20self-similarity%20matrix%20and%20two-stage%20categorization&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Jun,%20Sanghoon&rft.date=2015-01-01&rft.volume=74&rft.issue=1&rft.spage=287&rft.epage=302&rft.pages=287-302&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-013-1761-9&rft_dat=%3Cproquest_cross%3E1669857825%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c419t-d27a3dfb94f8d96f30a86c7456905a142877652e4af3484a5a50eeb89d5784233%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1646982979&rft_id=info:pmid/&rfr_iscdi=true |