Loading…

On the Use of Minhash and Locality Sensitive Hashing for Detecting Similar Lyrics

In this paper, we propose a retrieval system based on similarities between songs. We consider the similarity of songs regarding their lyrics, emotions, genres, or a combination of these attributes. To detect similar lyrics, we applied both minhash and locality-sensitive hashing (LSH) methods to a se...

Full description

Saved in:
Bibliographic Details
Published in:Engineering letters 2022-02, Vol.30 (1), p.227
Main Authors: Arboleda, Francisco Javier Moreno, Norena, Felipe Cortes, Alvarez, Benjamin Cruz
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, we propose a retrieval system based on similarities between songs. We consider the similarity of songs regarding their lyrics, emotions, genres, or a combination of these attributes. To detect similar lyrics, we applied both minhash and locality-sensitive hashing (LSH) methods to a set of songs. We also applied the Watson Tone Analyzer service for detecting emotions. Although experiments with more songs are necessary, our results did not show, e.g., lyrics plagiarism. This finding suggests, at least from a textual point of view, that lyricists are careful on this matter. We also included some artificial similar songs in our set of songs to validate our proposal. Although there were false positives and true negatives, as expected in LSH, this experiment showed the fairness of our proposal.
ISSN:1816-093X
1816-0948