On the Use of Minhash and Locality Sensitive Hashing for Detecting Similar Lyrics
Published in: | Engineering letters 2022-02, Vol.30 (1), p.227 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Summary: | In this paper, we propose a retrieval system based on similarities between songs. We consider the similarity of songs with respect to their lyrics, emotions, genres, or a combination of these attributes. To detect similar lyrics, we applied both minhash and locality-sensitive hashing (LSH) to a set of songs. We also applied the Watson Tone Analyzer service to detect emotions. Although experiments with more songs are necessary, our results revealed no cases of, e.g., lyrics plagiarism. This finding suggests, at least from a textual point of view, that lyricists are careful in this matter. To validate our proposal, we also added some artificially similar songs to our set. Although there were false positives and false negatives, as expected with LSH, this experiment supported the validity of our proposal. |
ISSN: | 1816-093X 1816-0948 |
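
The summary names minhash and LSH as the techniques used to detect similar lyrics but gives no implementation details. The following is a minimal sketch of how the two are commonly combined, not the authors' implementation. The function names, the 3-word shingles, the 64 hash functions, and the 16 bands are all illustrative assumptions.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def shingles(text, k=3):
    """Split lyrics into a set of overlapping k-word shingles (k is an assumption)."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(max(1, len(words) - k + 1))}


def _hash(seed, token):
    """Deterministic 64-bit hash of a token; each seed acts as one hash function."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(shingle_set, num_perm=64):
    """Minhash signature: the minimum hash value under each seeded hash function."""
    return [min(_hash(seed, s) for s in shingle_set) for seed in range(num_perm)]


def estimated_jaccard(sig_a, sig_b):
    """The fraction of matching signature positions estimates Jaccard similarity."""
    return sum(x == y for x, y in zip(sig_a, sig_b)) / len(sig_a)


def lsh_candidates(signatures, bands=16):
    """Return pairs of songs whose signatures agree on every row of some band."""
    rows = len(next(iter(signatures.values()))) // bands
    buckets = defaultdict(set)
    for name, sig in signatures.items():
        for b in range(bands):
            key = (b, tuple(sig[b * rows:(b + 1) * rows]))
            buckets[key].add(name)
    pairs = set()
    for names in buckets.values():
        for a, b in combinations(sorted(names), 2):
            pairs.add((a, b))
    return pairs
```

Candidate pairs returned by `lsh_candidates` would normally be re-checked with `estimated_jaccard` (or the exact Jaccard similarity of the shingle sets) against a threshold. Banding only filters candidates probabilistically, which is why both false positives and false negatives are expected.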