Loading…

A Novel Short Text Clustering Model Based on Grey System Theory

Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature...

Full description

Saved in:
Bibliographic Details
Published in:Arabian journal for science and engineering (2011) 2020-04, Vol.45 (4), p.2865-2882
Main Authors: Fidan, Hüseyin, Yuksel, Mehmet Erkan
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853
cites cdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853
container_end_page 2882
container_issue 4
container_start_page 2865
container_title Arabian journal for science and engineering (2011)
container_volume 45
creator Fidan, Hüseyin
Yuksel, Mehmet Erkan
description Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.
doi_str_mv 10.1007/s13369-019-04191-0
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2386947178</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2386947178</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</originalsourceid><addsrcrecordid>eNp9UE1LAzEQDaJgqf0DngKeVzP52E1OUotWoeqhFbyFbXbWVtqmJlvp_nvTruDNgWEevHlvmEfIJbBrYKy4iSBEbjIGqSUYyNgJ6fEDkFzD6RGLTOXF-zkZxLicM6mFUQCiR26H9MV_44pOFz40dIb7ho5Wu9hgWG4-6LOvEndXRqyo39BxwJZO28Su6WyBPrQX5KwuVxEHv7NP3h7uZ6PHbPI6fhoNJ5kTYJosr51hRggskJt54ZjKU3GjgbnaVAq0FMhLZaRTztTGlbLKa25UAaiZVqJPrjrfbfBfO4yN_fS7sEknLRc6N7KAQqct3m254GMMWNttWK7L0Fpg9pCV7bKyKSt7zMqyJBKdKG4PP2P4s_5H9QPnSmnF</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2386947178</pqid></control><display><type>article</type><title>A Novel Short Text Clustering Model Based on Grey System Theory</title><source>Springer Nature</source><creator>Fidan, Hüseyin ; Yuksel, Mehmet Erkan</creator><creatorcontrib>Fidan, Hüseyin ; Yuksel, Mehmet Erkan</creatorcontrib><description>Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.</description><identifier>ISSN: 2193-567X</identifier><identifier>ISSN: 1319-8025</identifier><identifier>EISSN: 2191-4281</identifier><identifier>DOI: 10.1007/s13369-019-04191-0</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Algorithms ; Clustering ; Datasets ; Electronic commerce ; Engineering ; Failure analysis ; Humanities and Social Sciences ; Model accuracy ; multidisciplinary ; Research Article - Computer Engineering and Computer Science ; Science ; System theory ; Systems theory</subject><ispartof>Arabian journal for science and engineering (2011), 2020-04, Vol.45 (4), p.2865-2882</ispartof><rights>King Fahd University of Petroleum &amp; Minerals 2019</rights><rights>King Fahd University of Petroleum &amp; Minerals 2019.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</citedby><cites>FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Fidan, Hüseyin</creatorcontrib><creatorcontrib>Yuksel, Mehmet Erkan</creatorcontrib><title>A Novel Short Text Clustering Model Based on Grey System Theory</title><title>Arabian journal for science and engineering (2011)</title><addtitle>Arab J Sci Eng</addtitle><description>Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Datasets</subject><subject>Electronic commerce</subject><subject>Engineering</subject><subject>Failure analysis</subject><subject>Humanities and Social Sciences</subject><subject>Model accuracy</subject><subject>multidisciplinary</subject><subject>Research Article - Computer Engineering and Computer Science</subject><subject>Science</subject><subject>System theory</subject><subject>Systems theory</subject><issn>2193-567X</issn><issn>1319-8025</issn><issn>2191-4281</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9UE1LAzEQDaJgqf0DngKeVzP52E1OUotWoeqhFbyFbXbWVtqmJlvp_nvTruDNgWEevHlvmEfIJbBrYKy4iSBEbjIGqSUYyNgJ6fEDkFzD6RGLTOXF-zkZxLicM6mFUQCiR26H9MV_44pOFz40dIb7ho5Wu9hgWG4-6LOvEndXRqyo39BxwJZO28Su6WyBPrQX5KwuVxEHv7NP3h7uZ6PHbPI6fhoNJ5kTYJosr51hRggskJt54ZjKU3GjgbnaVAq0FMhLZaRTztTGlbLKa25UAaiZVqJPrjrfbfBfO4yN_fS7sEknLRc6N7KAQqct3m254GMMWNttWK7L0Fpg9pCV7bKyKSt7zMqyJBKdKG4PP2P4s_5H9QPnSmnF</recordid><startdate>20200401</startdate><enddate>20200401</enddate><creator>Fidan, Hüseyin</creator><creator>Yuksel, Mehmet Erkan</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20200401</creationdate><title>A Novel Short Text Clustering Model Based on Grey System Theory</title><author>Fidan, Hüseyin ; Yuksel, Mehmet Erkan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Datasets</topic><topic>Electronic commerce</topic><topic>Engineering</topic><topic>Failure analysis</topic><topic>Humanities and Social Sciences</topic><topic>Model accuracy</topic><topic>multidisciplinary</topic><topic>Research Article - Computer Engineering and Computer Science</topic><topic>Science</topic><topic>System theory</topic><topic>Systems theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fidan, Hüseyin</creatorcontrib><creatorcontrib>Yuksel, Mehmet Erkan</creatorcontrib><collection>CrossRef</collection><jtitle>Arabian journal for science and engineering (2011)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fidan, Hüseyin</au><au>Yuksel, Mehmet Erkan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Short Text Clustering Model Based on Grey System Theory</atitle><jtitle>Arabian journal for science and engineering (2011)</jtitle><stitle>Arab J Sci Eng</stitle><date>2020-04-01</date><risdate>2020</risdate><volume>45</volume><issue>4</issue><spage>2865</spage><epage>2882</epage><pages>2865-2882</pages><issn>2193-567X</issn><issn>1319-8025</issn><eissn>2191-4281</eissn><abstract>Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s13369-019-04191-0</doi><tpages>18</tpages></addata></record>
fulltext fulltext
identifier ISSN: 2193-567X
ispartof Arabian journal for science and engineering (2011), 2020-04, Vol.45 (4), p.2865-2882
issn 2193-567X
1319-8025
2191-4281
language eng
recordid cdi_proquest_journals_2386947178
source Springer Nature
subjects Algorithms
Clustering
Datasets
Electronic commerce
Engineering
Failure analysis
Humanities and Social Sciences
Model accuracy
multidisciplinary
Research Article - Computer Engineering and Computer Science
Science
System theory
Systems theory
title A Novel Short Text Clustering Model Based on Grey System Theory
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T11%3A04%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Short%20Text%20Clustering%20Model%20Based%20on%20Grey%20System%20Theory&rft.jtitle=Arabian%20journal%20for%20science%20and%20engineering%20(2011)&rft.au=Fidan,%20H%C3%BCseyin&rft.date=2020-04-01&rft.volume=45&rft.issue=4&rft.spage=2865&rft.epage=2882&rft.pages=2865-2882&rft.issn=2193-567X&rft.eissn=2191-4281&rft_id=info:doi/10.1007/s13369-019-04191-0&rft_dat=%3Cproquest_cross%3E2386947178%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2386947178&rft_id=info:pmid/&rfr_iscdi=true