Loading…
A Novel Short Text Clustering Model Based on Grey System Theory
Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature...
Saved in:
Published in: | Arabian journal for science and engineering (2011) 2020-04, Vol.45 (4), p.2865-2882 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853 |
---|---|
cites | cdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853 |
container_end_page | 2882 |
container_issue | 4 |
container_start_page | 2865 |
container_title | Arabian journal for science and engineering (2011) |
container_volume | 45 |
creator | Fidan, Hüseyin Yuksel, Mehmet Erkan |
description | Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results. |
doi_str_mv | 10.1007/s13369-019-04191-0 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2386947178</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2386947178</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</originalsourceid><addsrcrecordid>eNp9UE1LAzEQDaJgqf0DngKeVzP52E1OUotWoeqhFbyFbXbWVtqmJlvp_nvTruDNgWEevHlvmEfIJbBrYKy4iSBEbjIGqSUYyNgJ6fEDkFzD6RGLTOXF-zkZxLicM6mFUQCiR26H9MV_44pOFz40dIb7ho5Wu9hgWG4-6LOvEndXRqyo39BxwJZO28Su6WyBPrQX5KwuVxEHv7NP3h7uZ6PHbPI6fhoNJ5kTYJosr51hRggskJt54ZjKU3GjgbnaVAq0FMhLZaRTztTGlbLKa25UAaiZVqJPrjrfbfBfO4yN_fS7sEknLRc6N7KAQqct3m254GMMWNttWK7L0Fpg9pCV7bKyKSt7zMqyJBKdKG4PP2P4s_5H9QPnSmnF</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2386947178</pqid></control><display><type>article</type><title>A Novel Short Text Clustering Model Based on Grey System Theory</title><source>Springer Nature</source><creator>Fidan, Hüseyin ; Yuksel, Mehmet Erkan</creator><creatorcontrib>Fidan, Hüseyin ; Yuksel, Mehmet Erkan</creatorcontrib><description>Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.</description><identifier>ISSN: 2193-567X</identifier><identifier>ISSN: 1319-8025</identifier><identifier>EISSN: 2191-4281</identifier><identifier>DOI: 10.1007/s13369-019-04191-0</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Algorithms ; Clustering ; Datasets ; Electronic commerce ; Engineering ; Failure analysis ; Humanities and Social Sciences ; Model accuracy ; multidisciplinary ; Research Article - Computer Engineering and Computer Science ; Science ; System theory ; Systems theory</subject><ispartof>Arabian journal for science and engineering (2011), 2020-04, Vol.45 (4), p.2865-2882</ispartof><rights>King Fahd University of Petroleum & Minerals 2019</rights><rights>King Fahd University of Petroleum & Minerals 2019.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</citedby><cites>FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Fidan, Hüseyin</creatorcontrib><creatorcontrib>Yuksel, Mehmet Erkan</creatorcontrib><title>A Novel Short Text Clustering Model Based on Grey System Theory</title><title>Arabian journal for science and engineering (2011)</title><addtitle>Arab J Sci Eng</addtitle><description>Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Datasets</subject><subject>Electronic commerce</subject><subject>Engineering</subject><subject>Failure analysis</subject><subject>Humanities and Social Sciences</subject><subject>Model accuracy</subject><subject>multidisciplinary</subject><subject>Research Article - Computer Engineering and Computer Science</subject><subject>Science</subject><subject>System theory</subject><subject>Systems theory</subject><issn>2193-567X</issn><issn>1319-8025</issn><issn>2191-4281</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9UE1LAzEQDaJgqf0DngKeVzP52E1OUotWoeqhFbyFbXbWVtqmJlvp_nvTruDNgWEevHlvmEfIJbBrYKy4iSBEbjIGqSUYyNgJ6fEDkFzD6RGLTOXF-zkZxLicM6mFUQCiR26H9MV_44pOFz40dIb7ho5Wu9hgWG4-6LOvEndXRqyo39BxwJZO28Su6WyBPrQX5KwuVxEHv7NP3h7uZ6PHbPI6fhoNJ5kTYJosr51hRggskJt54ZjKU3GjgbnaVAq0FMhLZaRTztTGlbLKa25UAaiZVqJPrjrfbfBfO4yN_fS7sEknLRc6N7KAQqct3m254GMMWNttWK7L0Fpg9pCV7bKyKSt7zMqyJBKdKG4PP2P4s_5H9QPnSmnF</recordid><startdate>20200401</startdate><enddate>20200401</enddate><creator>Fidan, Hüseyin</creator><creator>Yuksel, Mehmet Erkan</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20200401</creationdate><title>A Novel Short Text Clustering Model Based on Grey System Theory</title><author>Fidan, Hüseyin ; Yuksel, Mehmet Erkan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Datasets</topic><topic>Electronic commerce</topic><topic>Engineering</topic><topic>Failure analysis</topic><topic>Humanities and Social Sciences</topic><topic>Model accuracy</topic><topic>multidisciplinary</topic><topic>Research Article - Computer Engineering and Computer Science</topic><topic>Science</topic><topic>System theory</topic><topic>Systems theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fidan, Hüseyin</creatorcontrib><creatorcontrib>Yuksel, Mehmet Erkan</creatorcontrib><collection>CrossRef</collection><jtitle>Arabian journal for science and engineering (2011)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fidan, Hüseyin</au><au>Yuksel, Mehmet Erkan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Short Text Clustering Model Based on Grey System Theory</atitle><jtitle>Arabian journal for science and engineering (2011)</jtitle><stitle>Arab J Sci Eng</stitle><date>2020-04-01</date><risdate>2020</risdate><volume>45</volume><issue>4</issue><spage>2865</spage><epage>2882</epage><pages>2865-2882</pages><issn>2193-567X</issn><issn>1319-8025</issn><eissn>2191-4281</eissn><abstract>Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s13369-019-04191-0</doi><tpages>18</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2193-567X |
ispartof | Arabian journal for science and engineering (2011), 2020-04, Vol.45 (4), p.2865-2882 |
issn | 2193-567X 1319-8025 2191-4281 |
language | eng |
recordid | cdi_proquest_journals_2386947178 |
source | Springer Nature |
subjects | Algorithms Clustering Datasets Electronic commerce Engineering Failure analysis Humanities and Social Sciences Model accuracy multidisciplinary Research Article - Computer Engineering and Computer Science Science System theory Systems theory |
title | A Novel Short Text Clustering Model Based on Grey System Theory |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T11%3A04%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Short%20Text%20Clustering%20Model%20Based%20on%20Grey%20System%20Theory&rft.jtitle=Arabian%20journal%20for%20science%20and%20engineering%20(2011)&rft.au=Fidan,%20H%C3%BCseyin&rft.date=2020-04-01&rft.volume=45&rft.issue=4&rft.spage=2865&rft.epage=2882&rft.pages=2865-2882&rft.issn=2193-567X&rft.eissn=2191-4281&rft_id=info:doi/10.1007/s13369-019-04191-0&rft_dat=%3Cproquest_cross%3E2386947178%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-6fc90933e7e29b7c05666629810cf9d51843e2a594c5c9f9ca4d6f29571e80853%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2386947178&rft_id=info:pmid/&rfr_iscdi=true |