Loading…
RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms
In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the un...
Saved in:
Published in: | Soft computing (Berlin, Germany) Germany), 2021, Vol.25 (2), p.1129-1145 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763 |
container_end_page | 1145 |
container_issue | 2 |
container_start_page | 1129 |
container_title | Soft computing (Berlin, Germany) |
container_volume | 25 |
creator | Janani, R. Vijayarani, S. |
description | In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content. |
doi_str_mv | 10.1007/s00500-020-05209-8 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3117808121</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3117808121</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-AU8Fz9VJ0japt1KrLiwISz2HNE13s_TPmrSgfnqzW8Gbh2GGx3tv4IfQLYZ7DMAeHEAMEALxExNIQ36GFjiiNGQRS89PNwlZEtFLdOXcHoBgFtMFUpui3GR5WTwF2aZc5eviMcimcejkaFQw6s8xUK10zjRGeWnog8mZfht0Uu1Mr4NWS9sfBdnXwXAYTWe-Z59st4M1465z1-iika3TN797id6fizJ_DddvL6s8W4cK45iHUnMqa5VWCY1qznhVEdqkPEprzeIUKl7HoJWKGFDZ-IjkhGnNGURJIxlL6BLdzb0HO3xM2o1iP0y29y8FxZhx4Jhg7yKzS9nBOasbcbCmk_ZLYBBHmGKGKTxMcYIpuA_ROeS8ud9q-1f9T-oHAYR3DQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3117808121</pqid></control><display><type>article</type><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><source>Springer Link</source><creator>Janani, R. ; Vijayarani, S.</creator><creatorcontrib>Janani, R. ; Vijayarani, S.</creatorcontrib><description>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</description><identifier>ISSN: 1432-7643</identifier><identifier>EISSN: 1433-7479</identifier><identifier>DOI: 10.1007/s00500-020-05209-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Accuracy ; Algorithms ; Ant colony optimization ; Artificial Intelligence ; Classification ; Computational Intelligence ; Control ; Datasets ; Decision trees ; Documents ; Engineering ; Feature selection ; Genetic algorithms ; Heuristic methods ; Machine learning ; Mathematical Logic and Foundations ; Mechatronics ; Methodologies and Application ; Methods ; Neural networks ; Optimization techniques ; Particle swarm optimization ; Personal computers ; Robotics ; Support vector machines ; Swarm intelligence ; Text categorization ; Unstructured data</subject><ispartof>Soft computing (Berlin, Germany), 2021, Vol.25 (2), p.1129-1145</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2020. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27900,27901</link.rule.ids></links><search><creatorcontrib>Janani, R.</creatorcontrib><creatorcontrib>Vijayarani, S.</creatorcontrib><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><title>Soft computing (Berlin, Germany)</title><addtitle>Soft Comput</addtitle><description>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Ant colony optimization</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Computational Intelligence</subject><subject>Control</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Documents</subject><subject>Engineering</subject><subject>Feature selection</subject><subject>Genetic algorithms</subject><subject>Heuristic methods</subject><subject>Machine learning</subject><subject>Mathematical Logic and Foundations</subject><subject>Mechatronics</subject><subject>Methodologies and Application</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Optimization techniques</subject><subject>Particle swarm optimization</subject><subject>Personal computers</subject><subject>Robotics</subject><subject>Support vector machines</subject><subject>Swarm intelligence</subject><subject>Text categorization</subject><subject>Unstructured data</subject><issn>1432-7643</issn><issn>1433-7479</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LxDAQxYMouK5-AU8Fz9VJ0japt1KrLiwISz2HNE13s_TPmrSgfnqzW8Gbh2GGx3tv4IfQLYZ7DMAeHEAMEALxExNIQ36GFjiiNGQRS89PNwlZEtFLdOXcHoBgFtMFUpui3GR5WTwF2aZc5eviMcimcejkaFQw6s8xUK10zjRGeWnog8mZfht0Uu1Mr4NWS9sfBdnXwXAYTWe-Z59st4M1465z1-iika3TN797id6fizJ_DddvL6s8W4cK45iHUnMqa5VWCY1qznhVEdqkPEprzeIUKl7HoJWKGFDZ-IjkhGnNGURJIxlL6BLdzb0HO3xM2o1iP0y29y8FxZhx4Jhg7yKzS9nBOasbcbCmk_ZLYBBHmGKGKTxMcYIpuA_ROeS8ud9q-1f9T-oHAYR3DQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Janani, R.</creator><creator>Vijayarani, S.</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope></search><sort><creationdate>2021</creationdate><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><author>Janani, R. ; Vijayarani, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Ant colony optimization</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Computational Intelligence</topic><topic>Control</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Documents</topic><topic>Engineering</topic><topic>Feature selection</topic><topic>Genetic algorithms</topic><topic>Heuristic methods</topic><topic>Machine learning</topic><topic>Mathematical Logic and Foundations</topic><topic>Mechatronics</topic><topic>Methodologies and Application</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Optimization techniques</topic><topic>Particle swarm optimization</topic><topic>Personal computers</topic><topic>Robotics</topic><topic>Support vector machines</topic><topic>Swarm intelligence</topic><topic>Text categorization</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Janani, R.</creatorcontrib><creatorcontrib>Vijayarani, S.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied & Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Soft computing (Berlin, Germany)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Janani, R.</au><au>Vijayarani, S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</atitle><jtitle>Soft computing (Berlin, Germany)</jtitle><stitle>Soft Comput</stitle><date>2021</date><risdate>2021</risdate><volume>25</volume><issue>2</issue><spage>1129</spage><epage>1145</epage><pages>1129-1145</pages><issn>1432-7643</issn><eissn>1433-7479</eissn><abstract>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00500-020-05209-8</doi><tpages>17</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1432-7643 |
ispartof | Soft computing (Berlin, Germany), 2021, Vol.25 (2), p.1129-1145 |
issn | 1432-7643 1433-7479 |
language | eng |
recordid | cdi_proquest_journals_3117808121 |
source | Springer Link |
subjects | Accuracy Algorithms Ant colony optimization Artificial Intelligence Classification Computational Intelligence Control Datasets Decision trees Documents Engineering Feature selection Genetic algorithms Heuristic methods Machine learning Mathematical Logic and Foundations Mechatronics Methodologies and Application Methods Neural networks Optimization techniques Particle swarm optimization Personal computers Robotics Support vector machines Swarm intelligence Text categorization Unstructured data |
title | RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-25T12%3A38%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RETRACTED%20ARTICLE:%20Automatic%20text%20classification%20using%20machine%20learning%20and%20optimization%20algorithms&rft.jtitle=Soft%20computing%20(Berlin,%20Germany)&rft.au=Janani,%20R.&rft.date=2021&rft.volume=25&rft.issue=2&rft.spage=1129&rft.epage=1145&rft.pages=1129-1145&rft.issn=1432-7643&rft.eissn=1433-7479&rft_id=info:doi/10.1007/s00500-020-05209-8&rft_dat=%3Cproquest_cross%3E3117808121%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3117808121&rft_id=info:pmid/&rfr_iscdi=true |