Loading…

RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms

In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the un...

Full description

Saved in:
Bibliographic Details
Published in:Soft computing (Berlin, Germany) Germany), 2021, Vol.25 (2), p.1129-1145
Main Authors: Janani, R., Vijayarani, S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763
container_end_page 1145
container_issue 2
container_start_page 1129
container_title Soft computing (Berlin, Germany)
container_volume 25
creator Janani, R.
Vijayarani, S.
description In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.
doi_str_mv 10.1007/s00500-020-05209-8
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3117808121</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3117808121</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-AU8Fz9VJ0japt1KrLiwISz2HNE13s_TPmrSgfnqzW8Gbh2GGx3tv4IfQLYZ7DMAeHEAMEALxExNIQ36GFjiiNGQRS89PNwlZEtFLdOXcHoBgFtMFUpui3GR5WTwF2aZc5eviMcimcejkaFQw6s8xUK10zjRGeWnog8mZfht0Uu1Mr4NWS9sfBdnXwXAYTWe-Z59st4M1465z1-iika3TN797id6fizJ_DddvL6s8W4cK45iHUnMqa5VWCY1qznhVEdqkPEprzeIUKl7HoJWKGFDZ-IjkhGnNGURJIxlL6BLdzb0HO3xM2o1iP0y29y8FxZhx4Jhg7yKzS9nBOasbcbCmk_ZLYBBHmGKGKTxMcYIpuA_ROeS8ud9q-1f9T-oHAYR3DQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3117808121</pqid></control><display><type>article</type><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><source>Springer Link</source><creator>Janani, R. ; Vijayarani, S.</creator><creatorcontrib>Janani, R. ; Vijayarani, S.</creatorcontrib><description>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</description><identifier>ISSN: 1432-7643</identifier><identifier>EISSN: 1433-7479</identifier><identifier>DOI: 10.1007/s00500-020-05209-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Accuracy ; Algorithms ; Ant colony optimization ; Artificial Intelligence ; Classification ; Computational Intelligence ; Control ; Datasets ; Decision trees ; Documents ; Engineering ; Feature selection ; Genetic algorithms ; Heuristic methods ; Machine learning ; Mathematical Logic and Foundations ; Mechatronics ; Methodologies and Application ; Methods ; Neural networks ; Optimization techniques ; Particle swarm optimization ; Personal computers ; Robotics ; Support vector machines ; Swarm intelligence ; Text categorization ; Unstructured data</subject><ispartof>Soft computing (Berlin, Germany), 2021, Vol.25 (2), p.1129-1145</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2020. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27900,27901</link.rule.ids></links><search><creatorcontrib>Janani, R.</creatorcontrib><creatorcontrib>Vijayarani, S.</creatorcontrib><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><title>Soft computing (Berlin, Germany)</title><addtitle>Soft Comput</addtitle><description>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Ant colony optimization</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Computational Intelligence</subject><subject>Control</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Documents</subject><subject>Engineering</subject><subject>Feature selection</subject><subject>Genetic algorithms</subject><subject>Heuristic methods</subject><subject>Machine learning</subject><subject>Mathematical Logic and Foundations</subject><subject>Mechatronics</subject><subject>Methodologies and Application</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Optimization techniques</subject><subject>Particle swarm optimization</subject><subject>Personal computers</subject><subject>Robotics</subject><subject>Support vector machines</subject><subject>Swarm intelligence</subject><subject>Text categorization</subject><subject>Unstructured data</subject><issn>1432-7643</issn><issn>1433-7479</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LxDAQxYMouK5-AU8Fz9VJ0japt1KrLiwISz2HNE13s_TPmrSgfnqzW8Gbh2GGx3tv4IfQLYZ7DMAeHEAMEALxExNIQ36GFjiiNGQRS89PNwlZEtFLdOXcHoBgFtMFUpui3GR5WTwF2aZc5eviMcimcejkaFQw6s8xUK10zjRGeWnog8mZfht0Uu1Mr4NWS9sfBdnXwXAYTWe-Z59st4M1465z1-iika3TN797id6fizJ_DddvL6s8W4cK45iHUnMqa5VWCY1qznhVEdqkPEprzeIUKl7HoJWKGFDZ-IjkhGnNGURJIxlL6BLdzb0HO3xM2o1iP0y29y8FxZhx4Jhg7yKzS9nBOasbcbCmk_ZLYBBHmGKGKTxMcYIpuA_ROeS8ud9q-1f9T-oHAYR3DQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Janani, R.</creator><creator>Vijayarani, S.</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope></search><sort><creationdate>2021</creationdate><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><author>Janani, R. ; Vijayarani, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Ant colony optimization</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Computational Intelligence</topic><topic>Control</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Documents</topic><topic>Engineering</topic><topic>Feature selection</topic><topic>Genetic algorithms</topic><topic>Heuristic methods</topic><topic>Machine learning</topic><topic>Mathematical Logic and Foundations</topic><topic>Mechatronics</topic><topic>Methodologies and Application</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Optimization techniques</topic><topic>Particle swarm optimization</topic><topic>Personal computers</topic><topic>Robotics</topic><topic>Support vector machines</topic><topic>Swarm intelligence</topic><topic>Text categorization</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Janani, R.</creatorcontrib><creatorcontrib>Vijayarani, S.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied &amp; Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Soft computing (Berlin, Germany)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Janani, R.</au><au>Vijayarani, S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</atitle><jtitle>Soft computing (Berlin, Germany)</jtitle><stitle>Soft Comput</stitle><date>2021</date><risdate>2021</risdate><volume>25</volume><issue>2</issue><spage>1129</spage><epage>1145</epage><pages>1129-1145</pages><issn>1432-7643</issn><eissn>1433-7479</eissn><abstract>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00500-020-05209-8</doi><tpages>17</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1432-7643
ispartof Soft computing (Berlin, Germany), 2021, Vol.25 (2), p.1129-1145
issn 1432-7643
1433-7479
language eng
recordid cdi_proquest_journals_3117808121
source Springer Link
subjects Accuracy
Algorithms
Ant colony optimization
Artificial Intelligence
Classification
Computational Intelligence
Control
Datasets
Decision trees
Documents
Engineering
Feature selection
Genetic algorithms
Heuristic methods
Machine learning
Mathematical Logic and Foundations
Mechatronics
Methodologies and Application
Methods
Neural networks
Optimization techniques
Particle swarm optimization
Personal computers
Robotics
Support vector machines
Swarm intelligence
Text categorization
Unstructured data
title RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-25T12%3A38%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RETRACTED%20ARTICLE:%20Automatic%20text%20classification%20using%20machine%20learning%20and%20optimization%20algorithms&rft.jtitle=Soft%20computing%20(Berlin,%20Germany)&rft.au=Janani,%20R.&rft.date=2021&rft.volume=25&rft.issue=2&rft.spage=1129&rft.epage=1145&rft.pages=1129-1145&rft.issn=1432-7643&rft.eissn=1433-7479&rft_id=info:doi/10.1007/s00500-020-05209-8&rft_dat=%3Cproquest_cross%3E3117808121%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3117808121&rft_id=info:pmid/&rfr_iscdi=true