Loading…

RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms

In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the un...

Full description

Saved in:

Bibliographic Details
Published in:	Soft computing (Berlin, Germany) Germany), 2021, Vol.25 (2), p.1129-1145
Main Authors:	Janani, R., Vijayarani, S.
Format:	Article
Language:	English
Subjects:	Accuracy Algorithms Ant colony optimization Artificial Intelligence Classification Computational Intelligence Control Datasets Decision trees Documents Engineering Feature selection Genetic algorithms Heuristic methods Machine learning Mathematical Logic and Foundations Mechatronics Methodologies and Application Methods Neural networks Optimization techniques Particle swarm optimization Personal computers Robotics Support vector machines Swarm intelligence Text categorization Unstructured data
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites	cdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763
container_end_page	1145
container_issue	2
container_start_page	1129
container_title	Soft computing (Berlin, Germany)
container_volume	25
creator	Janani, R. Vijayarani, S.
description	In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.
doi_str_mv	10.1007/s00500-020-05209-8
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3117808121</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3117808121</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-AU8Fz9VJ0japt1KrLiwISz2HNE13s_TPmrSgfnqzW8Gbh2GGx3tv4IfQLYZ7DMAeHEAMEALxExNIQ36GFjiiNGQRS89PNwlZEtFLdOXcHoBgFtMFUpui3GR5WTwF2aZc5eviMcimcejkaFQw6s8xUK10zjRGeWnog8mZfht0Uu1Mr4NWS9sfBdnXwXAYTWe-Z59st4M1465z1-iika3TN797id6fizJ_DddvL6s8W4cK45iHUnMqa5VWCY1qznhVEdqkPEprzeIUKl7HoJWKGFDZ-IjkhGnNGURJIxlL6BLdzb0HO3xM2o1iP0y29y8FxZhx4Jhg7yKzS9nBOasbcbCmk_ZLYBBHmGKGKTxMcYIpuA_ROeS8ud9q-1f9T-oHAYR3DQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3117808121</pqid></control><display><type>article</type><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><source>Springer Link</source><creator>Janani, R. ; Vijayarani, S.</creator><creatorcontrib>Janani, R. ; Vijayarani, S.</creatorcontrib><description>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</description><identifier>ISSN: 1432-7643</identifier><identifier>EISSN: 1433-7479</identifier><identifier>DOI: 10.1007/s00500-020-05209-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Accuracy ; Algorithms ; Ant colony optimization ; Artificial Intelligence ; Classification ; Computational Intelligence ; Control ; Datasets ; Decision trees ; Documents ; Engineering ; Feature selection ; Genetic algorithms ; Heuristic methods ; Machine learning ; Mathematical Logic and Foundations ; Mechatronics ; Methodologies and Application ; Methods ; Neural networks ; Optimization techniques ; Particle swarm optimization ; Personal computers ; Robotics ; Support vector machines ; Swarm intelligence ; Text categorization ; Unstructured data</subject><ispartof>Soft computing (Berlin, Germany), 2021, Vol.25 (2), p.1129-1145</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2020. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27900,27901</link.rule.ids></links><search><creatorcontrib>Janani, R.</creatorcontrib><creatorcontrib>Vijayarani, S.</creatorcontrib><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><title>Soft computing (Berlin, Germany)</title><addtitle>Soft Comput</addtitle><description>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Ant colony optimization</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Computational Intelligence</subject><subject>Control</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Documents</subject><subject>Engineering</subject><subject>Feature selection</subject><subject>Genetic algorithms</subject><subject>Heuristic methods</subject><subject>Machine learning</subject><subject>Mathematical Logic and Foundations</subject><subject>Mechatronics</subject><subject>Methodologies and Application</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Optimization techniques</subject><subject>Particle swarm optimization</subject><subject>Personal computers</subject><subject>Robotics</subject><subject>Support vector machines</subject><subject>Swarm intelligence</subject><subject>Text categorization</subject><subject>Unstructured data</subject><issn>1432-7643</issn><issn>1433-7479</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LxDAQxYMouK5-AU8Fz9VJ0japt1KrLiwISz2HNE13s_TPmrSgfnqzW8Gbh2GGx3tv4IfQLYZ7DMAeHEAMEALxExNIQ36GFjiiNGQRS89PNwlZEtFLdOXcHoBgFtMFUpui3GR5WTwF2aZc5eviMcimcejkaFQw6s8xUK10zjRGeWnog8mZfht0Uu1Mr4NWS9sfBdnXwXAYTWe-Z59st4M1465z1-iika3TN797id6fizJ_DddvL6s8W4cK45iHUnMqa5VWCY1qznhVEdqkPEprzeIUKl7HoJWKGFDZ-IjkhGnNGURJIxlL6BLdzb0HO3xM2o1iP0y29y8FxZhx4Jhg7yKzS9nBOasbcbCmk_ZLYBBHmGKGKTxMcYIpuA_ROeS8ud9q-1f9T-oHAYR3DQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Janani, R.</creator><creator>Vijayarani, S.</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope></search><sort><creationdate>2021</creationdate><title>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</title><author>Janani, R. ; Vijayarani, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Ant colony optimization</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Computational Intelligence</topic><topic>Control</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Documents</topic><topic>Engineering</topic><topic>Feature selection</topic><topic>Genetic algorithms</topic><topic>Heuristic methods</topic><topic>Machine learning</topic><topic>Mathematical Logic and Foundations</topic><topic>Mechatronics</topic><topic>Methodologies and Application</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Optimization techniques</topic><topic>Particle swarm optimization</topic><topic>Personal computers</topic><topic>Robotics</topic><topic>Support vector machines</topic><topic>Swarm intelligence</topic><topic>Text categorization</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Janani, R.</creatorcontrib><creatorcontrib>Vijayarani, S.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied & Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Soft computing (Berlin, Germany)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Janani, R.</au><au>Vijayarani, S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms</atitle><jtitle>Soft computing (Berlin, Germany)</jtitle><stitle>Soft Comput</stitle><date>2021</date><risdate>2021</risdate><volume>25</volume><issue>2</issue><spage>1129</spage><epage>1145</epage><pages>1129-1145</pages><issn>1432-7643</issn><eissn>1433-7479</eissn><abstract>In the recent years, the volume of text documents in the form of digital way has grown up extremely in size. As significance, there is a need to be competent to automatically bring together and classify the documents based on their content. The main goal of text classification is to partition the unstructured set of documents into their respective categories based on its content. The main aim of this research work is to automatically classify the documents which are stored in the personal computer into their relevant categories. This work has two significant phases. In the first phase, the important features are selected for classification and the second phase is the classification of text documents. For selecting the optimal features, this research work proposes a new algorithm, optimization technique for feature selection (OTFS) algorithm. To estimate the proficiency of proposed feature selection algorithm, the OTFS algorithm was compared with the existing approaches artificial bee colony, firefly algorithm, ant colony optimization and particle swarm optimization. In the second phase, this research work proposed machine learning-based automatic text classification (MLearn-ATC) algorithm for text classification. In classification, the MLearn-ATC algorithm was compared with widely used classification techniques probabilistic neural network, support vector machine, K-nearest neighbor and Naïve Bayes. From this, the output of first phase is used as the input for classification phase. The decisive results establish that the proposed algorithms achieve the better accuracy for optimizing the features and classifying the text documents based on their content.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00500-020-05209-8</doi><tpages>17</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1432-7643
ispartof	Soft computing (Berlin, Germany), 2021, Vol.25 (2), p.1129-1145
issn	1432-7643 1433-7479
language	eng
recordid	cdi_proquest_journals_3117808121
source	Springer Link
subjects	Accuracy Algorithms Ant colony optimization Artificial Intelligence Classification Computational Intelligence Control Datasets Decision trees Documents Engineering Feature selection Genetic algorithms Heuristic methods Machine learning Mathematical Logic and Foundations Mechatronics Methodologies and Application Methods Neural networks Optimization techniques Particle swarm optimization Personal computers Robotics Support vector machines Swarm intelligence Text categorization Unstructured data
title	RETRACTED ARTICLE: Automatic text classification using machine learning and optimization algorithms
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-25T12%3A38%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RETRACTED%20ARTICLE:%20Automatic%20text%20classification%20using%20machine%20learning%20and%20optimization%20algorithms&rft.jtitle=Soft%20computing%20(Berlin,%20Germany)&rft.au=Janani,%20R.&rft.date=2021&rft.volume=25&rft.issue=2&rft.spage=1129&rft.epage=1145&rft.pages=1129-1145&rft.issn=1432-7643&rft.eissn=1433-7479&rft_id=info:doi/10.1007/s00500-020-05209-8&rft_dat=%3Cproquest_cross%3E3117808121%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1158-ae83adc9b634d878bb23f9849de7590b8d50ecc4703afc11a827ee87046fa7763%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3117808121&rft_id=info:pmid/&rfr_iscdi=true