
Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction

Early prediction and identification of software flaws, based on quality estimates, is crucial in software engineering. Software defect prediction (SDP) is the process of exposing flaws in software through the use of prediction models and defect datasets. This study recommended a metho...

Bibliographic Details
Published in: Journal of intelligent & fuzzy systems 2023-01, Vol.44 (3), p.3867-3876
Main Authors: Amanullah, M.; Thanga Ramya, S.; Sudha, M.; Gladis Pushparathi, V.P.; Haldorai, Anandakumar; Pant, Bhaskar
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483
cites cdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483
container_end_page 3876
container_issue 3
container_start_page 3867
container_title Journal of intelligent & fuzzy systems
container_volume 44
creator Amanullah, M.
Thanga Ramya, S.
Sudha, M.
Gladis Pushparathi, V.P.
Haldorai, Anandakumar
Pant, Bhaskar
description Early prediction and identification of software flaws, based on quality estimates, is crucial in software engineering. Software defect prediction (SDP) is the process of exposing flaws in software through the use of prediction models and defect datasets. This study proposes a method that handles the class imbalance problem with an Improved Random Synthetic Minority Oversampling Technique (SMOTE) and then applies the Linear Pearson Correlation technique for feature selection in order to predict software failure. The defect datasets are first processed with the Improved Random-SMOTE oversampling technique to address the class imbalance. Features are then selected with the Linear Pearson Correlation approach, and the samples are split into training and testing datasets by k-fold cross-validation. Finally, Heuristic Learning Vector Quantization classifies the data to predict software defects. The performance of the proposed strategy is compared with existing classification approaches on two separate datasets, using sensitivity, specificity, false positive rate (FPR), and accuracy.
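The pipeline outlined in the description can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses plain SMOTE rather than the Improved Random-SMOTE of the paper, textbook LVQ1 with one prototype per class rather than the heuristic LVQ variant, ranks features by absolute Pearson correlation against the defect label (the record does not say how the correlation is applied), and omits the k-fold split. All function names and parameter defaults are hypothetical.

```python
import math
import random


def smote(minority, n_new, k=3, seed=0):
    """Plain SMOTE (assumption: not the paper's Improved Random-SMOTE).

    Each synthetic point is a random interpolation between a minority sample
    and one of its k nearest minority neighbours. Needs >= 2 minority samples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        others = [m for m in minority if m is not x]
        neighbours = sorted(others, key=lambda m: math.dist(x, m))[:k]
        nb = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from x to nb
        synthetic.append([a + gap * (b - a) for a, b in zip(x, nb)])
    return synthetic


def pearson(xs, ys):
    """Pearson correlation coefficient; 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0


def select_features(X, y, top):
    """Indices of the `top` features with the largest |r| against the label
    (assumption: the record does not specify the selection rule)."""
    scores = [abs(pearson([row[j] for row in X], y)) for j in range(len(X[0]))]
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:top])


def predict(protos, x):
    """Label of the prototype nearest to x (Euclidean distance)."""
    return min(protos, key=lambda c: math.dist(x, protos[c]))


def train_lvq1(X, y, epochs=20, lr=0.1):
    """Textbook LVQ1 (assumption: not the paper's heuristic variant),
    one prototype per class, initialised at the class mean."""
    protos = {}
    for c in set(y):
        rows = [x for x, t in zip(X, y) if t == c]
        protos[c] = [sum(col) / len(rows) for col in zip(*rows)]
    for epoch in range(epochs):
        rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
        for x, t in zip(X, y):
            c = predict(protos, x)  # winning prototype
            sign = 1 if c == t else -1  # attract if correct, repel if wrong
            protos[c] = [p + sign * rate * (xi - p)
                         for p, xi in zip(protos[c], x)]
    return protos


def metrics(y_true, y_pred, positive=1):
    """Sensitivity, specificity, FPR and accuracy from binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)

    def div(a, b):
        return a / b if b else 0.0

    return {
        "sensitivity": div(tp, tp + fn),
        "specificity": div(tn, tn + fp),
        "fpr": div(fp, fp + tn),
        "accuracy": div(tp + tn, len(pairs)),
    }
```

In a cross-validated setting, one would typically call `smote` on the minority rows of each training fold only, so that the held-out fold contains no synthetic samples, then apply `select_features`, `train_lvq1`, and `metrics` in that order.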
doi_str_mv 10.3233/JIFS-220480
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2785237234</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2785237234</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</originalsourceid><addsrcrecordid>eNotkEtLAzEQgIMoWKsn_0DAiyKreW02PUq1WilIUXtdxtnEprS7a5JF9Ne7Sz3Ng29mmI-Qc85upJDy9nk-e82EYMqwAzLipsgzM9HFYZ8zrTIulD4mJzFuGONFLtiIxHtIQCPs2q2vPym0bWgA17SLQ7m2XfAxeaQLC6EeWiuLqQl02UGd_C8k39T0crFaXlHcQozeeRuo64nYuPQNwdLKun6GtsFWHgf-lBw52EZ79h_H5H328DZ9yhYvj_Pp3SJDoXnKcmE-EAuNhjutGQhmUUlXOawqq6vcKaG4y01hxYRDpZHlBjRaZFYbpYwck4v93v6nr87GVG6aLtT9yVIUJheyEFL11PWewtDEGKwr2-B3EH5KzsrBajlYLfdW5R8lbmwB</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2785237234</pqid></control><display><type>article</type><title>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</title><source>Business Source Ultimate</source><source>SAGE:Jisc Collections:SAGE Journals Read and Publish 2023-2024:2025 extension (reading list)</source><creator>Amanullah, M. ; Thanga Ramya, S. ; Sudha, M. ; Gladis Pushparathi, V.P. ; Haldorai, Anandakumar ; Pant, Bhaskar</creator><creatorcontrib>Amanullah, M. ; Thanga Ramya, S. ; Sudha, M. ; Gladis Pushparathi, V.P. ; Haldorai, Anandakumar ; Pant, Bhaskar</creatorcontrib><description>On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. 
This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist.</description><identifier>ISSN: 1064-1246</identifier><identifier>EISSN: 1875-8967</identifier><identifier>DOI: 10.3233/JIFS-220480</identifier><language>eng</language><publisher>Amsterdam: IOS Press BV</publisher><subject>Data sampling ; Datasets ; Defects ; Heuristic ; Learning vector quantization networks ; Oversampling ; Prediction models ; Software</subject><ispartof>Journal of intelligent &amp; fuzzy systems, 2023-01, Vol.44 (3), p.3867-3876</ispartof><rights>Copyright IOS Press BV 
2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</citedby><cites>FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Amanullah, M.</creatorcontrib><creatorcontrib>Thanga Ramya, S.</creatorcontrib><creatorcontrib>Sudha, M.</creatorcontrib><creatorcontrib>Gladis Pushparathi, V.P.</creatorcontrib><creatorcontrib>Haldorai, Anandakumar</creatorcontrib><creatorcontrib>Pant, Bhaskar</creatorcontrib><title>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</title><title>Journal of intelligent &amp; fuzzy systems</title><description>On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. 
Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist.</description><subject>Data sampling</subject><subject>Datasets</subject><subject>Defects</subject><subject>Heuristic</subject><subject>Learning vector quantization networks</subject><subject>Oversampling</subject><subject>Prediction models</subject><subject>Software</subject><issn>1064-1246</issn><issn>1875-8967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNotkEtLAzEQgIMoWKsn_0DAiyKreW02PUq1WilIUXtdxtnEprS7a5JF9Ne7Sz3Ng29mmI-Qc85upJDy9nk-e82EYMqwAzLipsgzM9HFYZ8zrTIulD4mJzFuGONFLtiIxHtIQCPs2q2vPym0bWgA17SLQ7m2XfAxeaQLC6EeWiuLqQl02UGd_C8k39T0crFaXlHcQozeeRuo64nYuPQNwdLKun6GtsFWHgf-lBw52EZ79h_H5H328DZ9yhYvj_Pp3SJDoXnKcmE-EAuNhjutGQhmUUlXOawqq6vcKaG4y01hxYRDpZHlBjRaZFYbpYwck4v93v6nr87GVG6aLtT9yVIUJheyEFL11PWewtDEGKwr2-B3EH5KzsrBajlYLfdW5R8lbmwB</recordid><startdate>20230101</startdate><enddate>20230101</enddate><creator>Amanullah, M.</creator><creator>Thanga Ramya, S.</creator><creator>Sudha, M.</creator><creator>Gladis Pushparathi, V.P.</creator><creator>Haldorai, Anandakumar</creator><creator>Pant, Bhaskar</creator><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20230101</creationdate><title>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</title><author>Amanullah, M. ; Thanga Ramya, S. ; Sudha, M. ; Gladis Pushparathi, V.P. 
; Haldorai, Anandakumar ; Pant, Bhaskar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Data sampling</topic><topic>Datasets</topic><topic>Defects</topic><topic>Heuristic</topic><topic>Learning vector quantization networks</topic><topic>Oversampling</topic><topic>Prediction models</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Amanullah, M.</creatorcontrib><creatorcontrib>Thanga Ramya, S.</creatorcontrib><creatorcontrib>Sudha, M.</creatorcontrib><creatorcontrib>Gladis Pushparathi, V.P.</creatorcontrib><creatorcontrib>Haldorai, Anandakumar</creatorcontrib><creatorcontrib>Pant, Bhaskar</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Amanullah, M.</au><au>Thanga Ramya, S.</au><au>Sudha, M.</au><au>Gladis Pushparathi, V.P.</au><au>Haldorai, Anandakumar</au><au>Pant, Bhaskar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</atitle><jtitle>Journal of intelligent &amp; fuzzy 
systems</jtitle><date>2023-01-01</date><risdate>2023</risdate><volume>44</volume><issue>3</issue><spage>3867</spage><epage>3876</epage><pages>3867-3876</pages><issn>1064-1246</issn><eissn>1875-8967</eissn><abstract>On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist.</abstract><cop>Amsterdam</cop><pub>IOS Press BV</pub><doi>10.3233/JIFS-220480</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1064-1246
ispartof Journal of intelligent & fuzzy systems, 2023-01, Vol.44 (3), p.3867-3876
issn 1064-1246
1875-8967
language eng
recordid cdi_proquest_journals_2785237234
source Business Source Ultimate; SAGE:Jisc Collections:SAGE Journals Read and Publish 2023-2024:2025 extension (reading list)
subjects Data sampling
Datasets
Defects
Heuristic
Learning vector quantization networks
Oversampling
Prediction models
Software
title Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T12%3A56%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data%20sampling%20approach%20using%20heuristic%20Learning%20Vector%20Quantization%20(LVQ)%20classifier%20for%20software%20defect%20prediction&rft.jtitle=Journal%20of%20intelligent%20&%20fuzzy%20systems&rft.au=Amanullah,%20M.&rft.date=2023-01-01&rft.volume=44&rft.issue=3&rft.spage=3867&rft.epage=3876&rft.pages=3867-3876&rft.issn=1064-1246&rft.eissn=1875-8967&rft_id=info:doi/10.3233/JIFS-220480&rft_dat=%3Cproquest_cross%3E2785237234%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2785237234&rft_id=info:pmid/&rfr_iscdi=true