Loading…
Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction
On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a metho...
Saved in:
Published in: | Journal of intelligent & fuzzy systems 2023-01, Vol.44 (3), p.3867-3876 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483 |
---|---|
cites | cdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483 |
container_end_page | 3876 |
container_issue | 3 |
container_start_page | 3867 |
container_title | Journal of intelligent & fuzzy systems |
container_volume | 44 |
creator | Amanullah, M. Thanga Ramya, S. Sudha, M. Gladis Pushparathi, V.P. Haldorai, Anandakumar Pant, Bhaskar |
description | On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist. |
doi_str_mv | 10.3233/JIFS-220480 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2785237234</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2785237234</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</originalsourceid><addsrcrecordid>eNotkEtLAzEQgIMoWKsn_0DAiyKreW02PUq1WilIUXtdxtnEprS7a5JF9Ne7Sz3Ng29mmI-Qc85upJDy9nk-e82EYMqwAzLipsgzM9HFYZ8zrTIulD4mJzFuGONFLtiIxHtIQCPs2q2vPym0bWgA17SLQ7m2XfAxeaQLC6EeWiuLqQl02UGd_C8k39T0crFaXlHcQozeeRuo64nYuPQNwdLKun6GtsFWHgf-lBw52EZ79h_H5H328DZ9yhYvj_Pp3SJDoXnKcmE-EAuNhjutGQhmUUlXOawqq6vcKaG4y01hxYRDpZHlBjRaZFYbpYwck4v93v6nr87GVG6aLtT9yVIUJheyEFL11PWewtDEGKwr2-B3EH5KzsrBajlYLfdW5R8lbmwB</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2785237234</pqid></control><display><type>article</type><title>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</title><source>Business Source Ultimate</source><source>SAGE:Jisc Collections:SAGE Journals Read and Publish 2023-2024:2025 extension (reading list)</source><creator>Amanullah, M. ; Thanga Ramya, S. ; Sudha, M. ; Gladis Pushparathi, V.P. ; Haldorai, Anandakumar ; Pant, Bhaskar</creator><creatorcontrib>Amanullah, M. ; Thanga Ramya, S. ; Sudha, M. ; Gladis Pushparathi, V.P. ; Haldorai, Anandakumar ; Pant, Bhaskar</creatorcontrib><description>On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist.</description><identifier>ISSN: 1064-1246</identifier><identifier>EISSN: 1875-8967</identifier><identifier>DOI: 10.3233/JIFS-220480</identifier><language>eng</language><publisher>Amsterdam: IOS Press BV</publisher><subject>Data sampling ; Datasets ; Defects ; Heuristic ; Learning vector quantization networks ; Oversampling ; Prediction models ; Software</subject><ispartof>Journal of intelligent & fuzzy systems, 2023-01, Vol.44 (3), p.3867-3876</ispartof><rights>Copyright IOS Press BV 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</citedby><cites>FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Amanullah, M.</creatorcontrib><creatorcontrib>Thanga Ramya, S.</creatorcontrib><creatorcontrib>Sudha, M.</creatorcontrib><creatorcontrib>Gladis Pushparathi, V.P.</creatorcontrib><creatorcontrib>Haldorai, Anandakumar</creatorcontrib><creatorcontrib>Pant, Bhaskar</creatorcontrib><title>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</title><title>Journal of intelligent & fuzzy systems</title><description>On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist.</description><subject>Data sampling</subject><subject>Datasets</subject><subject>Defects</subject><subject>Heuristic</subject><subject>Learning vector quantization networks</subject><subject>Oversampling</subject><subject>Prediction models</subject><subject>Software</subject><issn>1064-1246</issn><issn>1875-8967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNotkEtLAzEQgIMoWKsn_0DAiyKreW02PUq1WilIUXtdxtnEprS7a5JF9Ne7Sz3Ng29mmI-Qc85upJDy9nk-e82EYMqwAzLipsgzM9HFYZ8zrTIulD4mJzFuGONFLtiIxHtIQCPs2q2vPym0bWgA17SLQ7m2XfAxeaQLC6EeWiuLqQl02UGd_C8k39T0crFaXlHcQozeeRuo64nYuPQNwdLKun6GtsFWHgf-lBw52EZ79h_H5H328DZ9yhYvj_Pp3SJDoXnKcmE-EAuNhjutGQhmUUlXOawqq6vcKaG4y01hxYRDpZHlBjRaZFYbpYwck4v93v6nr87GVG6aLtT9yVIUJheyEFL11PWewtDEGKwr2-B3EH5KzsrBajlYLfdW5R8lbmwB</recordid><startdate>20230101</startdate><enddate>20230101</enddate><creator>Amanullah, M.</creator><creator>Thanga Ramya, S.</creator><creator>Sudha, M.</creator><creator>Gladis Pushparathi, V.P.</creator><creator>Haldorai, Anandakumar</creator><creator>Pant, Bhaskar</creator><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20230101</creationdate><title>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</title><author>Amanullah, M. ; Thanga Ramya, S. ; Sudha, M. ; Gladis Pushparathi, V.P. ; Haldorai, Anandakumar ; Pant, Bhaskar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Data sampling</topic><topic>Datasets</topic><topic>Defects</topic><topic>Heuristic</topic><topic>Learning vector quantization networks</topic><topic>Oversampling</topic><topic>Prediction models</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Amanullah, M.</creatorcontrib><creatorcontrib>Thanga Ramya, S.</creatorcontrib><creatorcontrib>Sudha, M.</creatorcontrib><creatorcontrib>Gladis Pushparathi, V.P.</creatorcontrib><creatorcontrib>Haldorai, Anandakumar</creatorcontrib><creatorcontrib>Pant, Bhaskar</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of intelligent & fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Amanullah, M.</au><au>Thanga Ramya, S.</au><au>Sudha, M.</au><au>Gladis Pushparathi, V.P.</au><au>Haldorai, Anandakumar</au><au>Pant, Bhaskar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction</atitle><jtitle>Journal of intelligent & fuzzy systems</jtitle><date>2023-01-01</date><risdate>2023</risdate><volume>44</volume><issue>3</issue><spage>3867</spage><epage>3876</epage><pages>3867-3876</pages><issn>1064-1246</issn><eissn>1875-8967</eissn><abstract>On the basis of quality estimate, early prediction and identification of software flaws is crucial in the software area. Prediction of Software Defects SDP is defined as the process of exposing software to flaws through the use of prediction models and defect datasets. This study recommended a method for dealing with the class imbalance problem based on Improved Random Synthetic Minority Oversampling Technique (SMOTE), followed by Linear Pearson Correlation Technique to perform feature selection to predict software failure. On the basis of the SMOTE data sampling approach, a strategy for software defect prediction is given in this paper. To address the class imbalance, the defect datasets were initially processed using the Improved Random-SMOTE Oversampling technique. Then, using the Linear Pearson Correlation approach, the features were chosen, and using the k-fold cross validation process, the samples were split into training and testing datasets. Finally, Heuristic Learning Vector Quantization is used to classify data in order to predict software problems. Based on measures like sensitivity, specificity, FPR, and accuracy rate for two separate datasets, the performance of the proposed strategy is contrasted with the approaches to classification that presently exist.</abstract><cop>Amsterdam</cop><pub>IOS Press BV</pub><doi>10.3233/JIFS-220480</doi><tpages>10</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1064-1246 |
ispartof | Journal of intelligent & fuzzy systems, 2023-01, Vol.44 (3), p.3867-3876 |
issn | 1064-1246 1875-8967 |
language | eng |
recordid | cdi_proquest_journals_2785237234 |
source | Business Source Ultimate; SAGE:Jisc Collections:SAGE Journals Read and Publish 2023-2024:2025 extension (reading list) |
subjects | Data sampling Datasets Defects Heuristic Learning vector quantization networks Oversampling Prediction models Software |
title | Data sampling approach using heuristic Learning Vector Quantization (LVQ) classifier for software defect prediction |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T12%3A56%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data%20sampling%20approach%20using%20heuristic%20Learning%20Vector%20Quantization%20(LVQ)%20classifier%20for%20software%20defect%20prediction&rft.jtitle=Journal%20of%20intelligent%20&%20fuzzy%20systems&rft.au=Amanullah,%20M.&rft.date=2023-01-01&rft.volume=44&rft.issue=3&rft.spage=3867&rft.epage=3876&rft.pages=3867-3876&rft.issn=1064-1246&rft.eissn=1875-8967&rft_id=info:doi/10.3233/JIFS-220480&rft_dat=%3Cproquest_cross%3E2785237234%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c261t-528bcc76c81f660a20ec43fdfcdde6d5f4241f587e291ad6c058a6cec0e684483%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2785237234&rft_id=info:pmid/&rfr_iscdi=true |