Code Smells Detection and Visualization: A Systematic Literature Review

Bibliographic Details
Published in:	Archives of computational methods in engineering 2022-01, Vol.29 (1), p.47-94
Main Authors:	Pereira dos Reis, José ; Brito e Abreu, Fernando ; de Figueiredo Carneiro, Glauco ; Anslow, Craig
Format:	Article
Language:	English
Subjects:	Best practice ; Decision trees ; Engineering ; Feature extraction ; Genetic algorithms ; Literature reviews ; Machine learning ; Mathematical and Computational Engineering ; Metaphor ; Open source software ; Programming languages ; Review Article ; Software ; Source code ; State-of-the-art reviews ; Support vector machines ; Systematic review ; Visualization

cited_by	cdi_FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3
cites	cdi_FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3
container_end_page	94
container_issue	1
container_start_page	47
container_title	Archives of computational methods in engineering
container_volume	29
creator	Pereira dos Reis, José ; Brito e Abreu, Fernando ; de Figueiredo Carneiro, Glauco ; Anslow, Craig
description	Code smells tend to compromise software quality and also demand more effort by developers to maintain and evolve the application throughout its life-cycle. They have long been catalogued with corresponding mitigating solutions called refactoring operations. Researchers have argued that due to the subjectiveness of the code smells detection process, proposing an effective use of automatic support for this end is a non-trivial task. This systematic literature review (SLR) has a twofold goal: the first is to identify the main code smells detection techniques and tools discussed in the literature, and the second is to analyze to what extent visual techniques have been applied to support the former. Over eighty primary studies indexed in major scientific repositories were identified by our search string in this SLR. Then, following existing best practices for secondary studies, we applied inclusion/exclusion criteria to select the most relevant works, extract their features and classify them. We found that the most commonly used approaches to code smells detection are search-based (30.1%), metric-based (24.1%), and symptom-based approaches (19.3%). Most of the studies (83.1%) use open-source software, with the Java language occupying the first position (77.1%). In terms of code smells, God Class (51.8%), Feature Envy (33.7%), and Long Method (26.5%) are the most covered ones. Machine learning (ML) techniques are used in 35% of the studies, with genetic programming, decision trees, support vector machines and association rules being the most used algorithms. Around 80% of the studies only detect code smells, without providing visualization techniques. In visualization-based approaches several methods are used, such as: city metaphors, 3D visualization techniques, interactive ambient visualization, polymetric views, or graph models.
This paper presents an up-to-date review on the state-of-the-art techniques and tools used for code smells detection and visualization. We confirm that the detection of code smells is a non-trivial task, and there is still a lot of work to be done in terms of: reducing the subjectivity associated with the definition and detection of code smells; increasing the diversity of detected code smells and of supported programming languages; constructing and sharing oracles and datasets to facilitate the replication of code smells detection and visualization techniques validation experiments.
doi_str_mv	10.1007/s11831-021-09566-x
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2616383247</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2616383247</sourcerecordid><originalsourceid>FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3</originalsourceid><addsrcrecordid>eNp9kEtLw0AUhQdRsFb_gKsB19G5M5lH3ZWqVSgIVt0OSeZGUtKkziTa-uudGsGdi8t9cM658BFyDuwSGNNXAcAISBiPNZFKJdsDMgJjVALapIdxBpEmgil2TE5CWDEm08mEj8h81jqkyzXWdaA32GHRVW1Ds8bR1yr0WV19ZfvLNZ3S5S50uI5rQRdVhz7reo_0CT8q_DwlR2VWBzz77WPycnf7PLtPFo_zh9l0kRQpQJcIhZprVyrMpUOOnDvDCgdCayc1SCNzbhCzHFWRY55L4DlzXBsOSjJZijG5GHI3vn3vMXR21fa-iS8tV6CEETzVUcUHVeHbEDyWduOrdeZ3FpjdA7MDMBuB2R9gdhtNYjCFKG7e0P9F_-P6Br8abnc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2616383247</pqid></control><display><type>article</type><title>Code Smells Detection and Visualization: A Systematic Literature Review</title><source>Springer Link</source><creator>Pereira dos Reis, José ; Brito e Abreu, Fernando ; de Figueiredo Carneiro, Glauco ; Anslow, Craig</creator><creatorcontrib>Pereira dos Reis, José ; Brito e Abreu, Fernando ; de Figueiredo Carneiro, Glauco ; Anslow, Craig</creatorcontrib><description>Code smells tend to compromise software quality and also demand more effort by developers to maintain and evolve the application throughout its life-cycle. They have long been catalogued with corresponding mitigating solutions called refactoring operations. Researchers have argued that due to the subjectiveness of the code smells detection process, proposing an effective use of automatic support for this end is a non trivial task. 
This systematic literature review (SLR) has a twofold goal: the first is to identify the main code smells detection techniques and tools discussed in the literature, and the second is to analyze to which extent visual techniques have been applied to support the former. Over eighty primary studies indexed in major scientific repositories were identified by our search string in this SLR. Then, following existing best practices for secondary studies, we applied inclusion/exclusion criteria to select the most relevant works, extract their features and classify them. We found that the most commonly used approaches to code smells detection are search-based (30.1%), metric-based (24.1%), and symptom-based approaches (19.3%). Most of the studies (83.1%) use open-source software, with the Java language occupying the first position (77.1%). In terms of code smells, God Class (51.8%), Feature Envy (33.7%), and Long Method (26.5%) are the most covered ones. Machine learning (ML) techniques are used in 35% of the studies, with genetic programming, decision tree, support vector machines and association rules being the most used algorithms. Around 80% of the studies only detect code smells, without providing visualization techniques. In visualization-based approaches several methods are used, such as: city metaphors, 3D visualization techniques, interactive ambient visualization, polymetric views, or graph models. This paper presents an up-to-date review on the state-of-the-art techniques and tools used for code smells detection and visualization. 
We confirm that the detection of code smells is a non trivial task, and there is still a lot of work to be done in terms of: reducing the subjectivity associated with the definition and detection of code smells; increasing the diversity of detected code smells and of supported programming languages; constructing and sharing oracles and datasets to facilitate the replication of code smells detection and visualization techniques validation experiments.</description><identifier>ISSN: 1134-3060</identifier><identifier>EISSN: 1886-1784</identifier><identifier>DOI: 10.1007/s11831-021-09566-x</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Best practice ; Decision trees ; Engineering ; Feature extraction ; Genetic algorithms ; Literature reviews ; Machine learning ; Mathematical and Computational Engineering ; Metaphor ; Open source software ; Programming languages ; Review Article ; Software ; Source code ; State-of-the-art reviews ; Support vector machines ; Systematic review ; Visualization</subject><ispartof>Archives of computational methods in engineering, 2022-01, Vol.29 (1), p.47-94</ispartof><rights>CIMNE, Barcelona, Spain 2021</rights><rights>CIMNE, Barcelona, Spain 2021.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3</citedby><cites>FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3</cites><orcidid>0000-0002-9086-4122 ; 0000-0002-2505-9565 ; 0000-0001-6241-1612 ; 0000-0001-8064-6300</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Pereira dos Reis, José</creatorcontrib><creatorcontrib>Brito e Abreu, 
Fernando</creatorcontrib><creatorcontrib>de Figueiredo Carneiro, Glauco</creatorcontrib><creatorcontrib>Anslow, Craig</creatorcontrib><title>Code Smells Detection and Visualization: A Systematic Literature Review</title><title>Archives of computational methods in engineering</title><addtitle>Arch Computat Methods Eng</addtitle><description>Code smells tend to compromise software quality and also demand more effort by developers to maintain and evolve the application throughout its life-cycle. They have long been catalogued with corresponding mitigating solutions called refactoring operations. Researchers have argued that due to the subjectiveness of the code smells detection process, proposing an effective use of automatic support for this end is a non trivial task. This systematic literature review (SLR) has a twofold goal: the first is to identify the main code smells detection techniques and tools discussed in the literature, and the second is to analyze to which extent visual techniques have been applied to support the former. Over eighty primary studies indexed in major scientific repositories were identified by our search string in this SLR. Then, following existing best practices for secondary studies, we applied inclusion/exclusion criteria to select the most relevant works, extract their features and classify them. We found that the most commonly used approaches to code smells detection are search-based (30.1%), metric-based (24.1%), and symptom-based approaches (19.3%). Most of the studies (83.1%) use open-source software, with the Java language occupying the first position (77.1%). In terms of code smells, God Class (51.8%), Feature Envy (33.7%), and Long Method (26.5%) are the most covered ones. Machine learning (ML) techniques are used in 35% of the studies, with genetic programming, decision tree, support vector machines and association rules being the most used algorithms. 
Around 80% of the studies only detect code smells, without providing visualization techniques. In visualization-based approaches several methods are used, such as: city metaphors, 3D visualization techniques, interactive ambient visualization, polymetric views, or graph models. This paper presents an up-to-date review on the state-of-the-art techniques and tools used for code smells detection and visualization. We confirm that the detection of code smells is a non trivial task, and there is still a lot of work to be done in terms of: reducing the subjectivity associated with the definition and detection of code smells; increasing the diversity of detected code smells and of supported programming languages; constructing and sharing oracles and datasets to facilitate the replication of code smells detection and visualization techniques validation experiments.</description><subject>Best practice</subject><subject>Decision trees</subject><subject>Engineering</subject><subject>Feature extraction</subject><subject>Genetic algorithms</subject><subject>Literature reviews</subject><subject>Machine learning</subject><subject>Mathematical and Computational Engineering</subject><subject>Metaphor</subject><subject>Open source software</subject><subject>Programming languages</subject><subject>Review Article</subject><subject>Software</subject><subject>Source code</subject><subject>State-of-the-art reviews</subject><subject>Support vector machines</subject><subject>Systematic 
review</subject><subject>Visualization</subject><issn>1134-3060</issn><issn>1886-1784</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLw0AUhQdRsFb_gKsB19G5M5lH3ZWqVSgIVt0OSeZGUtKkziTa-uudGsGdi8t9cM658BFyDuwSGNNXAcAISBiPNZFKJdsDMgJjVALapIdxBpEmgil2TE5CWDEm08mEj8h81jqkyzXWdaA32GHRVW1Ds8bR1yr0WV19ZfvLNZ3S5S50uI5rQRdVhz7reo_0CT8q_DwlR2VWBzz77WPycnf7PLtPFo_zh9l0kRQpQJcIhZprVyrMpUOOnDvDCgdCayc1SCNzbhCzHFWRY55L4DlzXBsOSjJZijG5GHI3vn3vMXR21fa-iS8tV6CEETzVUcUHVeHbEDyWduOrdeZ3FpjdA7MDMBuB2R9gdhtNYjCFKG7e0P9F_-P6Br8abnc</recordid><startdate>20220101</startdate><enddate>20220101</enddate><creator>Pereira dos Reis, José</creator><creator>Brito e Abreu, Fernando</creator><creator>de Figueiredo Carneiro, Glauco</creator><creator>Anslow, Craig</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0002-9086-4122</orcidid><orcidid>https://orcid.org/0000-0002-2505-9565</orcidid><orcidid>https://orcid.org/0000-0001-6241-1612</orcidid><orcidid>https://orcid.org/0000-0001-8064-6300</orcidid></search><sort><creationdate>20220101</creationdate><title>Code Smells Detection and Visualization: A Systematic Literature Review</title><author>Pereira dos Reis, José ; Brito e Abreu, Fernando ; de Figueiredo Carneiro, Glauco ; Anslow, Craig</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Best practice</topic><topic>Decision trees</topic><topic>Engineering</topic><topic>Feature extraction</topic><topic>Genetic algorithms</topic><topic>Literature reviews</topic><topic>Machine learning</topic><topic>Mathematical and Computational 
Engineering</topic><topic>Metaphor</topic><topic>Open source software</topic><topic>Programming languages</topic><topic>Review Article</topic><topic>Software</topic><topic>Source code</topic><topic>State-of-the-art reviews</topic><topic>Support vector machines</topic><topic>Systematic review</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pereira dos Reis, José</creatorcontrib><creatorcontrib>Brito e Abreu, Fernando</creatorcontrib><creatorcontrib>de Figueiredo Carneiro, Glauco</creatorcontrib><creatorcontrib>Anslow, Craig</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>Archives of computational methods in engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pereira dos Reis, José</au><au>Brito e Abreu, Fernando</au><au>de Figueiredo Carneiro, Glauco</au><au>Anslow, Craig</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Code Smells Detection and Visualization: A Systematic Literature Review</atitle><jtitle>Archives of computational methods in engineering</jtitle><stitle>Arch Computat Methods Eng</stitle><date>2022-01-01</date><risdate>2022</risdate><volume>29</volume><issue>1</issue><spage>47</spage><epage>94</epage><pages>47-94</pages><issn>1134-3060</issn><eissn>1886-1784</eissn><abstract>Code smells tend to compromise software quality and also demand more effort by developers to maintain and evolve the application throughout its life-cycle. They have long been catalogued with corresponding mitigating solutions called refactoring operations. Researchers have argued that due to the subjectiveness of the code smells detection process, proposing an effective use of automatic support for this end is a non trivial task. 
This systematic literature review (SLR) has a twofold goal: the first is to identify the main code smells detection techniques and tools discussed in the literature, and the second is to analyze to which extent visual techniques have been applied to support the former. Over eighty primary studies indexed in major scientific repositories were identified by our search string in this SLR. Then, following existing best practices for secondary studies, we applied inclusion/exclusion criteria to select the most relevant works, extract their features and classify them. We found that the most commonly used approaches to code smells detection are search-based (30.1%), metric-based (24.1%), and symptom-based approaches (19.3%). Most of the studies (83.1%) use open-source software, with the Java language occupying the first position (77.1%). In terms of code smells, God Class (51.8%), Feature Envy (33.7%), and Long Method (26.5%) are the most covered ones. Machine learning (ML) techniques are used in 35% of the studies, with genetic programming, decision tree, support vector machines and association rules being the most used algorithms. Around 80% of the studies only detect code smells, without providing visualization techniques. In visualization-based approaches several methods are used, such as: city metaphors, 3D visualization techniques, interactive ambient visualization, polymetric views, or graph models. This paper presents an up-to-date review on the state-of-the-art techniques and tools used for code smells detection and visualization. 
We confirm that the detection of code smells is a non trivial task, and there is still a lot of work to be done in terms of: reducing the subjectivity associated with the definition and detection of code smells; increasing the diversity of detected code smells and of supported programming languages; constructing and sharing oracles and datasets to facilitate the replication of code smells detection and visualization techniques validation experiments.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s11831-021-09566-x</doi><tpages>48</tpages><orcidid>https://orcid.org/0000-0002-9086-4122</orcidid><orcidid>https://orcid.org/0000-0002-2505-9565</orcidid><orcidid>https://orcid.org/0000-0001-6241-1612</orcidid><orcidid>https://orcid.org/0000-0001-8064-6300</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1134-3060
ispartof	Archives of computational methods in engineering, 2022-01, Vol.29 (1), p.47-94
issn	1134-3060 1886-1784
language	eng
recordid	cdi_proquest_journals_2616383247
source	Springer Link
subjects	Best practice ; Decision trees ; Engineering ; Feature extraction ; Genetic algorithms ; Literature reviews ; Machine learning ; Mathematical and Computational Engineering ; Metaphor ; Open source software ; Programming languages ; Review Article ; Software ; Source code ; State-of-the-art reviews ; Support vector machines ; Systematic review ; Visualization
title	Code Smells Detection and Visualization: A Systematic Literature Review
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T14%3A24%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Code%20Smells%20Detection%20and%20Visualization:%20A%20Systematic%20Literature%20Review&rft.jtitle=Archives%20of%20computational%20methods%20in%20engineering&rft.au=Pereira%20dos%20Reis,%20Jos%C3%A9&rft.date=2022-01-01&rft.volume=29&rft.issue=1&rft.spage=47&rft.epage=94&rft.pages=47-94&rft.issn=1134-3060&rft.eissn=1886-1784&rft_id=info:doi/10.1007/s11831-021-09566-x&rft_dat=%3Cproquest_cross%3E2616383247%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c411t-36e727df6eb5de2e22d80cd1377d571585b28eeabe6cbebb512b0d278216505f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2616383247&rft_id=info:pmid/&rfr_iscdi=true