Loading…

Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images

The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector...

Full description

Saved in:

Bibliographic Details
Published in:	Remote sensing (Basel, Switzerland) Switzerland), 2022-02, Vol.14 (3), p.574
Main Authors:	Adugna, Tesfaye, Xu, Wenbo, Fan, Jinlong
Format:	Article
Language:	English
Subjects:	Algorithms Classification Classifiers Datasets Image classification Land cover land cover mapping large area Learning algorithms Libraries Machine learning machine learning (ML) Mapping Mathematical models Neural networks Parameters Performance evaluation Python Radiometers random forest (RF) Remote sensing Satellite imagery Scikit-Learn (sklearn) Support vector machines
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23
cites	cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23
container_end_page
container_issue	3
container_start_page	574
container_title	Remote sensing (Basel, Switzerland)
container_volume	14
creator	Adugna, Tesfaye Xu, Wenbo Fan, Jinlong
description	The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.
doi_str_mv	10.3390/rs14030574
format	article
fullrecord	<record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_653b9e865e124752999e548872f174c2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_653b9e865e124752999e548872f174c2</doaj_id><sourcerecordid>2627828019</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</originalsourceid><addsrcrecordid>eNpNkdtq3DAQhk1oICHJTZ5A0LuCG50tXRbTbRa2BHIo9ErI8nijxetxJG-hT5DXjt0taediTnzzD8wUxTWjn4Ww9CZlJqmgqpInxTmnFS8lt_zDf_lZcZXzjs4mBLNUnhevNe5Hn2LGgWBH7v3Q4p6sMEGeyFyQh8M4YprIDwgTJvLdh-c4AKl7n3PsIqRMurl_D9uIg-_JZhmq8Rcs7DjGYUue8uJr9CnDDGbsD9MMk9XPUtRkvfdbyJfFaef7DFd_40XxuPr6WN-Wm7tv6_rLpgxCs6kMSrLQemCqrbTglW2El15L1miuoTHamMDaRoOxzCsNoCxjVklOaWgbLi6K9VG2Rb9zY4p7n3479NH9aWDaOp-mGHpwWonGgtEKGJeV4tZaUNKYineskmHR-njUGhO-HOZ7uR0e0nyD7LjmleGGMjtTn45USJhzgu59K6NueZv79zbxBpwNiJI</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627828019</pqid></control><display><type>article</type><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><source>Publicly Available Content Database</source><creator>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</creator><creatorcontrib>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</creatorcontrib><description>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</description><identifier>ISSN: 2072-4292</identifier><identifier>EISSN: 2072-4292</identifier><identifier>DOI: 10.3390/rs14030574</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Algorithms ; Classification ; Classifiers ; Datasets ; Image classification ; Land cover ; land cover mapping ; large area ; Learning algorithms ; Libraries ; Machine learning ; machine learning (ML) ; Mapping ; Mathematical models ; Neural networks ; Parameters ; Performance evaluation ; Python ; Radiometers ; random forest (RF) ; Remote sensing ; Satellite imagery ; Scikit-Learn (sklearn) ; Support vector machines</subject><ispartof>Remote sensing (Basel, Switzerland), 2022-02, Vol.14 (3), p.574</ispartof><rights>2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</citedby><cites>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</cites><orcidid>0000-0003-3235-587X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2627828019/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2627828019?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,25753,27924,27925,37012,44590,75126</link.rule.ids></links><search><creatorcontrib>Adugna, Tesfaye</creatorcontrib><creatorcontrib>Xu, Wenbo</creatorcontrib><creatorcontrib>Fan, Jinlong</creatorcontrib><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><title>Remote sensing (Basel, Switzerland)</title><description>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Datasets</subject><subject>Image classification</subject><subject>Land cover</subject><subject>land cover mapping</subject><subject>large area</subject><subject>Learning algorithms</subject><subject>Libraries</subject><subject>Machine learning</subject><subject>machine learning (ML)</subject><subject>Mapping</subject><subject>Mathematical models</subject><subject>Neural networks</subject><subject>Parameters</subject><subject>Performance evaluation</subject><subject>Python</subject><subject>Radiometers</subject><subject>random forest (RF)</subject><subject>Remote sensing</subject><subject>Satellite imagery</subject><subject>Scikit-Learn (sklearn)</subject><subject>Support vector machines</subject><issn>2072-4292</issn><issn>2072-4292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpNkdtq3DAQhk1oICHJTZ5A0LuCG50tXRbTbRa2BHIo9ErI8nijxetxJG-hT5DXjt0taediTnzzD8wUxTWjn4Ww9CZlJqmgqpInxTmnFS8lt_zDf_lZcZXzjs4mBLNUnhevNe5Hn2LGgWBH7v3Q4p6sMEGeyFyQh8M4YprIDwgTJvLdh-c4AKl7n3PsIqRMurl_D9uIg-_JZhmq8Rcs7DjGYUue8uJr9CnDDGbsD9MMk9XPUtRkvfdbyJfFaef7DFd_40XxuPr6WN-Wm7tv6_rLpgxCs6kMSrLQemCqrbTglW2El15L1miuoTHamMDaRoOxzCsNoCxjVklOaWgbLi6K9VG2Rb9zY4p7n3479NH9aWDaOp-mGHpwWonGgtEKGJeV4tZaUNKYineskmHR-njUGhO-HOZ7uR0e0nyD7LjmleGGMjtTn45USJhzgu59K6NueZv79zbxBpwNiJI</recordid><startdate>20220201</startdate><enddate>20220201</enddate><creator>Adugna, Tesfaye</creator><creator>Xu, Wenbo</creator><creator>Fan, Jinlong</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7QR</scope><scope>7SC</scope><scope>7SE</scope><scope>7SN</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>HCIFZ</scope><scope>JG9</scope><scope>JQ2</scope><scope>KR7</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PCBAR</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3235-587X</orcidid></search><sort><creationdate>20220201</creationdate><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><author>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Datasets</topic><topic>Image classification</topic><topic>Land cover</topic><topic>land cover mapping</topic><topic>large area</topic><topic>Learning algorithms</topic><topic>Libraries</topic><topic>Machine learning</topic><topic>machine learning (ML)</topic><topic>Mapping</topic><topic>Mathematical models</topic><topic>Neural networks</topic><topic>Parameters</topic><topic>Performance evaluation</topic><topic>Python</topic><topic>Radiometers</topic><topic>random forest (RF)</topic><topic>Remote sensing</topic><topic>Satellite imagery</topic><topic>Scikit-Learn (sklearn)</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Adugna, Tesfaye</creatorcontrib><creatorcontrib>Xu, Wenbo</creatorcontrib><creatorcontrib>Fan, Jinlong</creatorcontrib><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Ecology Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Proquest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Earth, Atmospheric & Aquatic Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Earth, Atmospheric & Aquatic Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>Directory of Open Access Journals</collection><jtitle>Remote sensing (Basel, Switzerland)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Adugna, Tesfaye</au><au>Xu, Wenbo</au><au>Fan, Jinlong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</atitle><jtitle>Remote sensing (Basel, Switzerland)</jtitle><date>2022-02-01</date><risdate>2022</risdate><volume>14</volume><issue>3</issue><spage>574</spage><pages>574-</pages><issn>2072-4292</issn><eissn>2072-4292</eissn><abstract>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/rs14030574</doi><orcidid>https://orcid.org/0000-0003-3235-587X</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2072-4292
ispartof	Remote sensing (Basel, Switzerland), 2022-02, Vol.14 (3), p.574
issn	2072-4292 2072-4292
language	eng
recordid	cdi_doaj_primary_oai_doaj_org_article_653b9e865e124752999e548872f174c2
source	Publicly Available Content Database
subjects	Algorithms Classification Classifiers Datasets Image classification Land cover land cover mapping large area Learning algorithms Libraries Machine learning machine learning (ML) Mapping Mathematical models Neural networks Parameters Performance evaluation Python Radiometers random forest (RF) Remote sensing Satellite imagery Scikit-Learn (sklearn) Support vector machines
title	Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T10%3A48%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20Random%20Forest%20and%20Support%20Vector%20Machine%20Classifiers%20for%20Regional%20Land%20Cover%20Mapping%20Using%20Coarse%20Resolution%20FY-3C%20Images&rft.jtitle=Remote%20sensing%20(Basel,%20Switzerland)&rft.au=Adugna,%20Tesfaye&rft.date=2022-02-01&rft.volume=14&rft.issue=3&rft.spage=574&rft.pages=574-&rft.issn=2072-4292&rft.eissn=2072-4292&rft_id=info:doi/10.3390/rs14030574&rft_dat=%3Cproquest_doaj_%3E2627828019%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2627828019&rft_id=info:pmid/&rfr_iscdi=true