Loading…

Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images

The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector...

Full description

Saved in:
Bibliographic Details
Published in:Remote sensing (Basel, Switzerland) Switzerland), 2022-02, Vol.14 (3), p.574
Main Authors: Adugna, Tesfaye, Xu, Wenbo, Fan, Jinlong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23
cites cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23
container_end_page
container_issue 3
container_start_page 574
container_title Remote sensing (Basel, Switzerland)
container_volume 14
creator Adugna, Tesfaye
Xu, Wenbo
Fan, Jinlong
description The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.
doi_str_mv 10.3390/rs14030574
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_653b9e865e124752999e548872f174c2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_653b9e865e124752999e548872f174c2</doaj_id><sourcerecordid>2627828019</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</originalsourceid><addsrcrecordid>eNpNkdtq3DAQhk1oICHJTZ5A0LuCG50tXRbTbRa2BHIo9ErI8nijxetxJG-hT5DXjt0taediTnzzD8wUxTWjn4Ww9CZlJqmgqpInxTmnFS8lt_zDf_lZcZXzjs4mBLNUnhevNe5Hn2LGgWBH7v3Q4p6sMEGeyFyQh8M4YprIDwgTJvLdh-c4AKl7n3PsIqRMurl_D9uIg-_JZhmq8Rcs7DjGYUue8uJr9CnDDGbsD9MMk9XPUtRkvfdbyJfFaef7DFd_40XxuPr6WN-Wm7tv6_rLpgxCs6kMSrLQemCqrbTglW2El15L1miuoTHamMDaRoOxzCsNoCxjVklOaWgbLi6K9VG2Rb9zY4p7n3479NH9aWDaOp-mGHpwWonGgtEKGJeV4tZaUNKYineskmHR-njUGhO-HOZ7uR0e0nyD7LjmleGGMjtTn45USJhzgu59K6NueZv79zbxBpwNiJI</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627828019</pqid></control><display><type>article</type><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><source>Publicly Available Content Database</source><creator>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</creator><creatorcontrib>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</creatorcontrib><description>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</description><identifier>ISSN: 2072-4292</identifier><identifier>EISSN: 2072-4292</identifier><identifier>DOI: 10.3390/rs14030574</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Algorithms ; Classification ; Classifiers ; Datasets ; Image classification ; Land cover ; land cover mapping ; large area ; Learning algorithms ; Libraries ; Machine learning ; machine learning (ML) ; Mapping ; Mathematical models ; Neural networks ; Parameters ; Performance evaluation ; Python ; Radiometers ; random forest (RF) ; Remote sensing ; Satellite imagery ; Scikit-Learn (sklearn) ; Support vector machines</subject><ispartof>Remote sensing (Basel, Switzerland), 2022-02, Vol.14 (3), p.574</ispartof><rights>2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</citedby><cites>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</cites><orcidid>0000-0003-3235-587X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2627828019/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2627828019?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,25753,27924,27925,37012,44590,75126</link.rule.ids></links><search><creatorcontrib>Adugna, Tesfaye</creatorcontrib><creatorcontrib>Xu, Wenbo</creatorcontrib><creatorcontrib>Fan, Jinlong</creatorcontrib><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><title>Remote sensing (Basel, Switzerland)</title><description>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Datasets</subject><subject>Image classification</subject><subject>Land cover</subject><subject>land cover mapping</subject><subject>large area</subject><subject>Learning algorithms</subject><subject>Libraries</subject><subject>Machine learning</subject><subject>machine learning (ML)</subject><subject>Mapping</subject><subject>Mathematical models</subject><subject>Neural networks</subject><subject>Parameters</subject><subject>Performance evaluation</subject><subject>Python</subject><subject>Radiometers</subject><subject>random forest (RF)</subject><subject>Remote sensing</subject><subject>Satellite imagery</subject><subject>Scikit-Learn (sklearn)</subject><subject>Support vector machines</subject><issn>2072-4292</issn><issn>2072-4292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpNkdtq3DAQhk1oICHJTZ5A0LuCG50tXRbTbRa2BHIo9ErI8nijxetxJG-hT5DXjt0taediTnzzD8wUxTWjn4Ww9CZlJqmgqpInxTmnFS8lt_zDf_lZcZXzjs4mBLNUnhevNe5Hn2LGgWBH7v3Q4p6sMEGeyFyQh8M4YprIDwgTJvLdh-c4AKl7n3PsIqRMurl_D9uIg-_JZhmq8Rcs7DjGYUue8uJr9CnDDGbsD9MMk9XPUtRkvfdbyJfFaef7DFd_40XxuPr6WN-Wm7tv6_rLpgxCs6kMSrLQemCqrbTglW2El15L1miuoTHamMDaRoOxzCsNoCxjVklOaWgbLi6K9VG2Rb9zY4p7n3479NH9aWDaOp-mGHpwWonGgtEKGJeV4tZaUNKYineskmHR-njUGhO-HOZ7uR0e0nyD7LjmleGGMjtTn45USJhzgu59K6NueZv79zbxBpwNiJI</recordid><startdate>20220201</startdate><enddate>20220201</enddate><creator>Adugna, Tesfaye</creator><creator>Xu, Wenbo</creator><creator>Fan, Jinlong</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7QR</scope><scope>7SC</scope><scope>7SE</scope><scope>7SN</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>HCIFZ</scope><scope>JG9</scope><scope>JQ2</scope><scope>KR7</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PCBAR</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3235-587X</orcidid></search><sort><creationdate>20220201</creationdate><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><author>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Datasets</topic><topic>Image classification</topic><topic>Land cover</topic><topic>land cover mapping</topic><topic>large area</topic><topic>Learning algorithms</topic><topic>Libraries</topic><topic>Machine learning</topic><topic>machine learning (ML)</topic><topic>Mapping</topic><topic>Mathematical models</topic><topic>Neural networks</topic><topic>Parameters</topic><topic>Performance evaluation</topic><topic>Python</topic><topic>Radiometers</topic><topic>random forest (RF)</topic><topic>Remote sensing</topic><topic>Satellite imagery</topic><topic>Scikit-Learn (sklearn)</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Adugna, Tesfaye</creatorcontrib><creatorcontrib>Xu, Wenbo</creatorcontrib><creatorcontrib>Fan, Jinlong</creatorcontrib><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Ecology Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Proquest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Earth, Atmospheric &amp; Aquatic Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>Directory of Open Access Journals</collection><jtitle>Remote sensing (Basel, Switzerland)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Adugna, Tesfaye</au><au>Xu, Wenbo</au><au>Fan, Jinlong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</atitle><jtitle>Remote sensing (Basel, Switzerland)</jtitle><date>2022-02-01</date><risdate>2022</risdate><volume>14</volume><issue>3</issue><spage>574</spage><pages>574-</pages><issn>2072-4292</issn><eissn>2072-4292</eissn><abstract>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/rs14030574</doi><orcidid>https://orcid.org/0000-0003-3235-587X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2072-4292
ispartof Remote sensing (Basel, Switzerland), 2022-02, Vol.14 (3), p.574
issn 2072-4292
2072-4292
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_653b9e865e124752999e548872f174c2
source Publicly Available Content Database
subjects Algorithms
Classification
Classifiers
Datasets
Image classification
Land cover
land cover mapping
large area
Learning algorithms
Libraries
Machine learning
machine learning (ML)
Mapping
Mathematical models
Neural networks
Parameters
Performance evaluation
Python
Radiometers
random forest (RF)
Remote sensing
Satellite imagery
Scikit-Learn (sklearn)
Support vector machines
title Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T10%3A48%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20Random%20Forest%20and%20Support%20Vector%20Machine%20Classifiers%20for%20Regional%20Land%20Cover%20Mapping%20Using%20Coarse%20Resolution%20FY-3C%20Images&rft.jtitle=Remote%20sensing%20(Basel,%20Switzerland)&rft.au=Adugna,%20Tesfaye&rft.date=2022-02-01&rft.volume=14&rft.issue=3&rft.spage=574&rft.pages=574-&rft.issn=2072-4292&rft.eissn=2072-4292&rft_id=info:doi/10.3390/rs14030574&rft_dat=%3Cproquest_doaj_%3E2627828019%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2627828019&rft_id=info:pmid/&rfr_iscdi=true