Loading…
Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images
The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector...
Saved in:
Published in: | Remote sensing (Basel, Switzerland) Switzerland), 2022-02, Vol.14 (3), p.574 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23 |
---|---|
cites | cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23 |
container_end_page | |
container_issue | 3 |
container_start_page | 574 |
container_title | Remote sensing (Basel, Switzerland) |
container_volume | 14 |
creator | Adugna, Tesfaye Xu, Wenbo Fan, Jinlong |
description | The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms. |
doi_str_mv | 10.3390/rs14030574 |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_653b9e865e124752999e548872f174c2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_653b9e865e124752999e548872f174c2</doaj_id><sourcerecordid>2627828019</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</originalsourceid><addsrcrecordid>eNpNkdtq3DAQhk1oICHJTZ5A0LuCG50tXRbTbRa2BHIo9ErI8nijxetxJG-hT5DXjt0taediTnzzD8wUxTWjn4Ww9CZlJqmgqpInxTmnFS8lt_zDf_lZcZXzjs4mBLNUnhevNe5Hn2LGgWBH7v3Q4p6sMEGeyFyQh8M4YprIDwgTJvLdh-c4AKl7n3PsIqRMurl_D9uIg-_JZhmq8Rcs7DjGYUue8uJr9CnDDGbsD9MMk9XPUtRkvfdbyJfFaef7DFd_40XxuPr6WN-Wm7tv6_rLpgxCs6kMSrLQemCqrbTglW2El15L1miuoTHamMDaRoOxzCsNoCxjVklOaWgbLi6K9VG2Rb9zY4p7n3479NH9aWDaOp-mGHpwWonGgtEKGJeV4tZaUNKYineskmHR-njUGhO-HOZ7uR0e0nyD7LjmleGGMjtTn45USJhzgu59K6NueZv79zbxBpwNiJI</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627828019</pqid></control><display><type>article</type><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><source>Publicly Available Content Database</source><creator>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</creator><creatorcontrib>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</creatorcontrib><description>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</description><identifier>ISSN: 2072-4292</identifier><identifier>EISSN: 2072-4292</identifier><identifier>DOI: 10.3390/rs14030574</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Algorithms ; Classification ; Classifiers ; Datasets ; Image classification ; Land cover ; land cover mapping ; large area ; Learning algorithms ; Libraries ; Machine learning ; machine learning (ML) ; Mapping ; Mathematical models ; Neural networks ; Parameters ; Performance evaluation ; Python ; Radiometers ; random forest (RF) ; Remote sensing ; Satellite imagery ; Scikit-Learn (sklearn) ; Support vector machines</subject><ispartof>Remote sensing (Basel, Switzerland), 2022-02, Vol.14 (3), p.574</ispartof><rights>2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</citedby><cites>FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</cites><orcidid>0000-0003-3235-587X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2627828019/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2627828019?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,25753,27924,27925,37012,44590,75126</link.rule.ids></links><search><creatorcontrib>Adugna, Tesfaye</creatorcontrib><creatorcontrib>Xu, Wenbo</creatorcontrib><creatorcontrib>Fan, Jinlong</creatorcontrib><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><title>Remote sensing (Basel, Switzerland)</title><description>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Datasets</subject><subject>Image classification</subject><subject>Land cover</subject><subject>land cover mapping</subject><subject>large area</subject><subject>Learning algorithms</subject><subject>Libraries</subject><subject>Machine learning</subject><subject>machine learning (ML)</subject><subject>Mapping</subject><subject>Mathematical models</subject><subject>Neural networks</subject><subject>Parameters</subject><subject>Performance evaluation</subject><subject>Python</subject><subject>Radiometers</subject><subject>random forest (RF)</subject><subject>Remote sensing</subject><subject>Satellite imagery</subject><subject>Scikit-Learn (sklearn)</subject><subject>Support vector machines</subject><issn>2072-4292</issn><issn>2072-4292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpNkdtq3DAQhk1oICHJTZ5A0LuCG50tXRbTbRa2BHIo9ErI8nijxetxJG-hT5DXjt0taediTnzzD8wUxTWjn4Ww9CZlJqmgqpInxTmnFS8lt_zDf_lZcZXzjs4mBLNUnhevNe5Hn2LGgWBH7v3Q4p6sMEGeyFyQh8M4YprIDwgTJvLdh-c4AKl7n3PsIqRMurl_D9uIg-_JZhmq8Rcs7DjGYUue8uJr9CnDDGbsD9MMk9XPUtRkvfdbyJfFaef7DFd_40XxuPr6WN-Wm7tv6_rLpgxCs6kMSrLQemCqrbTglW2El15L1miuoTHamMDaRoOxzCsNoCxjVklOaWgbLi6K9VG2Rb9zY4p7n3479NH9aWDaOp-mGHpwWonGgtEKGJeV4tZaUNKYineskmHR-njUGhO-HOZ7uR0e0nyD7LjmleGGMjtTn45USJhzgu59K6NueZv79zbxBpwNiJI</recordid><startdate>20220201</startdate><enddate>20220201</enddate><creator>Adugna, Tesfaye</creator><creator>Xu, Wenbo</creator><creator>Fan, Jinlong</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7QR</scope><scope>7SC</scope><scope>7SE</scope><scope>7SN</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>HCIFZ</scope><scope>JG9</scope><scope>JQ2</scope><scope>KR7</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PCBAR</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3235-587X</orcidid></search><sort><creationdate>20220201</creationdate><title>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</title><author>Adugna, Tesfaye ; Xu, Wenbo ; Fan, Jinlong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Datasets</topic><topic>Image classification</topic><topic>Land cover</topic><topic>land cover mapping</topic><topic>large area</topic><topic>Learning algorithms</topic><topic>Libraries</topic><topic>Machine learning</topic><topic>machine learning (ML)</topic><topic>Mapping</topic><topic>Mathematical models</topic><topic>Neural networks</topic><topic>Parameters</topic><topic>Performance evaluation</topic><topic>Python</topic><topic>Radiometers</topic><topic>random forest (RF)</topic><topic>Remote sensing</topic><topic>Satellite imagery</topic><topic>Scikit-Learn (sklearn)</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Adugna, Tesfaye</creatorcontrib><creatorcontrib>Xu, Wenbo</creatorcontrib><creatorcontrib>Fan, Jinlong</creatorcontrib><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Ecology Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Proquest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Earth, Atmospheric & Aquatic Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Earth, Atmospheric & Aquatic Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>Directory of Open Access Journals</collection><jtitle>Remote sensing (Basel, Switzerland)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Adugna, Tesfaye</au><au>Xu, Wenbo</au><au>Fan, Jinlong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images</atitle><jtitle>Remote sensing (Basel, Switzerland)</jtitle><date>2022-02-01</date><risdate>2022</risdate><volume>14</volume><issue>3</issue><spage>574</spage><pages>574-</pages><issn>2072-4292</issn><eissn>2072-4292</eissn><abstract>The type of algorithm employed to classify remote sensing imageries plays a great role in affecting the accuracy. In recent decades, machine learning (ML) has received great attention due to its robustness in remote sensing image classification. In this regard, random forest (RF) and support vector machine (SVM) are two of the most widely used ML algorithms to generate land cover (LC) maps from satellite imageries. Although several comparisons have been conducted between these two algorithms, the findings are contradicting. Moreover, the comparisons were made on local-scale LC map generation either from high or medium resolution images using various software, but not Python. In this paper, we compared the performance of these two algorithms for large area LC mapping of parts of Africa using coarse resolution imageries in the Python platform by the employing Scikit-Learn (sklearn) library. We employed a big dataset, 297 metrics, comprised of systematically selected 9-month composite FegnYun-3C (FY-3C) satellite images with 1 km resolution. Several experiments were performed using a range of values to determine the best values for the two most important parameters of each classifier, the number of trees and the number of variables, for RF, and penalty value and gamma for SVM, and to obtain the best model of each algorithm. Our results showed that RF outperformed SVM yielding 0.86 (OA) and 0.83 (k), which are 1–2% and 3% higher than the best SVM model, respectively. In addition, RF performed better in mixed class classification; however, it performed almost the same when classifying relatively pure classes with distinct spectral variation, i.e., consisting of less mixed pixels. Furthermore, RF is more efficient in handling large input datasets where the SVM fails. Hence, RF is a more robust ML algorithm especially for heterogeneous large area mapping using coarse resolution images. Finally, default parameter values in the sklearn library work well for satellite image classification with minor/or no adjustment for these algorithms.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/rs14030574</doi><orcidid>https://orcid.org/0000-0003-3235-587X</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2072-4292 |
ispartof | Remote sensing (Basel, Switzerland), 2022-02, Vol.14 (3), p.574 |
issn | 2072-4292 2072-4292 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_653b9e865e124752999e548872f174c2 |
source | Publicly Available Content Database |
subjects | Algorithms Classification Classifiers Datasets Image classification Land cover land cover mapping large area Learning algorithms Libraries Machine learning machine learning (ML) Mapping Mathematical models Neural networks Parameters Performance evaluation Python Radiometers random forest (RF) Remote sensing Satellite imagery Scikit-Learn (sklearn) Support vector machines |
title | Comparison of Random Forest and Support Vector Machine Classifiers for Regional Land Cover Mapping Using Coarse Resolution FY-3C Images |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T10%3A48%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20Random%20Forest%20and%20Support%20Vector%20Machine%20Classifiers%20for%20Regional%20Land%20Cover%20Mapping%20Using%20Coarse%20Resolution%20FY-3C%20Images&rft.jtitle=Remote%20sensing%20(Basel,%20Switzerland)&rft.au=Adugna,%20Tesfaye&rft.date=2022-02-01&rft.volume=14&rft.issue=3&rft.spage=574&rft.pages=574-&rft.issn=2072-4292&rft.eissn=2072-4292&rft_id=info:doi/10.3390/rs14030574&rft_dat=%3Cproquest_doaj_%3E2627828019%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c361t-c541cdae15d763279b3a4a641b626eb8688c1db6e891a56ee5911954200cdb23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2627828019&rft_id=info:pmid/&rfr_iscdi=true |