Loading…

Two-phase training mitigates class imbalance for camera trap image classification with CNNs

By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the...

Full description

Saved in:

Bibliographic Details
Published in:	arXiv.org 2021-12
Main Authors:	Malik, Farjad, Wouters, Simon, Cartuyvels, Ruben, Ghadery, Erfan, Marie-Francine Moens
Format:	Article
Language:	English
Subjects:	Biodiversity Cameras Datasets Ecological effects Image classification Machine learning Oversampling Performance enhancement Training Wildlife conservation
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Malik, Farjad Wouters, Simon Cartuyvels, Ruben Ghadery, Erfan Marie-Francine Moens
description	By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.
format	article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2615377064</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2615377064</sourcerecordid><originalsourceid>FETCH-proquest_journals_26153770643</originalsourceid><addsrcrecordid>eNqNyr0OgjAUQOHGxESivEMTZ5LS8uNONE5MbA7kSgpcAi32lvD6YvQBnM5wvh0LpFJxdEmkPLCQaBBCyCyXaaoC9qhWG809kObeARo0HZ_QYwdeE29GIOI4PWEE02jeWscbmLSDj563A53-KmyxAY_W8BV9z4uypBPbtzCSDn89svPtWhX3aHb2tWjy9WAXZ7ZVyyxOVZ6LLFH_qTeUxkMj</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2615377064</pqid></control><display><type>article</type><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><source>Publicly Available Content Database</source><creator>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</creator><creatorcontrib>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</creatorcontrib><description>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Biodiversity ; Cameras ; Datasets ; Ecological effects ; Image classification ; Machine learning ; Oversampling ; Performance enhancement ; Training ; Wildlife conservation</subject><ispartof>arXiv.org, 2021-12</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2615377064?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Malik, Farjad</creatorcontrib><creatorcontrib>Wouters, Simon</creatorcontrib><creatorcontrib>Cartuyvels, Ruben</creatorcontrib><creatorcontrib>Ghadery, Erfan</creatorcontrib><creatorcontrib>Marie-Francine Moens</creatorcontrib><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><title>arXiv.org</title><description>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</description><subject>Biodiversity</subject><subject>Cameras</subject><subject>Datasets</subject><subject>Ecological effects</subject><subject>Image classification</subject><subject>Machine learning</subject><subject>Oversampling</subject><subject>Performance enhancement</subject><subject>Training</subject><subject>Wildlife conservation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNyr0OgjAUQOHGxESivEMTZ5LS8uNONE5MbA7kSgpcAi32lvD6YvQBnM5wvh0LpFJxdEmkPLCQaBBCyCyXaaoC9qhWG809kObeARo0HZ_QYwdeE29GIOI4PWEE02jeWscbmLSDj563A53-KmyxAY_W8BV9z4uypBPbtzCSDn89svPtWhX3aHb2tWjy9WAXZ7ZVyyxOVZ6LLFH_qTeUxkMj</recordid><startdate>20211229</startdate><enddate>20211229</enddate><creator>Malik, Farjad</creator><creator>Wouters, Simon</creator><creator>Cartuyvels, Ruben</creator><creator>Ghadery, Erfan</creator><creator>Marie-Francine Moens</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20211229</creationdate><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><author>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26153770643</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Biodiversity</topic><topic>Cameras</topic><topic>Datasets</topic><topic>Ecological effects</topic><topic>Image classification</topic><topic>Machine learning</topic><topic>Oversampling</topic><topic>Performance enhancement</topic><topic>Training</topic><topic>Wildlife conservation</topic><toplevel>online_resources</toplevel><creatorcontrib>Malik, Farjad</creatorcontrib><creatorcontrib>Wouters, Simon</creatorcontrib><creatorcontrib>Cartuyvels, Ruben</creatorcontrib><creatorcontrib>Ghadery, Erfan</creatorcontrib><creatorcontrib>Marie-Francine Moens</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Malik, Farjad</au><au>Wouters, Simon</au><au>Cartuyvels, Ruben</au><au>Ghadery, Erfan</au><au>Marie-Francine Moens</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</atitle><jtitle>arXiv.org</jtitle><date>2021-12-29</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2021-12
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2615377064
source	Publicly Available Content Database
subjects	Biodiversity Cameras Datasets Ecological effects Image classification Machine learning Oversampling Performance enhancement Training Wildlife conservation
title	Two-phase training mitigates class imbalance for camera trap image classification with CNNs
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T22%3A08%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Two-phase%20training%20mitigates%20class%20imbalance%20for%20camera%20trap%20image%20classification%20with%20CNNs&rft.jtitle=arXiv.org&rft.au=Malik,%20Farjad&rft.date=2021-12-29&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2615377064%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_26153770643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2615377064&rft_id=info:pmid/&rfr_iscdi=true