Loading…

Two-phase training mitigates class imbalance for camera trap image classification with CNNs

By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2021-12
Main Authors: Malik, Farjad, Wouters, Simon, Cartuyvels, Ruben, Ghadery, Erfan, Marie-Francine Moens
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Malik, Farjad
Wouters, Simon
Cartuyvels, Ruben
Ghadery, Erfan
Marie-Francine Moens
description By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2615377064</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2615377064</sourcerecordid><originalsourceid>FETCH-proquest_journals_26153770643</originalsourceid><addsrcrecordid>eNqNyr0OgjAUQOHGxESivEMTZ5LS8uNONE5MbA7kSgpcAi32lvD6YvQBnM5wvh0LpFJxdEmkPLCQaBBCyCyXaaoC9qhWG809kObeARo0HZ_QYwdeE29GIOI4PWEE02jeWscbmLSDj563A53-KmyxAY_W8BV9z4uypBPbtzCSDn89svPtWhX3aHb2tWjy9WAXZ7ZVyyxOVZ6LLFH_qTeUxkMj</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2615377064</pqid></control><display><type>article</type><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><source>Publicly Available Content Database</source><creator>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</creator><creatorcontrib>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</creatorcontrib><description>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Biodiversity ; Cameras ; Datasets ; Ecological effects ; Image classification ; Machine learning ; Oversampling ; Performance enhancement ; Training ; Wildlife conservation</subject><ispartof>arXiv.org, 2021-12</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2615377064?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Malik, Farjad</creatorcontrib><creatorcontrib>Wouters, Simon</creatorcontrib><creatorcontrib>Cartuyvels, Ruben</creatorcontrib><creatorcontrib>Ghadery, Erfan</creatorcontrib><creatorcontrib>Marie-Francine Moens</creatorcontrib><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><title>arXiv.org</title><description>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</description><subject>Biodiversity</subject><subject>Cameras</subject><subject>Datasets</subject><subject>Ecological effects</subject><subject>Image classification</subject><subject>Machine learning</subject><subject>Oversampling</subject><subject>Performance enhancement</subject><subject>Training</subject><subject>Wildlife conservation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNyr0OgjAUQOHGxESivEMTZ5LS8uNONE5MbA7kSgpcAi32lvD6YvQBnM5wvh0LpFJxdEmkPLCQaBBCyCyXaaoC9qhWG809kObeARo0HZ_QYwdeE29GIOI4PWEE02jeWscbmLSDj563A53-KmyxAY_W8BV9z4uypBPbtzCSDn89svPtWhX3aHb2tWjy9WAXZ7ZVyyxOVZ6LLFH_qTeUxkMj</recordid><startdate>20211229</startdate><enddate>20211229</enddate><creator>Malik, Farjad</creator><creator>Wouters, Simon</creator><creator>Cartuyvels, Ruben</creator><creator>Ghadery, Erfan</creator><creator>Marie-Francine Moens</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20211229</creationdate><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><author>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26153770643</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Biodiversity</topic><topic>Cameras</topic><topic>Datasets</topic><topic>Ecological effects</topic><topic>Image classification</topic><topic>Machine learning</topic><topic>Oversampling</topic><topic>Performance enhancement</topic><topic>Training</topic><topic>Wildlife conservation</topic><toplevel>online_resources</toplevel><creatorcontrib>Malik, Farjad</creatorcontrib><creatorcontrib>Wouters, Simon</creatorcontrib><creatorcontrib>Cartuyvels, Ruben</creatorcontrib><creatorcontrib>Ghadery, Erfan</creatorcontrib><creatorcontrib>Marie-Francine Moens</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Malik, Farjad</au><au>Wouters, Simon</au><au>Cartuyvels, Ruben</au><au>Ghadery, Erfan</au><au>Marie-Francine Moens</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</atitle><jtitle>arXiv.org</jtitle><date>2021-12-29</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-12
issn 2331-8422
language eng
recordid cdi_proquest_journals_2615377064
source Publicly Available Content Database
subjects Biodiversity
Cameras
Datasets
Ecological effects
Image classification
Machine learning
Oversampling
Performance enhancement
Training
Wildlife conservation
title Two-phase training mitigates class imbalance for camera trap image classification with CNNs
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T22%3A08%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Two-phase%20training%20mitigates%20class%20imbalance%20for%20camera%20trap%20image%20classification%20with%20CNNs&rft.jtitle=arXiv.org&rft.au=Malik,%20Farjad&rft.date=2021-12-29&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2615377064%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_26153770643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2615377064&rft_id=info:pmid/&rfr_iscdi=true