Loading…
Two-phase training mitigates class imbalance for camera trap image classification with CNNs
By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the...
Saved in:
Published in: | arXiv.org 2021-12 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Malik, Farjad Wouters, Simon Cartuyvels, Ruben Ghadery, Erfan Marie-Francine Moens |
description | By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2615377064</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2615377064</sourcerecordid><originalsourceid>FETCH-proquest_journals_26153770643</originalsourceid><addsrcrecordid>eNqNyr0OgjAUQOHGxESivEMTZ5LS8uNONE5MbA7kSgpcAi32lvD6YvQBnM5wvh0LpFJxdEmkPLCQaBBCyCyXaaoC9qhWG809kObeARo0HZ_QYwdeE29GIOI4PWEE02jeWscbmLSDj563A53-KmyxAY_W8BV9z4uypBPbtzCSDn89svPtWhX3aHb2tWjy9WAXZ7ZVyyxOVZ6LLFH_qTeUxkMj</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2615377064</pqid></control><display><type>article</type><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><source>Publicly Available Content Database</source><creator>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</creator><creatorcontrib>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</creatorcontrib><description>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Biodiversity ; Cameras ; Datasets ; Ecological effects ; Image classification ; Machine learning ; Oversampling ; Performance enhancement ; Training ; Wildlife conservation</subject><ispartof>arXiv.org, 2021-12</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2615377064?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Malik, Farjad</creatorcontrib><creatorcontrib>Wouters, Simon</creatorcontrib><creatorcontrib>Cartuyvels, Ruben</creatorcontrib><creatorcontrib>Ghadery, Erfan</creatorcontrib><creatorcontrib>Marie-Francine Moens</creatorcontrib><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><title>arXiv.org</title><description>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</description><subject>Biodiversity</subject><subject>Cameras</subject><subject>Datasets</subject><subject>Ecological effects</subject><subject>Image classification</subject><subject>Machine learning</subject><subject>Oversampling</subject><subject>Performance enhancement</subject><subject>Training</subject><subject>Wildlife conservation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNyr0OgjAUQOHGxESivEMTZ5LS8uNONE5MbA7kSgpcAi32lvD6YvQBnM5wvh0LpFJxdEmkPLCQaBBCyCyXaaoC9qhWG809kObeARo0HZ_QYwdeE29GIOI4PWEE02jeWscbmLSDj563A53-KmyxAY_W8BV9z4uypBPbtzCSDn89svPtWhX3aHb2tWjy9WAXZ7ZVyyxOVZ6LLFH_qTeUxkMj</recordid><startdate>20211229</startdate><enddate>20211229</enddate><creator>Malik, Farjad</creator><creator>Wouters, Simon</creator><creator>Cartuyvels, Ruben</creator><creator>Ghadery, Erfan</creator><creator>Marie-Francine Moens</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20211229</creationdate><title>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</title><author>Malik, Farjad ; Wouters, Simon ; Cartuyvels, Ruben ; Ghadery, Erfan ; Marie-Francine Moens</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26153770643</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Biodiversity</topic><topic>Cameras</topic><topic>Datasets</topic><topic>Ecological effects</topic><topic>Image classification</topic><topic>Machine learning</topic><topic>Oversampling</topic><topic>Performance enhancement</topic><topic>Training</topic><topic>Wildlife conservation</topic><toplevel>online_resources</toplevel><creatorcontrib>Malik, Farjad</creatorcontrib><creatorcontrib>Wouters, Simon</creatorcontrib><creatorcontrib>Cartuyvels, Ruben</creatorcontrib><creatorcontrib>Ghadery, Erfan</creatorcontrib><creatorcontrib>Marie-Francine Moens</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Malik, Farjad</au><au>Wouters, Simon</au><au>Cartuyvels, Ruben</au><au>Ghadery, Erfan</au><au>Marie-Francine Moens</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Two-phase training mitigates class imbalance for camera trap image classification with CNNs</atitle><jtitle>arXiv.org</jtitle><date>2021-12-29</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-12 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2615377064 |
source | Publicly Available Content Database |
subjects | Biodiversity Cameras Datasets Ecological effects Image classification Machine learning Oversampling Performance enhancement Training Wildlife conservation |
title | Two-phase training mitigates class imbalance for camera trap image classification with CNNs |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T22%3A08%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Two-phase%20training%20mitigates%20class%20imbalance%20for%20camera%20trap%20image%20classification%20with%20CNNs&rft.jtitle=arXiv.org&rft.au=Malik,%20Farjad&rft.date=2021-12-29&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2615377064%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_26153770643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2615377064&rft_id=info:pmid/&rfr_iscdi=true |