Loading…

Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images

Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and effic...

Full description

Saved in:

Bibliographic Details
Published in:	arXiv.org 2023-03
Main Authors:	Du, Bowei, Huang, Yecheng, Chen, Jiaxin, Huang, Di
Format:	Article
Language:	English
Subjects:	Context Image enhancement Multilayers Network latency Object recognition Optimization Unmanned aerial vehicles
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Du, Bowei Huang, Yecheng Chen, Jiaxin Huang, Di
description	Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for compact foreground coverage, promoting both the accuracy and efficiency. Extensive experimental results on two major benchmarks, i.e. VisDrone and UAVDT, demonstrate that CEASC remarkably reduces the GFLOPs and accelerates the inference procedure when plugging into the typical state-of-the-art detection frameworks (e.g. RetinaNet and GFL V1) with competitive performance. Code is available at https://github.com/Cuogeihong/CEASC.
format	article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2791775873</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2791775873</sourcerecordid><originalsourceid>FETCH-proquest_journals_27917758733</originalsourceid><addsrcrecordid>eNqNjc0OAUEQhCcSCcE7dOIsWTPWcBT_Fw7cZdDYtabXdC8e3xw8gKSSL6l8qaqppjam3xsNtG6oDnOeJIkeWp2mpqnC5OxKyV4Iu9IFRpiSf1FRSUbeFbBBeVO4M7wzucGyoGMsoyL4EZj7m_MnfKAXuFCAhWPBANtjjieBGUpEnIGYWSCPsH64K3Jb1S-uYOz82FLdxXw_XfXKQM8KWQ45VSG-80Hbcd_adGSN-c_6AqHMS7I</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2791775873</pqid></control><display><type>article</type><title>Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images</title><source>Publicly Available Content Database</source><creator>Du, Bowei ; Huang, Yecheng ; Chen, Jiaxin ; Huang, Di</creator><creatorcontrib>Du, Bowei ; Huang, Yecheng ; Chen, Jiaxin ; Huang, Di</creatorcontrib><description>Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for compact foreground coverage, promoting both the accuracy and efficiency. Extensive experimental results on two major benchmarks, i.e. VisDrone and UAVDT, demonstrate that CEASC remarkably reduces the GFLOPs and accelerates the inference procedure when plugging into the typical state-of-the-art detection frameworks (e.g. RetinaNet and GFL V1) with competitive performance. Code is available at https://github.com/Cuogeihong/CEASC.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Context ; Image enhancement ; Multilayers ; Network latency ; Object recognition ; Optimization ; Unmanned aerial vehicles</subject><ispartof>arXiv.org, 2023-03</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2791775873?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Du, Bowei</creatorcontrib><creatorcontrib>Huang, Yecheng</creatorcontrib><creatorcontrib>Chen, Jiaxin</creatorcontrib><creatorcontrib>Huang, Di</creatorcontrib><title>Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images</title><title>arXiv.org</title><description>Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for compact foreground coverage, promoting both the accuracy and efficiency. Extensive experimental results on two major benchmarks, i.e. VisDrone and UAVDT, demonstrate that CEASC remarkably reduces the GFLOPs and accelerates the inference procedure when plugging into the typical state-of-the-art detection frameworks (e.g. RetinaNet and GFL V1) with competitive performance. Code is available at https://github.com/Cuogeihong/CEASC.</description><subject>Context</subject><subject>Image enhancement</subject><subject>Multilayers</subject><subject>Network latency</subject><subject>Object recognition</subject><subject>Optimization</subject><subject>Unmanned aerial vehicles</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjc0OAUEQhCcSCcE7dOIsWTPWcBT_Fw7cZdDYtabXdC8e3xw8gKSSL6l8qaqppjam3xsNtG6oDnOeJIkeWp2mpqnC5OxKyV4Iu9IFRpiSf1FRSUbeFbBBeVO4M7wzucGyoGMsoyL4EZj7m_MnfKAXuFCAhWPBANtjjieBGUpEnIGYWSCPsH64K3Jb1S-uYOz82FLdxXw_XfXKQM8KWQ45VSG-80Hbcd_adGSN-c_6AqHMS7I</recordid><startdate>20230325</startdate><enddate>20230325</enddate><creator>Du, Bowei</creator><creator>Huang, Yecheng</creator><creator>Chen, Jiaxin</creator><creator>Huang, Di</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230325</creationdate><title>Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images</title><author>Du, Bowei ; Huang, Yecheng ; Chen, Jiaxin ; Huang, Di</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_27917758733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Context</topic><topic>Image enhancement</topic><topic>Multilayers</topic><topic>Network latency</topic><topic>Object recognition</topic><topic>Optimization</topic><topic>Unmanned aerial vehicles</topic><toplevel>online_resources</toplevel><creatorcontrib>Du, Bowei</creatorcontrib><creatorcontrib>Huang, Yecheng</creatorcontrib><creatorcontrib>Chen, Jiaxin</creatorcontrib><creatorcontrib>Huang, Di</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Du, Bowei</au><au>Huang, Yecheng</au><au>Chen, Jiaxin</au><au>Huang, Di</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images</atitle><jtitle>arXiv.org</jtitle><date>2023-03-25</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for compact foreground coverage, promoting both the accuracy and efficiency. Extensive experimental results on two major benchmarks, i.e. VisDrone and UAVDT, demonstrate that CEASC remarkably reduces the GFLOPs and accelerates the inference procedure when plugging into the typical state-of-the-art detection frameworks (e.g. RetinaNet and GFL V1) with competitive performance. Code is available at https://github.com/Cuogeihong/CEASC.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2023-03
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2791775873
source	Publicly Available Content Database
subjects	Context Image enhancement Multilayers Network latency Object recognition Optimization Unmanned aerial vehicles
title	Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T16%3A31%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Adaptive%20Sparse%20Convolutional%20Networks%20with%20Global%20Context%20Enhancement%20for%20Faster%20Object%20Detection%20on%20Drone%20Images&rft.jtitle=arXiv.org&rft.au=Du,%20Bowei&rft.date=2023-03-25&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2791775873%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_27917758733%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2791775873&rft_id=info:pmid/&rfr_iscdi=true