Loading…

Construction of a feature enhancement network for small object detection

•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-past...

Full description

Saved in:

Bibliographic Details
Published in:	Pattern recognition 2023-11, Vol.143, p.109801, Article 109801
Main Authors:	Zhang, Hongyun, Li, Miao, Miao, Duoqian, Pedrycz, Witold, Wang, Zhaoguo, Jiang, Minghui
Format:	Article
Language:	English
Subjects:	Collision detection FENet Granular computing High-Resolution block HR-FPN Small object detection
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143
cites	cdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143
container_end_page
container_issue
container_start_page	109801
container_title	Pattern recognition
container_volume	143
creator	Zhang, Hongyun Li, Miao Miao, Duoqian Pedrycz, Witold Wang, Zhaoguo Jiang, Minghui
description	•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compa
doi_str_mv	10.1016/j.patcog.2023.109801
format	article
fullrecord	<record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_patcog_2023_109801</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0031320323004995</els_id><sourcerecordid>S0031320323004995</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</originalsourceid><addsrcrecordid>eNp9kMtOwzAQRb0AiVL4Axb-gYTxo06yQUIR0EqV2MDacuwxJLRxZbsg_p6UsGY10kjn6t5DyA2DkgFTt0N5MNmGt5IDF9OrqYGdkQWAYIXgIC7IZUoDAKuY5AuybsOYcjza3IeRBk8N9WjyMSLF8d2MFvc4Zjpi_grxg_oQadqb3Y6GbkCbqcOMv-wVOfdml_D67y7J6-PDS7suts9Pm_Z-W1gBKhcrrxrXoZHGWSMqAzXnHfcVl9ZKLitZr0SjhHXC1g1D5ayqEZh3yviuY1IsiZxzbQwpRfT6EPu9id-agT4Z0IOeDeiTAT0bmLC7GcOp22ePUSfb4zTP9XEaoF3o_w_4ARSyabM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Construction of a feature enhancement network for small object detection</title><source>Elsevier</source><creator>Zhang, Hongyun ; Li, Miao ; Miao, Duoqian ; Pedrycz, Witold ; Wang, Zhaoguo ; Jiang, Minghui</creator><creatorcontrib>Zhang, Hongyun ; Li, Miao ; Miao, Duoqian ; Pedrycz, Witold ; Wang, Zhaoguo ; Jiang, Minghui</creatorcontrib><description>•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compared with performance of other methods when applied to the publicly available COCO dataset, VisDrone dataset and TinyPerson dataset. The code is available at https://github.com/cowarder/FENet.</description><identifier>ISSN: 0031-3203</identifier><identifier>DOI: 10.1016/j.patcog.2023.109801</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Collision detection ; FENet ; Granular computing ; High-Resolution block ; HR-FPN ; Small object detection</subject><ispartof>Pattern recognition, 2023-11, Vol.143, p.109801, Article 109801</ispartof><rights>2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</citedby><cites>FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27900,27901</link.rule.ids></links><search><creatorcontrib>Zhang, Hongyun</creatorcontrib><creatorcontrib>Li, Miao</creatorcontrib><creatorcontrib>Miao, Duoqian</creatorcontrib><creatorcontrib>Pedrycz, Witold</creatorcontrib><creatorcontrib>Wang, Zhaoguo</creatorcontrib><creatorcontrib>Jiang, Minghui</creatorcontrib><title>Construction of a feature enhancement network for small object detection</title><title>Pattern recognition</title><description>•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compared with performance of other methods when applied to the publicly available COCO dataset, VisDrone dataset and TinyPerson dataset. The code is available at https://github.com/cowarder/FENet.</description><subject>Collision detection</subject><subject>FENet</subject><subject>Granular computing</subject><subject>High-Resolution block</subject><subject>HR-FPN</subject><subject>Small object detection</subject><issn>0031-3203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOwzAQRb0AiVL4Axb-gYTxo06yQUIR0EqV2MDacuwxJLRxZbsg_p6UsGY10kjn6t5DyA2DkgFTt0N5MNmGt5IDF9OrqYGdkQWAYIXgIC7IZUoDAKuY5AuybsOYcjza3IeRBk8N9WjyMSLF8d2MFvc4Zjpi_grxg_oQadqb3Y6GbkCbqcOMv-wVOfdml_D67y7J6-PDS7suts9Pm_Z-W1gBKhcrrxrXoZHGWSMqAzXnHfcVl9ZKLitZr0SjhHXC1g1D5ayqEZh3yviuY1IsiZxzbQwpRfT6EPu9id-agT4Z0IOeDeiTAT0bmLC7GcOp22ePUSfb4zTP9XEaoF3o_w_4ARSyabM</recordid><startdate>202311</startdate><enddate>202311</enddate><creator>Zhang, Hongyun</creator><creator>Li, Miao</creator><creator>Miao, Duoqian</creator><creator>Pedrycz, Witold</creator><creator>Wang, Zhaoguo</creator><creator>Jiang, Minghui</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>202311</creationdate><title>Construction of a feature enhancement network for small object detection</title><author>Zhang, Hongyun ; Li, Miao ; Miao, Duoqian ; Pedrycz, Witold ; Wang, Zhaoguo ; Jiang, Minghui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Collision detection</topic><topic>FENet</topic><topic>Granular computing</topic><topic>High-Resolution block</topic><topic>HR-FPN</topic><topic>Small object detection</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Hongyun</creatorcontrib><creatorcontrib>Li, Miao</creatorcontrib><creatorcontrib>Miao, Duoqian</creatorcontrib><creatorcontrib>Pedrycz, Witold</creatorcontrib><creatorcontrib>Wang, Zhaoguo</creatorcontrib><creatorcontrib>Jiang, Minghui</creatorcontrib><collection>CrossRef</collection><jtitle>Pattern recognition</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Hongyun</au><au>Li, Miao</au><au>Miao, Duoqian</au><au>Pedrycz, Witold</au><au>Wang, Zhaoguo</au><au>Jiang, Minghui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Construction of a feature enhancement network for small object detection</atitle><jtitle>Pattern recognition</jtitle><date>2023-11</date><risdate>2023</risdate><volume>143</volume><spage>109801</spage><pages>109801-</pages><artnum>109801</artnum><issn>0031-3203</issn><abstract>•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compared with performance of other methods when applied to the publicly available COCO dataset, VisDrone dataset and TinyPerson dataset. The code is available at https://github.com/cowarder/FENet.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.patcog.2023.109801</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 0031-3203
ispartof	Pattern recognition, 2023-11, Vol.143, p.109801, Article 109801
issn	0031-3203
language	eng
recordid	cdi_crossref_primary_10_1016_j_patcog_2023_109801
source	Elsevier
subjects	Collision detection FENet Granular computing High-Resolution block HR-FPN Small object detection
title	Construction of a feature enhancement network for small object detection
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-24T21%3A17%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Construction%20of%20a%20feature%20enhancement%20network%20for%20small%20object%20detection&rft.jtitle=Pattern%20recognition&rft.au=Zhang,%20Hongyun&rft.date=2023-11&rft.volume=143&rft.spage=109801&rft.pages=109801-&rft.artnum=109801&rft.issn=0031-3203&rft_id=info:doi/10.1016/j.patcog.2023.109801&rft_dat=%3Celsevier_cross%3ES0031320323004995%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true