Loading…

Construction of a feature enhancement network for small object detection

•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-past...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition 2023-11, Vol.143, p.109801, Article 109801
Main Authors: Zhang, Hongyun, Li, Miao, Miao, Duoqian, Pedrycz, Witold, Wang, Zhaoguo, Jiang, Minghui
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143
cites cdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143
container_end_page
container_issue
container_start_page 109801
container_title Pattern recognition
container_volume 143
creator Zhang, Hongyun
Li, Miao
Miao, Duoqian
Pedrycz, Witold
Wang, Zhaoguo
Jiang, Minghui
description •To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compa
doi_str_mv 10.1016/j.patcog.2023.109801
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_patcog_2023_109801</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0031320323004995</els_id><sourcerecordid>S0031320323004995</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</originalsourceid><addsrcrecordid>eNp9kMtOwzAQRb0AiVL4Axb-gYTxo06yQUIR0EqV2MDacuwxJLRxZbsg_p6UsGY10kjn6t5DyA2DkgFTt0N5MNmGt5IDF9OrqYGdkQWAYIXgIC7IZUoDAKuY5AuybsOYcjza3IeRBk8N9WjyMSLF8d2MFvc4Zjpi_grxg_oQadqb3Y6GbkCbqcOMv-wVOfdml_D67y7J6-PDS7suts9Pm_Z-W1gBKhcrrxrXoZHGWSMqAzXnHfcVl9ZKLitZr0SjhHXC1g1D5ayqEZh3yviuY1IsiZxzbQwpRfT6EPu9id-agT4Z0IOeDeiTAT0bmLC7GcOp22ePUSfb4zTP9XEaoF3o_w_4ARSyabM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Construction of a feature enhancement network for small object detection</title><source>Elsevier</source><creator>Zhang, Hongyun ; Li, Miao ; Miao, Duoqian ; Pedrycz, Witold ; Wang, Zhaoguo ; Jiang, Minghui</creator><creatorcontrib>Zhang, Hongyun ; Li, Miao ; Miao, Duoqian ; Pedrycz, Witold ; Wang, Zhaoguo ; Jiang, Minghui</creatorcontrib><description>•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compared with performance of other methods when applied to the publicly available COCO dataset, VisDrone dataset and TinyPerson dataset. The code is available at https://github.com/cowarder/FENet.</description><identifier>ISSN: 0031-3203</identifier><identifier>DOI: 10.1016/j.patcog.2023.109801</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Collision detection ; FENet ; Granular computing ; High-Resolution block ; HR-FPN ; Small object detection</subject><ispartof>Pattern recognition, 2023-11, Vol.143, p.109801, Article 109801</ispartof><rights>2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</citedby><cites>FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27900,27901</link.rule.ids></links><search><creatorcontrib>Zhang, Hongyun</creatorcontrib><creatorcontrib>Li, Miao</creatorcontrib><creatorcontrib>Miao, Duoqian</creatorcontrib><creatorcontrib>Pedrycz, Witold</creatorcontrib><creatorcontrib>Wang, Zhaoguo</creatorcontrib><creatorcontrib>Jiang, Minghui</creatorcontrib><title>Construction of a feature enhancement network for small object detection</title><title>Pattern recognition</title><description>•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compared with performance of other methods when applied to the publicly available COCO dataset, VisDrone dataset and TinyPerson dataset. The code is available at https://github.com/cowarder/FENet.</description><subject>Collision detection</subject><subject>FENet</subject><subject>Granular computing</subject><subject>High-Resolution block</subject><subject>HR-FPN</subject><subject>Small object detection</subject><issn>0031-3203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOwzAQRb0AiVL4Axb-gYTxo06yQUIR0EqV2MDacuwxJLRxZbsg_p6UsGY10kjn6t5DyA2DkgFTt0N5MNmGt5IDF9OrqYGdkQWAYIXgIC7IZUoDAKuY5AuybsOYcjza3IeRBk8N9WjyMSLF8d2MFvc4Zjpi_grxg_oQadqb3Y6GbkCbqcOMv-wVOfdml_D67y7J6-PDS7suts9Pm_Z-W1gBKhcrrxrXoZHGWSMqAzXnHfcVl9ZKLitZr0SjhHXC1g1D5ayqEZh3yviuY1IsiZxzbQwpRfT6EPu9id-agT4Z0IOeDeiTAT0bmLC7GcOp22ePUSfb4zTP9XEaoF3o_w_4ARSyabM</recordid><startdate>202311</startdate><enddate>202311</enddate><creator>Zhang, Hongyun</creator><creator>Li, Miao</creator><creator>Miao, Duoqian</creator><creator>Pedrycz, Witold</creator><creator>Wang, Zhaoguo</creator><creator>Jiang, Minghui</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>202311</creationdate><title>Construction of a feature enhancement network for small object detection</title><author>Zhang, Hongyun ; Li, Miao ; Miao, Duoqian ; Pedrycz, Witold ; Wang, Zhaoguo ; Jiang, Minghui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Collision detection</topic><topic>FENet</topic><topic>Granular computing</topic><topic>High-Resolution block</topic><topic>HR-FPN</topic><topic>Small object detection</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Hongyun</creatorcontrib><creatorcontrib>Li, Miao</creatorcontrib><creatorcontrib>Miao, Duoqian</creatorcontrib><creatorcontrib>Pedrycz, Witold</creatorcontrib><creatorcontrib>Wang, Zhaoguo</creatorcontrib><creatorcontrib>Jiang, Minghui</creatorcontrib><collection>CrossRef</collection><jtitle>Pattern recognition</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Hongyun</au><au>Li, Miao</au><au>Miao, Duoqian</au><au>Pedrycz, Witold</au><au>Wang, Zhaoguo</au><au>Jiang, Minghui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Construction of a feature enhancement network for small object detection</atitle><jtitle>Pattern recognition</jtitle><date>2023-11</date><risdate>2023</risdate><volume>143</volume><spage>109801</spage><pages>109801-</pages><artnum>109801</artnum><issn>0031-3203</issn><abstract>•To more effectively expand the possibility of small objects appearing, we improve current copy-paste based data augmentation method (CDCI) by introducing collision detection and spatial context position extension to avoid object collision and incorrect context information caused by random copy-paste.•To solve the problem that the small objects are vulnerable to scale variation, we construct a multi-granular deformable convolution network to learn and capture the changes in the shape and scale of the object, and offset feature representations in different granularity are acquire by granulating and fusing the offset features.•A high-resolution block (HR block) is designed to bring more semantic information while maintaining high-resolution features, and high-resolution block-based Feature Pyramid is built by parallel embedding HR block in FPN to further enhancing the feature representation.•A large number of experiments are reported to demonstrate the effectiveness of the proposed method. At the same time, we set up ablation experiments to analyze the rationality of proposed different strategies. Limited by the size, location, number of samples and other factors of the small object itself, the small object is usually insufficient, which degrades the performance of the small object detection algorithms. To address this issue, we construct a novel Feature Enhancement Network (FENet) to improve the performance of small object detection. Firstly, an improved data augmentation method based on collision detection and spatial context extension (CDCI) is proposed to effectively expand the possibility of small object detection. Then, based on the idea of Granular Computing, a multi-granular deformable convolution network is constructed to acquire the offset feature representation at the different granularity levels. Finally, we design a high-resolution block (HR block) and build High-Resolution Block-based Feature Pyramid by parallel embedding HR block in FPN (HR-FPN) to make full use different granularity and resolution features. By above strategies, FENet can acquire sufficient feature information of small objects. In this paper, we firstly applied the multi-granularity deformable convolution to feature extraction of small objects. Meanwhile, a new feature fusion module is constructed by optimizing feature pyramid to maintain the detailed features and enrich the semantic information of small objects. Experiments show that FENet achieves excellent performance compared with performance of other methods when applied to the publicly available COCO dataset, VisDrone dataset and TinyPerson dataset. The code is available at https://github.com/cowarder/FENet.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.patcog.2023.109801</doi></addata></record>
fulltext fulltext
identifier ISSN: 0031-3203
ispartof Pattern recognition, 2023-11, Vol.143, p.109801, Article 109801
issn 0031-3203
language eng
recordid cdi_crossref_primary_10_1016_j_patcog_2023_109801
source Elsevier
subjects Collision detection
FENet
Granular computing
High-Resolution block
HR-FPN
Small object detection
title Construction of a feature enhancement network for small object detection
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-24T21%3A17%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Construction%20of%20a%20feature%20enhancement%20network%20for%20small%20object%20detection&rft.jtitle=Pattern%20recognition&rft.au=Zhang,%20Hongyun&rft.date=2023-11&rft.volume=143&rft.spage=109801&rft.pages=109801-&rft.artnum=109801&rft.issn=0031-3203&rft_id=info:doi/10.1016/j.patcog.2023.109801&rft_dat=%3Celsevier_cross%3ES0031320323004995%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-5f69dbea4adca37a0822b2f724cc42474853963cd3c891e6dc68e01fd6afbb143%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true