Loading…

AMFLW-YOLO: A Lightweight Network for Remote Sensing Image Detection Based on Attention Mechanism and Multiscale Feature Fusion

The scale of targets in remote sensing images varies greatly and is diverse. It has many small targets that are distributed densely and high complexity of image background. The number of network model parameters and the computation amount of the object detection algorithms based on deep learning is...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on geoscience and remote sensing 2023, Vol.61, p.1-16
Main Authors: Peng, Guili, Yang, Zijian, Wang, Shoubin, Zhou, Yuan
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3
cites cdi_FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3
container_end_page 16
container_issue
container_start_page 1
container_title IEEE transactions on geoscience and remote sensing
container_volume 61
creator Peng, Guili
Yang, Zijian
Wang, Shoubin
Zhou, Yuan
description The scale of targets in remote sensing images varies greatly and is diverse. It has many small targets that are distributed densely and high complexity of image background. The number of network model parameters and the computation amount of the object detection algorithms based on deep learning is huge. It is difficult to apply them on a platform with fixed performance and limited computing resources. A lightweight remote sensing object detection model is proposed in this article, which called attention and multiscale feature fusion lightweight-YOLO (AMFLW-YOLO). The deep separable convolution, inverted residual, and linear bottleneck structure are employed to replace the standard convolution layer to reduce the model parameters in the backbone network of the model. The coordinate attention (CA) mechanism is introduced into the feature fusion network to capture the direction- and location-aware information across channels at the same time, which improves the accuracy of the network. The bidirectional feature pyramid network (BiFPN) structure is employed to strengthen feature extraction. The learnable weights are introduced to learn the importance of different input features. The multiscale feature fusion is applied to improve the detection effect. The experimental results show that the algorithm achieves satisfactory performance in terms of efficiency and accuracy and has advantages in detection accuracy and model lightweight.
doi_str_mv 10.1109/TGRS.2023.3327285
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_10295543</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10295543</ieee_id><sourcerecordid>2887108843</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3</originalsourceid><addsrcrecordid>eNpNUMlOwzAQtRBIlOUDkDhY4pziJd64laUFKaUSixCnyDiTEmgTsB0hTvw6Lu2Bw8wbjd57o3kIHVEypJSY04fJ3f2QEcaHnDPFtNhCAyqEzojM8200INTIjGnDdtFeCG-E0FxQNUA_o-m4eMqeZ8XsDI9w0cxf4xesOr6F-NX5d1x3Ht_BsouA76ENTTvHN0s7B3wJEVxsuhaf2wAVTsMoRmj_VlNwr7ZtwhLbtsLTfhGb4OwC8Bhs7H3CPiTeAdqp7SLA4Qb30eP46uHiOitmk5uLUZE5ZvKYca6dII5W2hlmQCnFQUhO6_SiUk5KKyudquLSKCmstNRKZTU4Z16qGvg-Oln7fvjus4cQy7eu9206WTKtFSVa5zyx6JrlfBeCh7r88M3S-u-SknKVc7nKuVzlXG5yTprjtaYBgH98ZoRIlr8hJXml</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2887108843</pqid></control><display><type>article</type><title>AMFLW-YOLO: A Lightweight Network for Remote Sensing Image Detection Based on Attention Mechanism and Multiscale Feature Fusion</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Peng, Guili ; Yang, Zijian ; Wang, Shoubin ; Zhou, Yuan</creator><creatorcontrib>Peng, Guili ; Yang, Zijian ; Wang, Shoubin ; Zhou, Yuan</creatorcontrib><description>The scale of targets in remote sensing images varies greatly and is diverse. It has many small targets that are distributed densely and high complexity of image background. The number of network model parameters and the computation amount of the object detection algorithms based on deep learning is huge. It is difficult to apply them on a platform with fixed performance and limited computing resources. A lightweight remote sensing object detection model is proposed in this article, which called attention and multiscale feature fusion lightweight-YOLO (AMFLW-YOLO). The deep separable convolution, inverted residual, and linear bottleneck structure are employed to replace the standard convolution layer to reduce the model parameters in the backbone network of the model. The coordinate attention (CA) mechanism is introduced into the feature fusion network to capture the direction- and location-aware information across channels at the same time, which improves the accuracy of the network. The bidirectional feature pyramid network (BiFPN) structure is employed to strengthen feature extraction. The learnable weights are introduced to learn the importance of different input features. The multiscale feature fusion is applied to improve the detection effect. The experimental results show that the algorithm achieves satisfactory performance in terms of efficiency and accuracy and has advantages in detection accuracy and model lightweight.</description><identifier>ISSN: 0196-2892</identifier><identifier>EISSN: 1558-0644</identifier><identifier>DOI: 10.1109/TGRS.2023.3327285</identifier><identifier>CODEN: IGRSD2</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Accuracy ; Algorithms ; Attention mechanism ; Computation ; Computer networks ; Convolution ; Convolutional neural networks ; Deep learning ; Detection ; Feature extraction ; feature fusion ; Image detection ; Lightweight ; lightweight network ; Machine learning ; Mathematical models ; Model accuracy ; Object detection ; Object recognition ; Optical sensors ; Parameters ; Remote sensing ; remote sensing image ; Sensors ; Task analysis</subject><ispartof>IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-16</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3</citedby><cites>FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3</cites><orcidid>0000-0002-9845-5251 ; 0000-0002-0322-2982 ; 0000-0002-3920-7303 ; 0009-0001-8899-6134</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10295543$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,4024,27923,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Peng, Guili</creatorcontrib><creatorcontrib>Yang, Zijian</creatorcontrib><creatorcontrib>Wang, Shoubin</creatorcontrib><creatorcontrib>Zhou, Yuan</creatorcontrib><title>AMFLW-YOLO: A Lightweight Network for Remote Sensing Image Detection Based on Attention Mechanism and Multiscale Feature Fusion</title><title>IEEE transactions on geoscience and remote sensing</title><addtitle>TGRS</addtitle><description>The scale of targets in remote sensing images varies greatly and is diverse. It has many small targets that are distributed densely and high complexity of image background. The number of network model parameters and the computation amount of the object detection algorithms based on deep learning is huge. It is difficult to apply them on a platform with fixed performance and limited computing resources. A lightweight remote sensing object detection model is proposed in this article, which called attention and multiscale feature fusion lightweight-YOLO (AMFLW-YOLO). The deep separable convolution, inverted residual, and linear bottleneck structure are employed to replace the standard convolution layer to reduce the model parameters in the backbone network of the model. The coordinate attention (CA) mechanism is introduced into the feature fusion network to capture the direction- and location-aware information across channels at the same time, which improves the accuracy of the network. The bidirectional feature pyramid network (BiFPN) structure is employed to strengthen feature extraction. The learnable weights are introduced to learn the importance of different input features. The multiscale feature fusion is applied to improve the detection effect. The experimental results show that the algorithm achieves satisfactory performance in terms of efficiency and accuracy and has advantages in detection accuracy and model lightweight.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Attention mechanism</subject><subject>Computation</subject><subject>Computer networks</subject><subject>Convolution</subject><subject>Convolutional neural networks</subject><subject>Deep learning</subject><subject>Detection</subject><subject>Feature extraction</subject><subject>feature fusion</subject><subject>Image detection</subject><subject>Lightweight</subject><subject>lightweight network</subject><subject>Machine learning</subject><subject>Mathematical models</subject><subject>Model accuracy</subject><subject>Object detection</subject><subject>Object recognition</subject><subject>Optical sensors</subject><subject>Parameters</subject><subject>Remote sensing</subject><subject>remote sensing image</subject><subject>Sensors</subject><subject>Task analysis</subject><issn>0196-2892</issn><issn>1558-0644</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpNUMlOwzAQtRBIlOUDkDhY4pziJd64laUFKaUSixCnyDiTEmgTsB0hTvw6Lu2Bw8wbjd57o3kIHVEypJSY04fJ3f2QEcaHnDPFtNhCAyqEzojM8200INTIjGnDdtFeCG-E0FxQNUA_o-m4eMqeZ8XsDI9w0cxf4xesOr6F-NX5d1x3Ht_BsouA76ENTTvHN0s7B3wJEVxsuhaf2wAVTsMoRmj_VlNwr7ZtwhLbtsLTfhGb4OwC8Bhs7H3CPiTeAdqp7SLA4Qb30eP46uHiOitmk5uLUZE5ZvKYca6dII5W2hlmQCnFQUhO6_SiUk5KKyudquLSKCmstNRKZTU4Z16qGvg-Oln7fvjus4cQy7eu9206WTKtFSVa5zyx6JrlfBeCh7r88M3S-u-SknKVc7nKuVzlXG5yTprjtaYBgH98ZoRIlr8hJXml</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Peng, Guili</creator><creator>Yang, Zijian</creator><creator>Wang, Shoubin</creator><creator>Zhou, Yuan</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7UA</scope><scope>8FD</scope><scope>C1K</scope><scope>F1W</scope><scope>FR3</scope><scope>H8D</scope><scope>H96</scope><scope>KR7</scope><scope>L.G</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0002-9845-5251</orcidid><orcidid>https://orcid.org/0000-0002-0322-2982</orcidid><orcidid>https://orcid.org/0000-0002-3920-7303</orcidid><orcidid>https://orcid.org/0009-0001-8899-6134</orcidid></search><sort><creationdate>2023</creationdate><title>AMFLW-YOLO: A Lightweight Network for Remote Sensing Image Detection Based on Attention Mechanism and Multiscale Feature Fusion</title><author>Peng, Guili ; Yang, Zijian ; Wang, Shoubin ; Zhou, Yuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Attention mechanism</topic><topic>Computation</topic><topic>Computer networks</topic><topic>Convolution</topic><topic>Convolutional neural networks</topic><topic>Deep learning</topic><topic>Detection</topic><topic>Feature extraction</topic><topic>feature fusion</topic><topic>Image detection</topic><topic>Lightweight</topic><topic>lightweight network</topic><topic>Machine learning</topic><topic>Mathematical models</topic><topic>Model accuracy</topic><topic>Object detection</topic><topic>Object recognition</topic><topic>Optical sensors</topic><topic>Parameters</topic><topic>Remote sensing</topic><topic>remote sensing image</topic><topic>Sensors</topic><topic>Task analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Peng, Guili</creatorcontrib><creatorcontrib>Yang, Zijian</creatorcontrib><creatorcontrib>Wang, Shoubin</creatorcontrib><creatorcontrib>Zhou, Yuan</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library Online</collection><collection>CrossRef</collection><collection>Water Resources Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>Civil Engineering Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on geoscience and remote sensing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Peng, Guili</au><au>Yang, Zijian</au><au>Wang, Shoubin</au><au>Zhou, Yuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>AMFLW-YOLO: A Lightweight Network for Remote Sensing Image Detection Based on Attention Mechanism and Multiscale Feature Fusion</atitle><jtitle>IEEE transactions on geoscience and remote sensing</jtitle><stitle>TGRS</stitle><date>2023</date><risdate>2023</risdate><volume>61</volume><spage>1</spage><epage>16</epage><pages>1-16</pages><issn>0196-2892</issn><eissn>1558-0644</eissn><coden>IGRSD2</coden><abstract>The scale of targets in remote sensing images varies greatly and is diverse. It has many small targets that are distributed densely and high complexity of image background. The number of network model parameters and the computation amount of the object detection algorithms based on deep learning is huge. It is difficult to apply them on a platform with fixed performance and limited computing resources. A lightweight remote sensing object detection model is proposed in this article, which called attention and multiscale feature fusion lightweight-YOLO (AMFLW-YOLO). The deep separable convolution, inverted residual, and linear bottleneck structure are employed to replace the standard convolution layer to reduce the model parameters in the backbone network of the model. The coordinate attention (CA) mechanism is introduced into the feature fusion network to capture the direction- and location-aware information across channels at the same time, which improves the accuracy of the network. The bidirectional feature pyramid network (BiFPN) structure is employed to strengthen feature extraction. The learnable weights are introduced to learn the importance of different input features. The multiscale feature fusion is applied to improve the detection effect. The experimental results show that the algorithm achieves satisfactory performance in terms of efficiency and accuracy and has advantages in detection accuracy and model lightweight.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TGRS.2023.3327285</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0002-9845-5251</orcidid><orcidid>https://orcid.org/0000-0002-0322-2982</orcidid><orcidid>https://orcid.org/0000-0002-3920-7303</orcidid><orcidid>https://orcid.org/0009-0001-8899-6134</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0196-2892
ispartof IEEE transactions on geoscience and remote sensing, 2023, Vol.61, p.1-16
issn 0196-2892
1558-0644
language eng
recordid cdi_ieee_primary_10295543
source IEEE Electronic Library (IEL) Journals
subjects Accuracy
Algorithms
Attention mechanism
Computation
Computer networks
Convolution
Convolutional neural networks
Deep learning
Detection
Feature extraction
feature fusion
Image detection
Lightweight
lightweight network
Machine learning
Mathematical models
Model accuracy
Object detection
Object recognition
Optical sensors
Parameters
Remote sensing
remote sensing image
Sensors
Task analysis
title AMFLW-YOLO: A Lightweight Network for Remote Sensing Image Detection Based on Attention Mechanism and Multiscale Feature Fusion
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T17%3A24%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AMFLW-YOLO:%20A%20Lightweight%20Network%20for%20Remote%20Sensing%20Image%20Detection%20Based%20on%20Attention%20Mechanism%20and%20Multiscale%20Feature%20Fusion&rft.jtitle=IEEE%20transactions%20on%20geoscience%20and%20remote%20sensing&rft.au=Peng,%20Guili&rft.date=2023&rft.volume=61&rft.spage=1&rft.epage=16&rft.pages=1-16&rft.issn=0196-2892&rft.eissn=1558-0644&rft.coden=IGRSD2&rft_id=info:doi/10.1109/TGRS.2023.3327285&rft_dat=%3Cproquest_ieee_%3E2887108843%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c294t-338c50c1d8c929e7773e5631f55877c66a6d8a6dd369765a6a1a67a8ecc9bdfe3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2887108843&rft_id=info:pmid/&rft_ieee_id=10295543&rfr_iscdi=true