Loading…
Efficient Multi-scale Network for Semantic Segmentation of fine-Resolution Remotely Sensed Images
Semantic segmentation of remote sensing urban scene images has diverse practical applications, including land cover mapping, urban change detection, environmental protection, and economic evaluation. However, classical semantic segmentation networks encounter challenges such as inadequate utilizatio...
Saved in:
Published in: | Measurement science & technology 2024-09 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Semantic segmentation of remote sensing urban scene images has diverse practical applications, including land cover mapping, urban change detection, environmental protection, and economic evaluation. However, classical semantic segmentation networks encounter challenges such as inadequate utilization of multi-scale semantic information and imprecise edge target segmentation in high-resolution remote sensing images. In response, this article introduces an Efficient Multi-scale Network (EMNet) tailored for semantic segmentation of common features in remote sensing images.To address these challenges, EMNet integrates several key components. Firstly, the Efficient Atrous Spatial Pyramid Pooling module(EASPP) is employed to enhance the relevance of multi-scale targets, facilitating improved extraction and processing of context information across different scales. Secondly, the Efficient Multi-Scale Attention mechanism (EMA)and multi-scale jump connections are utilized to fuse semantic features from various levels, thereby achieving precise segmentation boundaries and accurate position information. Finally, an encoder-decoder structure is incorporated to refine the segmentation results.The effectiveness of the proposed network is validated through experiments conducted on the publicly available DroneDeploy image dataset. Results indicate that EMNet achieves impressive performance metrics, with Mean Intersection over Union (MIoU), Mean Pixel Accuracy (MPA), Mean Precision (MPrecision), and Mean Recall (MRecall) reaching 75.99%, 85.08%, 86.76%, and 85.07%, respectively. Comparative analysis demonstrates that the network proposed in this article outperforms current mainstream semantic segmentation networks on the DroneDeploy dataset. |
---|---|
ISSN: | 0957-0233 1361-6501 |
DOI: | 10.1088/1361-6501/ad50fa |