Building extraction based on hyperspectral remote sensing images and semisupervised deep learning with limited training samples

Bibliographic Details
Published in: Computers & electrical engineering 2023-09, Vol.110, p.108851, Article 108851
Main Authors: Hui, He; Ya-Dong, Sun; Bo-Xiong, Yang; Mu-Xi, Xie; She-Lei, Li; Bo, Zhou; Kai-Cun, Zhang
Format: Article
Language: English
Description
Summary: Hyperspectral remote sensing imaging supports applications such as urban building information statistics and green vegetation estimation. Ensuring the accuracy of automatic thematic information extraction when training samples are limited remains a challenge. This manuscript proposes a lightweight semantic segmentation model with an “encoder-decoder” structure for extracting buildings from hyperspectral remote sensing images. The encoder combines the lightweight MobileNet with multiscale feature fusion and a group dilated convolution to model both shallow and deep spatial and spectral features, and an efficient combined standardized attention mechanism selects the most valuable bands and local information. Extensive experiments show that the method achieves higher accuracy than state-of-the-art lightweight models in building extraction tasks, and that it retains its advantage when training samples are insufficient. When only 50% of the samples of the initial training set were used, the mean intersection over union (mIOU) reached 91.90%, 4.5% higher than that of the next best method. For training sets composed of only 16 and 8 images, the mIOU values were 89.42% and 77.11%, respectively, 13.6 and 18 percentage points higher than those of the next best method. Visualization of the results likewise shows the proposed method clearly outperforming the compared methods. The proposed model is suitable for accurately extracting buildings from hyperspectral images when training samples are limited.
ISSN: 0045-7906; 1879-0755
DOI: 10.1016/j.compeleceng.2023.108851
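
The summary reports its results as mean intersection over union (mIOU). For readers unfamiliar with the metric, the following is a minimal sketch of the standard definition: the per-class ratio of intersection to union, averaged over classes. It is not the authors' evaluation code. The function name `mean_iou`, the two-class labelling (0 = background, 1 = building), and the toy masks are assumptions made for illustration only.

```python
def mean_iou(pred, target, num_classes=2):
    """Mean intersection over union for two label masks of equal shape.

    pred and target are nested lists of integer class labels.
    Assumption for this sketch: 0 = background, 1 = building.
    Classes absent from both masks are skipped rather than scored.
    """
    ious = []
    for c in range(num_classes):
        intersection = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred = p == c
                in_target = t == c
                if in_pred and in_target:
                    intersection += 1
                if in_pred or in_target:
                    union += 1
        if union > 0:
            ious.append(intersection / union)
    if not ious:
        raise ValueError("no labelled pixels in the range of num_classes")
    return sum(ious) / len(ious)


# Toy 2x3 masks (hypothetical data, not taken from the paper).
pred = [[1, 1, 0],
        [0, 0, 0]]
target = [[1, 0, 0],
          [0, 0, 1]]

# Building IoU is 1/3, background IoU is 3/5; mIOU is their average.
score = mean_iou(pred, target)
```

Because the score averages per-class ratios, a model that misses most building pixels is penalized even when the background dominates the image, which is why the metric suits building extraction.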