Loading…

SWHFormer: A Vision Transformer for Significant Wave Height Estimation From Nautical Radar Images

This paper presented a novel significant wave height (SWH) estimation method, SWHFormer, which incorporates the Vision Transformer (ViT) to estimate SWH from X-band nautical radar images. Unlike traditional convolutional neural networks, the ViT model treats the input as a sequence, capitalizing on...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on geoscience and remote sensing 2024-01, Vol.62, p.1-1
Main Authors: Yang, Zhiding, Huang, Weimin
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presented a novel significant wave height (SWH) estimation method, SWHFormer, which incorporates the Vision Transformer (ViT) to estimate SWH from X-band nautical radar images. Unlike traditional convolutional neural networks, the ViT model treats the input as a sequence, capitalizing on its attention mechanism to capture long-range dependencies, resulting in superior performance in capturing the complex patterns present in sea wave dynamics. The radar data undergo an image denoising routine, followed by patching, flattening, and embedding processes to form a sequence fed into the Transformer encoding module. The outputs from the encoder are then aggregated to derive the final regression result, i.e., SWH estimation. In order to evaluate the performance of SWHFormer, the dataset collected by a Decca radar aboard a free-navigating vessel is analyzed, both buoy and model-based data are utilized as ground truth. In this study, two traditional linear fitting methods, i.e., ensemble empirical mode decomposition (EEMD) and variational mode decomposition (VMD)-based approaches, and a recent deep learning algorithm, convolutional gated recurrent unit (CGRU) network are exploited for comparison with SWHFormer. It is found that the root mean square error (RMSE) of the estimated results using the proposed SWHFormer is decreased from 0.29 m, 0.26 m, and 0.18 m to 0.16 m after the temporal moving average, respectively, compared to the above three methods, when the buoy-measured SWH is served as ground truth. Besides, it is decreased from 0.30 m, 0.28 m, 0.16 m to 0.14 m, respectively, when the model-based SWH is employed as reference.
ISSN:0196-2892
1558-0644
DOI:10.1109/TGRS.2024.3376471