MLP-Net: Multilayer Perceptron Fusion Network for Infrared Small Target Detection
Published in: | IEEE Transactions on Geoscience and Remote Sensing, 2025, Vol. 63, pp. 1-13 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | Infrared small target detection (IRSTD) faces various challenges such as long distances, weak features, and small scales. While methodologies based on convolutional neural networks (CNNs) have made strides, they are inherently hampered by a bias toward local reduction, limiting their global interpretive power. Conversely, Transformer-based approaches, though capable of capturing long-range dependencies, struggle with computational inefficiencies due to their quadratic complexity. To surmount these challenges, this article presents MLP-Net, a novel multilayer perceptron (MLP) fusion network for IRSTD. The architecture combines the advantages of CNNs and MLPs to capture global semantic information from local features and significantly enhance feature representation. Additionally, we develop a parallel token interaction mixer (PTIM) that uses MLPs to process the token representations with direction-specific interactive information across the height, width, and channel dimensions, dynamically strengthening long-range dependency modeling. Complementing this, we devise a contextual selection fusion module (CSFM) to gradually aggregate high-level semantics and low-level details from coarse to fine. This module integrates the complementary characteristics of different layers to improve detection accuracy. Finally, comprehensive experiments on the NUAA-SIRST, NUDT-SIRST, and IRSTD-1K benchmarks demonstrate that the proposed MLP-Net delivers promising detection performance, outperforming other state-of-the-art alternatives. The code will be available at https://github.com/Zhishe-Wang/MLP-Net. |
ISSN: | 0196-2892 |
DOI: | 10.1109/TGRS.2024.3515648 |
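
The summary describes a mixer that applies MLPs along the height, width, and channel dimensions in parallel. The sketch below illustrates only that general idea of axis-wise mixing and is not the authors' PTIM: the function names `axis_mlp` and `parallel_axis_mixer`, the single linear layer per branch, the summed fusion, and the residual connection are all assumptions made for illustration. It shows why one such layer can relate pixels that are far apart along a row or column without computing pairwise attention.

```python
import numpy as np

def axis_mlp(x, weight, axis):
    """Apply one linear map along a single axis of x (hypothetical helper)."""
    moved = np.moveaxis(x, axis, -1)      # bring the target axis to the end
    mixed = moved @ weight                # (..., n) @ (n, n) -> (..., n)
    return np.moveaxis(mixed, -1, axis)   # restore the original axis order

def parallel_axis_mixer(x, w_h, w_w, w_c):
    """Sum three parallel branches mixing along H, W, and C, plus a residual.

    x has shape (H, W, C). Summed fusion and the residual are assumptions;
    the paper may fuse its branches differently.
    """
    return (x
            + axis_mlp(x, w_h, 0)   # height branch
            + axis_mlp(x, w_w, 1)   # width branch
            + axis_mlp(x, w_c, 2))  # channel branch

rng = np.random.default_rng(0)
H, W, C = 8, 8, 4
x = rng.standard_normal((H, W, C))
w_h = rng.standard_normal((H, H)) * 0.1
w_w = rng.standard_normal((W, W)) * 0.1
w_c = rng.standard_normal((C, C)) * 0.1

y = parallel_axis_mixer(x, w_h, w_w, w_c)

# Perturb one pixel in the top row and observe the bottom row of that column.
x2 = x.copy()
x2[0, 0, 0] += 1.0
y2 = parallel_axis_mixer(x2, w_h, w_w, w_c)
far_change = abs(y2[H - 1, 0, 0] - y[H - 1, 0, 0])
print(y.shape, far_change > 0)
```

In this sketch each branch costs a matrix product along one axis, so a single layer links every position in a column (or row) to every other, whereas full self-attention over all H×W tokens scales quadratically in the number of tokens.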