Loading…
RGB-T Saliency Detection Based on Multiscale Modal Reasoning Interaction
How to explore the interaction between RGB and thermal infrared modalities is critical to the success of RGB-T salient object detection (SOD). Most existing methods collect and convey information by collecting and conveying information from both modalities without exploring the implicit relationship...
Saved in:
Published in: | IEEE transactions on instrumentation and measurement 2024, Vol.73, p.1-15 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | How to explore the interaction between RGB and thermal infrared modalities is critical to the success of RGB-T salient object detection (SOD). Most existing methods collect and convey information by collecting and conveying information from both modalities without exploring the implicit relationship between the two modalities. To address this problem, we innovatively propose a multiscale modal inference interaction network that fully realizes the advantageous expression and complementary fusion of the two modalities by inferring the implicit expression between them. Specifically, we design a multiscale attention feature (MSAF) extraction module to adaptively balance the relationship between shallow image details and deep image semantic features, thus enhancing the feature representation of the network at different scales. Then, we design a cross-modal interaction module based on the graph model (CMGM) to utilize the inference capability of graph models to explore complementary information in visible and infrared images and fuse them effectively. In addition, we propose the edge-enhanced saliency map generation module (EEM) to improve the accuracy of salient object boundaries and optimize the generation of saliency maps. Extensive experiments show that our method achieves the best performance on all three publicly available datasets compared to state-of-the-art RGB-T saliency detection methods, proving its superiority for saliency detection tasks. |
---|---|
ISSN: | 0018-9456 1557-9662 |
DOI: | 10.1109/TIM.2024.3419115 |