A pooling-based feature pyramid network for salient object detection

Bibliographic Details
Published in:	Image and vision computing 2021-03, Vol.107, p.104099, Article 104099
Main Authors:	Shi, Caijuan, Zhang, Weiming, Duan, Changyu, Chen, Houru
Format:	Article
Language:	English
Subjects:	Convolutional neural network; Deep feature learning; Pooling; Salient object detection; U-shaped feature pyramid

Description
Summary:	How to effectively utilize and fuse deep features has become a critical issue in salient object detection. Most existing methods adopt convolutional features based on U-shaped structures and fuse multi-scale convolutional features without fully considering the different characteristics of high-level and low-level features. Furthermore, existing salient object detection methods rarely consider the role of pooling in convolutional neural networks, and there is still much room to improve detection performance for objects in complex scenes. To address these problems, we propose a pooling-based feature pyramid (PFP) network to boost salient object detection performance. First, we design two U-shaped feature pyramid modules: one captures rich semantic information from high-level features, and the other obtains clear saliency boundaries from low-level features. Second, a pyramid pooling refinement module uses pooling to capture more semantic information. Third, a universal channel-wise attention (UCA) module selects effective multi-scale, multi-receptive-field high-level features for rich semantic information, even in complex scenes. Finally, we fuse the selected high-level features with the low-level features and apply an edge preservation loss to obtain accurate boundary localization. Extensive experiments on five datasets indicate that the proposed method achieves better salient object detection performance than state-of-the-art methods.
• A pooling-based feature pyramid network for salient object detection is proposed;
• Two U-shaped feature pyramid modules are designed;
• Pyramid pooling refinement module is designed to capture rich semantic information;
• Universal channel-wise attention module is designed to select high-level features;
• Experiments demonstrate the superiority of PFP for salient object detection.
ISSN:	0262-8856 1872-8138
DOI:	10.1016/j.imavis.2021.104099
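
Illustrative note: the summary names pyramid pooling and channel-wise attention but gives no implementation details. The Python sketch below is therefore not the authors' PFP or UCA modules. It only shows, under simplifying assumptions, what the two generic ideas look like: average pooling of one feature map at several grid sizes, and reweighting of each channel by a gate computed from its global average. The function names, the bin sizes (1, 2, 4), and the use of a plain sigmoid gate with no learned weights are all assumptions made for illustration.

```python
import math

def adaptive_avg_pool(fmap, bins):
    """Average-pool a 2D feature map (list of rows) into a bins x bins grid."""
    h, w = len(fmap), len(fmap[0])
    out = []
    for i in range(bins):
        # Row range covered by output cell i (floor start, ceil end).
        r0, r1 = (i * h) // bins, math.ceil((i + 1) * h / bins)
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, math.ceil((j + 1) * w / bins)
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        out.append(row)
    return out

def pyramid_pool(fmap, bin_sizes=(1, 2, 4)):
    """Pool the same map at several scales (bin sizes are an assumption)."""
    return {b: adaptive_avg_pool(fmap, b) for b in bin_sizes}

def channel_attention(channels):
    """Scale each channel by sigmoid(global average); no learned weights."""
    gated = []
    for fmap in channels:
        mean = sum(map(sum, fmap)) / (len(fmap) * len(fmap[0]))
        gate = 1.0 / (1.0 + math.exp(-mean))
        gated.append([[v * gate for v in row] for row in fmap])
    return gated
```

In a trained network the gate would normally come from learned layers applied to the pooled channel descriptor, and the pooled maps would be upsampled and fused with the input; both steps are omitted here to keep the sketch self-contained.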