Loading…

Customizing the feature modulation for visual tracking

In visual tracking, the target always undergoes appearance variations due to a variety of challenging situations, such as deformation and rotation. In this paper, we propose the target-guided feature modulation network based on siamese network to extract the more target-relevant features, which is g...

Full description

Saved in:
Bibliographic Details
Published in:The Visual computer 2024-09, Vol.40 (9), p.6547-6566
Main Authors: Zhang, Yuping, Yang, Zepeng, Ma, Bo, Wu, Jiahao, Jin, Fusheng
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In visual tracking, the target always undergoes appearance variations due to a variety of challenging situations, such as deformation and rotation. In this paper, we propose the target-guided feature modulation network based on siamese network to extract the more target-relevant features, which is guided by a template to modulate the search features that can be directly used for classification and localization. Specifically, we customize two target-guided feature modulation subnetworks for visual tracking, which are called template-guided spatial-attention modulation subnetwork and template-guided channel-attention modulation subnetwork, to achieve this proposal. The former controls the discriminative region based on the correlation of each search feature position with all template feature positions, whereas the latter readjusts the importance of each channel based on the response value of each channel feature of the template feature and the response value of each channel of the search feature. Extensive experiments on multiple datasets have demonstrated the effectiveness of the proposed approach in this paper.
ISSN:0178-2789
1432-2315
DOI:10.1007/s00371-023-03182-5