Loading…

Improving Feature Discrimination for Object Tracking by Structural-similarity-based Metric Learning

Existing approaches usually form the tracking task as an appearance matching procedure. However, the discrimination ability of appearance features is insufficient in these trackers, which is caused by their weak feature supervision constraints and inadequate exploitation of spatial contexts. To tack...

Full description

Saved in:
Bibliographic Details
Published in:ACM transactions on multimedia computing communications and applications 2022-11, Vol.18 (4), p.1-23
Main Authors: Wu, Jingjing, Jiang, Jianguo, Qi, Meibin, Chen, Cuiqun, Liu, Yimin
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473
cites cdi_FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473
container_end_page 23
container_issue 4
container_start_page 1
container_title ACM transactions on multimedia computing communications and applications
container_volume 18
creator Wu, Jingjing
Jiang, Jianguo
Qi, Meibin
Chen, Cuiqun
Liu, Yimin
description Existing approaches usually form the tracking task as an appearance matching procedure. However, the discrimination ability of appearance features is insufficient in these trackers, which is caused by their weak feature supervision constraints and inadequate exploitation of spatial contexts. To tackle this issue, this article proposes a novel appearance matching tracking (AMT) method to strengthen the feature restraints and capture discriminative spatial representations. Specifically, we first utilize a triplet structural loss function, which improves the learning capability of features by applying a structural similarity constraint with a triplet metric format on the features. It leverages feature statistics to capture the complex interactions of visual parts. Second, we put forward an adaptive matching module that exploits the dual spatial enhancement module to reinforce target feature discrimination. This not only boosts the representation ability of spatial context but also realizes spatially dynamic feature selection by attending to target deformation information. Moreover, this model introduces a simple but effective matching unit to intuitively evaluate the relative appearance differences between the target and the proposals. In addition, with the obtained discriminative features, AMT is capable of providing precise localization for the target. Therefore, the impact of spatial suppression imposed by window functions can be alleviated, allowing for effective tracking of high-speed moving objects. Extensive experiments prove that AMT outperforms state-of-the-art methods on six public datasets and demonstrate the effectiveness of each component in AMT.
doi_str_mv 10.1145/3497746
format article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3497746</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_3497746</sourcerecordid><originalsourceid>FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473</originalsourceid><addsrcrecordid>eNo9kD1PwzAYhC0EEqUg_oI3JkMcf2ZEhZZKQR0oc2S_sZFLmlS2i5R_Tyoqprvh0enuELqnxSOlXDwxXinF5QWaUSEokVqKy38v1DW6SWlXFEwKLmcI1vtDHH5C_4WXzuRjdPglJIhhH3qTw9BjP0S8sTsHGW-jge8Takf8keMRJt50JE1wZ2LII7EmuRa_uxwD4NqZ2E_4Lbrypkvu7qxz9Ll83S7eSL1ZrRfPNYGpXSbCOGF1WVJpgSrWaqasEdRpbSTwllYtVwBWt5ZW1nPGS8-lB85ZaR1wxebo4S8X4pBSdL45TDtMHBtaNKdvmvM37Beqn1gW</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Improving Feature Discrimination for Object Tracking by Structural-similarity-based Metric Learning</title><source>Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)</source><creator>Wu, Jingjing ; Jiang, Jianguo ; Qi, Meibin ; Chen, Cuiqun ; Liu, Yimin</creator><creatorcontrib>Wu, Jingjing ; Jiang, Jianguo ; Qi, Meibin ; Chen, Cuiqun ; Liu, Yimin</creatorcontrib><description>Existing approaches usually form the tracking task as an appearance matching procedure. However, the discrimination ability of appearance features is insufficient in these trackers, which is caused by their weak feature supervision constraints and inadequate exploitation of spatial contexts. To tackle this issue, this article proposes a novel appearance matching tracking (AMT) method to strengthen the feature restraints and capture discriminative spatial representations. Specifically, we first utilize a triplet structural loss function, which improves the learning capability of features by applying a structural similarity constraint with a triplet metric format on the features. It leverages feature statistics to capture the complex interactions of visual parts. Second, we put forward an adaptive matching module that exploits the dual spatial enhancement module to reinforce target feature discrimination. This not only boosts the representation ability of spatial context but also realizes spatially dynamic feature selection by attending to target deformation information. Moreover, this model introduces a simple but effective matching unit to intuitively evaluate the relative appearance differences between the target and the proposals. In addition, with the obtained discriminative features, AMT is capable of providing precise localization for the target. Therefore, the impact of spatial suppression imposed by window functions can be alleviated, allowing for effective tracking of high-speed moving objects. Extensive experiments prove that AMT outperforms state-of-the-art methods on six public datasets and demonstrate the effectiveness of each component in AMT.</description><identifier>ISSN: 1551-6857</identifier><identifier>EISSN: 1551-6865</identifier><identifier>DOI: 10.1145/3497746</identifier><language>eng</language><ispartof>ACM transactions on multimedia computing communications and applications, 2022-11, Vol.18 (4), p.1-23</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473</citedby><cites>FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Wu, Jingjing</creatorcontrib><creatorcontrib>Jiang, Jianguo</creatorcontrib><creatorcontrib>Qi, Meibin</creatorcontrib><creatorcontrib>Chen, Cuiqun</creatorcontrib><creatorcontrib>Liu, Yimin</creatorcontrib><title>Improving Feature Discrimination for Object Tracking by Structural-similarity-based Metric Learning</title><title>ACM transactions on multimedia computing communications and applications</title><description>Existing approaches usually form the tracking task as an appearance matching procedure. However, the discrimination ability of appearance features is insufficient in these trackers, which is caused by their weak feature supervision constraints and inadequate exploitation of spatial contexts. To tackle this issue, this article proposes a novel appearance matching tracking (AMT) method to strengthen the feature restraints and capture discriminative spatial representations. Specifically, we first utilize a triplet structural loss function, which improves the learning capability of features by applying a structural similarity constraint with a triplet metric format on the features. It leverages feature statistics to capture the complex interactions of visual parts. Second, we put forward an adaptive matching module that exploits the dual spatial enhancement module to reinforce target feature discrimination. This not only boosts the representation ability of spatial context but also realizes spatially dynamic feature selection by attending to target deformation information. Moreover, this model introduces a simple but effective matching unit to intuitively evaluate the relative appearance differences between the target and the proposals. In addition, with the obtained discriminative features, AMT is capable of providing precise localization for the target. Therefore, the impact of spatial suppression imposed by window functions can be alleviated, allowing for effective tracking of high-speed moving objects. Extensive experiments prove that AMT outperforms state-of-the-art methods on six public datasets and demonstrate the effectiveness of each component in AMT.</description><issn>1551-6857</issn><issn>1551-6865</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNo9kD1PwzAYhC0EEqUg_oI3JkMcf2ZEhZZKQR0oc2S_sZFLmlS2i5R_Tyoqprvh0enuELqnxSOlXDwxXinF5QWaUSEokVqKy38v1DW6SWlXFEwKLmcI1vtDHH5C_4WXzuRjdPglJIhhH3qTw9BjP0S8sTsHGW-jge8Takf8keMRJt50JE1wZ2LII7EmuRa_uxwD4NqZ2E_4Lbrypkvu7qxz9Ll83S7eSL1ZrRfPNYGpXSbCOGF1WVJpgSrWaqasEdRpbSTwllYtVwBWt5ZW1nPGS8-lB85ZaR1wxebo4S8X4pBSdL45TDtMHBtaNKdvmvM37Beqn1gW</recordid><startdate>20221130</startdate><enddate>20221130</enddate><creator>Wu, Jingjing</creator><creator>Jiang, Jianguo</creator><creator>Qi, Meibin</creator><creator>Chen, Cuiqun</creator><creator>Liu, Yimin</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20221130</creationdate><title>Improving Feature Discrimination for Object Tracking by Structural-similarity-based Metric Learning</title><author>Wu, Jingjing ; Jiang, Jianguo ; Qi, Meibin ; Chen, Cuiqun ; Liu, Yimin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wu, Jingjing</creatorcontrib><creatorcontrib>Jiang, Jianguo</creatorcontrib><creatorcontrib>Qi, Meibin</creatorcontrib><creatorcontrib>Chen, Cuiqun</creatorcontrib><creatorcontrib>Liu, Yimin</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on multimedia computing communications and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wu, Jingjing</au><au>Jiang, Jianguo</au><au>Qi, Meibin</au><au>Chen, Cuiqun</au><au>Liu, Yimin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improving Feature Discrimination for Object Tracking by Structural-similarity-based Metric Learning</atitle><jtitle>ACM transactions on multimedia computing communications and applications</jtitle><date>2022-11-30</date><risdate>2022</risdate><volume>18</volume><issue>4</issue><spage>1</spage><epage>23</epage><pages>1-23</pages><issn>1551-6857</issn><eissn>1551-6865</eissn><abstract>Existing approaches usually form the tracking task as an appearance matching procedure. However, the discrimination ability of appearance features is insufficient in these trackers, which is caused by their weak feature supervision constraints and inadequate exploitation of spatial contexts. To tackle this issue, this article proposes a novel appearance matching tracking (AMT) method to strengthen the feature restraints and capture discriminative spatial representations. Specifically, we first utilize a triplet structural loss function, which improves the learning capability of features by applying a structural similarity constraint with a triplet metric format on the features. It leverages feature statistics to capture the complex interactions of visual parts. Second, we put forward an adaptive matching module that exploits the dual spatial enhancement module to reinforce target feature discrimination. This not only boosts the representation ability of spatial context but also realizes spatially dynamic feature selection by attending to target deformation information. Moreover, this model introduces a simple but effective matching unit to intuitively evaluate the relative appearance differences between the target and the proposals. In addition, with the obtained discriminative features, AMT is capable of providing precise localization for the target. Therefore, the impact of spatial suppression imposed by window functions can be alleviated, allowing for effective tracking of high-speed moving objects. Extensive experiments prove that AMT outperforms state-of-the-art methods on six public datasets and demonstrate the effectiveness of each component in AMT.</abstract><doi>10.1145/3497746</doi><tpages>23</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1551-6857
ispartof ACM transactions on multimedia computing communications and applications, 2022-11, Vol.18 (4), p.1-23
issn 1551-6857
1551-6865
language eng
recordid cdi_crossref_primary_10_1145_3497746
source Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)
title Improving Feature Discrimination for Object Tracking by Structural-similarity-based Metric Learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T03%3A30%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improving%20Feature%20Discrimination%20for%20Object%20Tracking%20by%20Structural-similarity-based%20Metric%20Learning&rft.jtitle=ACM%20transactions%20on%20multimedia%20computing%20communications%20and%20applications&rft.au=Wu,%20Jingjing&rft.date=2022-11-30&rft.volume=18&rft.issue=4&rft.spage=1&rft.epage=23&rft.pages=1-23&rft.issn=1551-6857&rft.eissn=1551-6865&rft_id=info:doi/10.1145/3497746&rft_dat=%3Ccrossref%3E10_1145_3497746%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c155t-5ae5b82216bc173d837ba51e88a6c4d19d47ccb8db19bf4342f46fc4432bec473%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true