
Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion

Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insuffic...

Bibliographic Details
Published in: IET image processing 2023-06, Vol.17 (8), p.2410-2421
Main Authors: Zhao, Min, Yue, Qiang, Sun, Dihua, Zhong, Yuan
Format: Article
Language: English
cited_by
cites cdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3
container_end_page 2421
container_issue 8
container_start_page 2410
container_title IET image processing
container_volume 17
creator Zhao, Min
Yue, Qiang
Sun, Dihua
Zhong, Yuan
description Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets.
doi_str_mv 10.1049/ipr2.12803
format article
fullrecord <record><control><sourceid>wiley_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19</doaj_id><sourcerecordid>IPR212803</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</originalsourceid><addsrcrecordid>eNp9kM1KAzEUhQdRUKsbnyBroTWZZGaSpYg_hYKidSnhTnJTU6fNkEwt3fkIPqNPYn-kS1f3cvjOtzhZdsHogFGhrnwb8wHLJeUH2QmrCtZXZVkd7v9CHWenKU0pLRSVxUn2Npy1MXyiJS9LPx9HMB8k-fmkQdJBnGBHuk22Tgg0kxB99z4jNaR1IcxJaqHz4efru8NZGyI0xCF0i4jELZIP87PsyEGT8Pzv9rLXu9vxzUN_9Hg_vLke9Q2vOO8DRQfKlEjLnEmlBLeAzFIqeSUhL21pa-FQ1oXjEnlVC7DGSC6wBloh8F423HltgKluo59BXOkAXm-DECcaYudNgzqvkFalpcpwIZiVkgrGEJ0wSAvH1Np1uXOZGFKK6PY-RvVmZL0ZWW9HXsNsBy99g6t_SD18es53nV9BQoGi</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><source>IET Digital Library Journals</source><source>Wiley Online Library Open Access</source><creator>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</creator><creatorcontrib>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</creatorcontrib><description>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. 
Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</description><identifier>ISSN: 1751-9659</identifier><identifier>EISSN: 1751-9667</identifier><identifier>DOI: 10.1049/ipr2.12803</identifier><language>eng</language><publisher>Wiley</publisher><subject>computer vision ; feature extraction ; image processing ; object tracking</subject><ispartof>IET image processing, 2023-06, Vol.17 (8), p.2410-2421</ispartof><rights>2023 The Authors. 
published by John Wiley &amp; Sons Ltd on behalf of The Institution of Engineering and Technology.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</cites><orcidid>0000-0003-1648-679X ; 0000-0001-6559-1495 ; 0000-0001-8466-8574 ; 0000-0003-3548-823X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1049%2Fipr2.12803$$EPDF$$P50$$Gwiley$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1049%2Fipr2.12803$$EHTML$$P50$$Gwiley$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,11562,27924,27925,46052,46476</link.rule.ids></links><search><creatorcontrib>Zhao, Min</creatorcontrib><creatorcontrib>Yue, Qiang</creatorcontrib><creatorcontrib>Sun, Dihua</creatorcontrib><creatorcontrib>Zhong, Yuan</creatorcontrib><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><title>IET image processing</title><description>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. 
Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</description><subject>computer vision</subject><subject>feature extraction</subject><subject>image processing</subject><subject>object tracking</subject><issn>1751-9659</issn><issn>1751-9667</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>DOA</sourceid><recordid>eNp9kM1KAzEUhQdRUKsbnyBroTWZZGaSpYg_hYKidSnhTnJTU6fNkEwt3fkIPqNPYn-kS1f3cvjOtzhZdsHogFGhrnwb8wHLJeUH2QmrCtZXZVkd7v9CHWenKU0pLRSVxUn2Npy1MXyiJS9LPx9HMB8k-fmkQdJBnGBHuk22Tgg0kxB99z4jNaR1IcxJaqHz4efru8NZGyI0xCF0i4jELZIP87PsyEGT8Pzv9rLXu9vxzUN_9Hg_vLke9Q2vOO8DRQfKlEjLnEmlBLeAzFIqeSUhL21pa-FQ1oXjEnlVC7DGSC6wBloh8F423HltgKluo59BXOkAXm-DECcaYudNgzqvkFalpcpwIZiVkgrGEJ0wSAvH1Np1uXOZGFKK6PY-RvVmZL0ZWW9HXsNsBy99g6t_SD18es53nV9BQoGi</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Zhao, Min</creator><creator>Yue, Qiang</creator><creator>Sun, Dihua</creator><creator>Zhong, 
Yuan</creator><general>Wiley</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-1648-679X</orcidid><orcidid>https://orcid.org/0000-0001-6559-1495</orcidid><orcidid>https://orcid.org/0000-0001-8466-8574</orcidid><orcidid>https://orcid.org/0000-0003-3548-823X</orcidid></search><sort><creationdate>20230601</creationdate><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><author>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>computer vision</topic><topic>feature extraction</topic><topic>image processing</topic><topic>object tracking</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhao, Min</creatorcontrib><creatorcontrib>Yue, Qiang</creatorcontrib><creatorcontrib>Sun, Dihua</creatorcontrib><creatorcontrib>Zhong, Yuan</creatorcontrib><collection>Wiley Online Library Open Access</collection><collection>Wiley Open Access</collection><collection>CrossRef</collection><collection>Directory of Open Access Journals</collection><jtitle>IET image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhao, Min</au><au>Yue, Qiang</au><au>Sun, Dihua</au><au>Zhong, Yuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</atitle><jtitle>IET image 
processing</jtitle><date>2023-06-01</date><risdate>2023</risdate><volume>17</volume><issue>8</issue><spage>2410</spage><epage>2421</epage><pages>2410-2421</pages><issn>1751-9659</issn><eissn>1751-9667</eissn><abstract>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</abstract><pub>Wiley</pub><doi>10.1049/ipr2.12803</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-1648-679X</orcidid><orcidid>https://orcid.org/0000-0001-6559-1495</orcidid><orcidid>https://orcid.org/0000-0001-8466-8574</orcidid><orcidid>https://orcid.org/0000-0003-3548-823X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1751-9659
ispartof IET image processing, 2023-06, Vol.17 (8), p.2410-2421
issn 1751-9659
1751-9667
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19
source IET Digital Library Journals; Wiley Online Library Open Access
subjects computer vision
feature extraction
image processing
object tracking
title Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T09%3A42%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wiley_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20SwinTrack%20single%20target%20tracking%20algorithm%20based%20on%20spatio%E2%80%90temporal%20feature%20fusion&rft.jtitle=IET%20image%20processing&rft.au=Zhao,%20Min&rft.date=2023-06-01&rft.volume=17&rft.issue=8&rft.spage=2410&rft.epage=2421&rft.pages=2410-2421&rft.issn=1751-9659&rft.eissn=1751-9667&rft_id=info:doi/10.1049/ipr2.12803&rft_dat=%3Cwiley_doaj_%3EIPR212803%3C/wiley_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true