
Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion

Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insuffic...

Bibliographic Details
Published in:	IET image processing 2023-06, Vol.17 (8), p.2410-2421
Main Authors:	Zhao, Min; Yue, Qiang; Sun, Dihua; Zhong, Yuan
Format:	Article
Language:	English
Subjects:	computer vision; feature extraction; image processing; object tracking

cited_by
cites	cdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3
container_end_page	2421
container_issue	8
container_start_page	2410
container_title	IET image processing
container_volume	17
creator	Zhao, Min; Yue, Qiang; Sun, Dihua; Zhong, Yuan
description	Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets.
doi_str_mv	10.1049/ipr2.12803
format	article
fullrecord	<record><control><sourceid>wiley_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19</doaj_id><sourcerecordid>IPR212803</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</originalsourceid><addsrcrecordid>eNp9kM1KAzEUhQdRUKsbnyBroTWZZGaSpYg_hYKidSnhTnJTU6fNkEwt3fkIPqNPYn-kS1f3cvjOtzhZdsHogFGhrnwb8wHLJeUH2QmrCtZXZVkd7v9CHWenKU0pLRSVxUn2Npy1MXyiJS9LPx9HMB8k-fmkQdJBnGBHuk22Tgg0kxB99z4jNaR1IcxJaqHz4efru8NZGyI0xCF0i4jELZIP87PsyEGT8Pzv9rLXu9vxzUN_9Hg_vLke9Q2vOO8DRQfKlEjLnEmlBLeAzFIqeSUhL21pa-FQ1oXjEnlVC7DGSC6wBloh8F423HltgKluo59BXOkAXm-DECcaYudNgzqvkFalpcpwIZiVkgrGEJ0wSAvH1Np1uXOZGFKK6PY-RvVmZL0ZWW9HXsNsBy99g6t_SD18es53nV9BQoGi</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><source>IET Digital Library Journals</source><source>Wiley Online Library Open Access</source><creator>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</creator><creatorcontrib>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</creatorcontrib><description>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. 
Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</description><identifier>ISSN: 1751-9659</identifier><identifier>EISSN: 1751-9667</identifier><identifier>DOI: 10.1049/ipr2.12803</identifier><language>eng</language><publisher>Wiley</publisher><subject>computer vision ; feature extraction ; image processing ; object tracking</subject><ispartof>IET image processing, 2023-06, Vol.17 (8), p.2410-2421</ispartof><rights>2023 The Authors. 
published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</cites><orcidid>0000-0003-1648-679X ; 0000-0001-6559-1495 ; 0000-0001-8466-8574 ; 0000-0003-3548-823X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1049%2Fipr2.12803$$EPDF$$P50$$Gwiley$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1049%2Fipr2.12803$$EHTML$$P50$$Gwiley$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,11562,27924,27925,46052,46476</link.rule.ids></links><search><creatorcontrib>Zhao, Min</creatorcontrib><creatorcontrib>Yue, Qiang</creatorcontrib><creatorcontrib>Sun, Dihua</creatorcontrib><creatorcontrib>Zhong, Yuan</creatorcontrib><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><title>IET image processing</title><description>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. 
Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</description><subject>computer vision</subject><subject>feature extraction</subject><subject>image processing</subject><subject>object tracking</subject><issn>1751-9659</issn><issn>1751-9667</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>DOA</sourceid><recordid>eNp9kM1KAzEUhQdRUKsbnyBroTWZZGaSpYg_hYKidSnhTnJTU6fNkEwt3fkIPqNPYn-kS1f3cvjOtzhZdsHogFGhrnwb8wHLJeUH2QmrCtZXZVkd7v9CHWenKU0pLRSVxUn2Npy1MXyiJS9LPx9HMB8k-fmkQdJBnGBHuk22Tgg0kxB99z4jNaR1IcxJaqHz4efru8NZGyI0xCF0i4jELZIP87PsyEGT8Pzv9rLXu9vxzUN_9Hg_vLke9Q2vOO8DRQfKlEjLnEmlBLeAzFIqeSUhL21pa-FQ1oXjEnlVC7DGSC6wBloh8F423HltgKluo59BXOkAXm-DECcaYudNgzqvkFalpcpwIZiVkgrGEJ0wSAvH1Np1uXOZGFKK6PY-RvVmZL0ZWW9HXsNsBy99g6t_SD18es53nV9BQoGi</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Zhao, Min</creator><creator>Yue, Qiang</creator><creator>Sun, Dihua</creator><creator>Zhong, 
Yuan</creator><general>Wiley</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-1648-679X</orcidid><orcidid>https://orcid.org/0000-0001-6559-1495</orcidid><orcidid>https://orcid.org/0000-0001-8466-8574</orcidid><orcidid>https://orcid.org/0000-0003-3548-823X</orcidid></search><sort><creationdate>20230601</creationdate><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><author>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>computer vision</topic><topic>feature extraction</topic><topic>image processing</topic><topic>object tracking</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhao, Min</creatorcontrib><creatorcontrib>Yue, Qiang</creatorcontrib><creatorcontrib>Sun, Dihua</creatorcontrib><creatorcontrib>Zhong, Yuan</creatorcontrib><collection>Wiley Online Library Open Access</collection><collection>Wiley Open Access</collection><collection>CrossRef</collection><collection>Directory of Open Access Journals</collection><jtitle>IET image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhao, Min</au><au>Yue, Qiang</au><au>Sun, Dihua</au><au>Zhong, Yuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</atitle><jtitle>IET image 
processing</jtitle><date>2023-06-01</date><risdate>2023</risdate><volume>17</volume><issue>8</issue><spage>2410</spage><epage>2421</epage><pages>2410-2421</pages><issn>1751-9659</issn><eissn>1751-9667</eissn><abstract>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</abstract><pub>Wiley</pub><doi>10.1049/ipr2.12803</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-1648-679X</orcidid><orcidid>https://orcid.org/0000-0001-6559-1495</orcidid><orcidid>https://orcid.org/0000-0001-8466-8574</orcidid><orcidid>https://orcid.org/0000-0003-3548-823X</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1751-9659
ispartof	IET image processing, 2023-06, Vol.17 (8), p.2410-2421
issn	1751-9659 1751-9667
language	eng
recordid	cdi_doaj_primary_oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19
source	IET Digital Library Journals; Wiley Online Library Open Access
subjects	computer vision; feature extraction; image processing; object tracking
title	Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T09%3A42%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wiley_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20SwinTrack%20single%20target%20tracking%20algorithm%20based%20on%20spatio%E2%80%90temporal%20feature%20fusion&rft.jtitle=IET%20image%20processing&rft.au=Zhao,%20Min&rft.date=2023-06-01&rft.volume=17&rft.issue=8&rft.spage=2410&rft.epage=2421&rft.pages=2410-2421&rft.issn=1751-9659&rft.eissn=1751-9667&rft_id=info:doi/10.1049/ipr2.12803&rft_dat=%3Cwiley_doaj_%3EIPR212803%3C/wiley_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true