Loading…
Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion
Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insuffic...
Saved in:
Published in: | IET image processing 2023-06, Vol.17 (8), p.2410-2421 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3 |
container_end_page | 2421 |
container_issue | 8 |
container_start_page | 2410 |
container_title | IET image processing |
container_volume | 17 |
creator | Zhao, Min Yue, Qiang Sun, Dihua Zhong, Yuan |
description | Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets.
Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. |
doi_str_mv | 10.1049/ipr2.12803 |
format | article |
fullrecord | <record><control><sourceid>wiley_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19</doaj_id><sourcerecordid>IPR212803</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</originalsourceid><addsrcrecordid>eNp9kM1KAzEUhQdRUKsbnyBroTWZZGaSpYg_hYKidSnhTnJTU6fNkEwt3fkIPqNPYn-kS1f3cvjOtzhZdsHogFGhrnwb8wHLJeUH2QmrCtZXZVkd7v9CHWenKU0pLRSVxUn2Npy1MXyiJS9LPx9HMB8k-fmkQdJBnGBHuk22Tgg0kxB99z4jNaR1IcxJaqHz4efru8NZGyI0xCF0i4jELZIP87PsyEGT8Pzv9rLXu9vxzUN_9Hg_vLke9Q2vOO8DRQfKlEjLnEmlBLeAzFIqeSUhL21pa-FQ1oXjEnlVC7DGSC6wBloh8F423HltgKluo59BXOkAXm-DECcaYudNgzqvkFalpcpwIZiVkgrGEJ0wSAvH1Np1uXOZGFKK6PY-RvVmZL0ZWW9HXsNsBy99g6t_SD18es53nV9BQoGi</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><source>IET Digital Library Journals</source><source>Wiley Online Library Open Access</source><creator>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</creator><creatorcontrib>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</creatorcontrib><description>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets.
Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</description><identifier>ISSN: 1751-9659</identifier><identifier>EISSN: 1751-9667</identifier><identifier>DOI: 10.1049/ipr2.12803</identifier><language>eng</language><publisher>Wiley</publisher><subject>computer vision ; feature extraction ; image processing ; object tracking</subject><ispartof>IET image processing, 2023-06, Vol.17 (8), p.2410-2421</ispartof><rights>2023 The Authors. published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</cites><orcidid>0000-0003-1648-679X ; 0000-0001-6559-1495 ; 0000-0001-8466-8574 ; 0000-0003-3548-823X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1049%2Fipr2.12803$$EPDF$$P50$$Gwiley$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1049%2Fipr2.12803$$EHTML$$P50$$Gwiley$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,11562,27924,27925,46052,46476</link.rule.ids></links><search><creatorcontrib>Zhao, Min</creatorcontrib><creatorcontrib>Yue, Qiang</creatorcontrib><creatorcontrib>Sun, Dihua</creatorcontrib><creatorcontrib>Zhong, Yuan</creatorcontrib><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><title>IET image processing</title><description>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets.
Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</description><subject>computer vision</subject><subject>feature extraction</subject><subject>image processing</subject><subject>object tracking</subject><issn>1751-9659</issn><issn>1751-9667</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>DOA</sourceid><recordid>eNp9kM1KAzEUhQdRUKsbnyBroTWZZGaSpYg_hYKidSnhTnJTU6fNkEwt3fkIPqNPYn-kS1f3cvjOtzhZdsHogFGhrnwb8wHLJeUH2QmrCtZXZVkd7v9CHWenKU0pLRSVxUn2Npy1MXyiJS9LPx9HMB8k-fmkQdJBnGBHuk22Tgg0kxB99z4jNaR1IcxJaqHz4efru8NZGyI0xCF0i4jELZIP87PsyEGT8Pzv9rLXu9vxzUN_9Hg_vLke9Q2vOO8DRQfKlEjLnEmlBLeAzFIqeSUhL21pa-FQ1oXjEnlVC7DGSC6wBloh8F423HltgKluo59BXOkAXm-DECcaYudNgzqvkFalpcpwIZiVkgrGEJ0wSAvH1Np1uXOZGFKK6PY-RvVmZL0ZWW9HXsNsBy99g6t_SD18es53nV9BQoGi</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Zhao, Min</creator><creator>Yue, Qiang</creator><creator>Sun, Dihua</creator><creator>Zhong, Yuan</creator><general>Wiley</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-1648-679X</orcidid><orcidid>https://orcid.org/0000-0001-6559-1495</orcidid><orcidid>https://orcid.org/0000-0001-8466-8574</orcidid><orcidid>https://orcid.org/0000-0003-3548-823X</orcidid></search><sort><creationdate>20230601</creationdate><title>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</title><author>Zhao, Min ; Yue, Qiang ; Sun, Dihua ; Zhong, Yuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>computer vision</topic><topic>feature extraction</topic><topic>image processing</topic><topic>object tracking</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhao, Min</creatorcontrib><creatorcontrib>Yue, Qiang</creatorcontrib><creatorcontrib>Sun, Dihua</creatorcontrib><creatorcontrib>Zhong, Yuan</creatorcontrib><collection>Wiley Online Library Open Access</collection><collection>Wiley Open Access</collection><collection>CrossRef</collection><collection>Directory of Open Access Journals</collection><jtitle>IET image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhao, Min</au><au>Yue, Qiang</au><au>Sun, Dihua</au><au>Zhong, Yuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion</atitle><jtitle>IET image processing</jtitle><date>2023-06-01</date><risdate>2023</risdate><volume>17</volume><issue>8</issue><spage>2410</spage><epage>2421</epage><pages>2410-2421</pages><issn>1751-9659</issn><eissn>1751-9667</eissn><abstract>Single target tracking based on computer vision helps to collect, analyse and exploit target information. The SwinTrack algorithm has received widespread attention as one of the twin network algorithms with the best trade‐off between tracking accuracy and speed, but it also suffers from the insufficient fusion of deep and shallow features leading to loss of shallow information and insufficient use of temporal information leading to inconsistency between target and template. Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features. The effectiveness of this method was verified on the OTB100 and GOT10K datasets.
Semantic information and detailed information are combined and multiple convolutional forms are introduced to propose a multi‐level feature fusion strategy to effectively fuse features in space. Besides, based on the idea of feedback, a dynamic template branching approach is also designed to fuse temporal features and enhance the representation of target features.</abstract><pub>Wiley</pub><doi>10.1049/ipr2.12803</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-1648-679X</orcidid><orcidid>https://orcid.org/0000-0001-6559-1495</orcidid><orcidid>https://orcid.org/0000-0001-8466-8574</orcidid><orcidid>https://orcid.org/0000-0003-3548-823X</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1751-9659 |
ispartof | IET image processing, 2023-06, Vol.17 (8), p.2410-2421 |
issn | 1751-9659 1751-9667 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_27e076d09c3441d880411eef4ce05f19 |
source | IET Digital Library Journals; Wiley Online Library Open Access |
subjects | computer vision feature extraction image processing object tracking |
title | Improved SwinTrack single target tracking algorithm based on spatio‐temporal feature fusion |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T09%3A42%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wiley_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20SwinTrack%20single%20target%20tracking%20algorithm%20based%20on%20spatio%E2%80%90temporal%20feature%20fusion&rft.jtitle=IET%20image%20processing&rft.au=Zhao,%20Min&rft.date=2023-06-01&rft.volume=17&rft.issue=8&rft.spage=2410&rft.epage=2421&rft.pages=2410-2421&rft.issn=1751-9659&rft.eissn=1751-9667&rft_id=info:doi/10.1049/ipr2.12803&rft_dat=%3Cwiley_doaj_%3EIPR212803%3C/wiley_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3733-a0efa9c6e062189943dae1d008378a26d6db4fe8b5f38e37b4adcc834eba07ea3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |