Loading…

Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning

Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that enco...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2024-10, Vol.301, p.112283, Article 112283
Main Authors: Shang, Junyuan, Niu, Chang, Tao, Xiyuan, Zhou, Zhiheng, Yang, Junmei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3
container_end_page
container_issue
container_start_page 112283
container_title Knowledge-based systems
container_volume 301
creator Shang, Junyuan
Niu, Chang
Tao, Xiyuan
Zhou, Zhiheng
Yang, Junmei
description Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.
doi_str_mv 10.1016/j.knosys.2024.112283
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_knosys_2024_112283</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705124009171</els_id><sourcerecordid>S0950705124009171</sourcerecordid><originalsourceid>FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhH0AiVJ4Aw55gQT_JE56QUIVFKRKXOBsbexN4tLayDaV2qcnIZw57WpWM9r5CLljtGCUyftd8el8PMWCU14WjHHeiAuyoKuK5jWt2BW5jnFHKeWcNQsSNugwwN6e0WRnDD6Pg08Z6GS9ywJq3zv7u6ch-O9-GLWI4QiTlrcQR1sPCTNwJot4AJesztEN4PR40t6lADHZI2Z7hOCs62_IZQf7iLd_c0k-np_e1y_59m3zun7c5po1VcqBywZx1fBG113NpJGtRN5WULdSCAmSCymMRslNV9OyklToUjAoK2bkqjRiSco5VwcfY8BOfQV7gHBSjKqJldqpmZWaWKmZ1Wh7mG04_na0GFTUFqc2dsSRlPH2_4AfU2d5vg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</title><source>ScienceDirect Freedom Collection 2022-2024</source><creator>Shang, Junyuan ; Niu, Chang ; Tao, Xiyuan ; Zhou, Zhiheng ; Yang, Junmei</creator><creatorcontrib>Shang, Junyuan ; Niu, Chang ; Tao, Xiyuan ; Zhou, Zhiheng ; Yang, Junmei</creatorcontrib><description>Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.</description><identifier>ISSN: 0950-7051</identifier><identifier>DOI: 10.1016/j.knosys.2024.112283</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Action recognition ; Zero-shot action recognition ; Zero-shot learning</subject><ispartof>Knowledge-based systems, 2024-10, Vol.301, p.112283, Article 112283</ispartof><rights>2024 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3</cites><orcidid>0000-0002-9677-0768 ; 0000-0001-5075-0545 ; 0000-0002-7426-1479 ; 0000-0003-4040-0175 ; 0000-0003-4301-750X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Shang, Junyuan</creatorcontrib><creatorcontrib>Niu, Chang</creatorcontrib><creatorcontrib>Tao, Xiyuan</creatorcontrib><creatorcontrib>Zhou, Zhiheng</creatorcontrib><creatorcontrib>Yang, Junmei</creatorcontrib><title>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</title><title>Knowledge-based systems</title><description>Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.</description><subject>Action recognition</subject><subject>Zero-shot action recognition</subject><subject>Zero-shot learning</subject><issn>0950-7051</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhH0AiVJ4Aw55gQT_JE56QUIVFKRKXOBsbexN4tLayDaV2qcnIZw57WpWM9r5CLljtGCUyftd8el8PMWCU14WjHHeiAuyoKuK5jWt2BW5jnFHKeWcNQsSNugwwN6e0WRnDD6Pg08Z6GS9ywJq3zv7u6ch-O9-GLWI4QiTlrcQR1sPCTNwJot4AJesztEN4PR40t6lADHZI2Z7hOCs62_IZQf7iLd_c0k-np_e1y_59m3zun7c5po1VcqBywZx1fBG113NpJGtRN5WULdSCAmSCymMRslNV9OyklToUjAoK2bkqjRiSco5VwcfY8BOfQV7gHBSjKqJldqpmZWaWKmZ1Wh7mG04_na0GFTUFqc2dsSRlPH2_4AfU2d5vg</recordid><startdate>20241009</startdate><enddate>20241009</enddate><creator>Shang, Junyuan</creator><creator>Niu, Chang</creator><creator>Tao, Xiyuan</creator><creator>Zhou, Zhiheng</creator><creator>Yang, Junmei</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-9677-0768</orcidid><orcidid>https://orcid.org/0000-0001-5075-0545</orcidid><orcidid>https://orcid.org/0000-0002-7426-1479</orcidid><orcidid>https://orcid.org/0000-0003-4040-0175</orcidid><orcidid>https://orcid.org/0000-0003-4301-750X</orcidid></search><sort><creationdate>20241009</creationdate><title>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</title><author>Shang, Junyuan ; Niu, Chang ; Tao, Xiyuan ; Zhou, Zhiheng ; Yang, Junmei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Action recognition</topic><topic>Zero-shot action recognition</topic><topic>Zero-shot learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shang, Junyuan</creatorcontrib><creatorcontrib>Niu, Chang</creatorcontrib><creatorcontrib>Tao, Xiyuan</creatorcontrib><creatorcontrib>Zhou, Zhiheng</creatorcontrib><creatorcontrib>Yang, Junmei</creatorcontrib><collection>CrossRef</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shang, Junyuan</au><au>Niu, Chang</au><au>Tao, Xiyuan</au><au>Zhou, Zhiheng</au><au>Yang, Junmei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</atitle><jtitle>Knowledge-based systems</jtitle><date>2024-10-09</date><risdate>2024</risdate><volume>301</volume><spage>112283</spage><pages>112283-</pages><artnum>112283</artnum><issn>0950-7051</issn><abstract>Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2024.112283</doi><orcidid>https://orcid.org/0000-0002-9677-0768</orcidid><orcidid>https://orcid.org/0000-0001-5075-0545</orcidid><orcidid>https://orcid.org/0000-0002-7426-1479</orcidid><orcidid>https://orcid.org/0000-0003-4040-0175</orcidid><orcidid>https://orcid.org/0000-0003-4301-750X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0950-7051
ispartof Knowledge-based systems, 2024-10, Vol.301, p.112283, Article 112283
issn 0950-7051
language eng
recordid cdi_crossref_primary_10_1016_j_knosys_2024_112283
source ScienceDirect Freedom Collection 2022-2024
subjects Action recognition
Zero-shot action recognition
Zero-shot learning
title Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T08%3A58%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Generalized%20zero-shot%20action%20recognition%20through%20reservation-based%20gate%20and%20semantic-enhanced%20contrastive%20learning&rft.jtitle=Knowledge-based%20systems&rft.au=Shang,%20Junyuan&rft.date=2024-10-09&rft.volume=301&rft.spage=112283&rft.pages=112283-&rft.artnum=112283&rft.issn=0950-7051&rft_id=info:doi/10.1016/j.knosys.2024.112283&rft_dat=%3Celsevier_cross%3ES0950705124009171%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true