Loading…
Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning
Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that enco...
Saved in:
Published in: | Knowledge-based systems 2024-10, Vol.301, p.112283, Article 112283 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3 |
container_end_page | |
container_issue | |
container_start_page | 112283 |
container_title | Knowledge-based systems |
container_volume | 301 |
creator | Shang, Junyuan Niu, Chang Tao, Xiyuan Zhou, Zhiheng Yang, Junmei |
description | Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL. |
doi_str_mv | 10.1016/j.knosys.2024.112283 |
format | article |
fullrecord | <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_knosys_2024_112283</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705124009171</els_id><sourcerecordid>S0950705124009171</sourcerecordid><originalsourceid>FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhH0AiVJ4Aw55gQT_JE56QUIVFKRKXOBsbexN4tLayDaV2qcnIZw57WpWM9r5CLljtGCUyftd8el8PMWCU14WjHHeiAuyoKuK5jWt2BW5jnFHKeWcNQsSNugwwN6e0WRnDD6Pg08Z6GS9ywJq3zv7u6ch-O9-GLWI4QiTlrcQR1sPCTNwJot4AJesztEN4PR40t6lADHZI2Z7hOCs62_IZQf7iLd_c0k-np_e1y_59m3zun7c5po1VcqBywZx1fBG113NpJGtRN5WULdSCAmSCymMRslNV9OyklToUjAoK2bkqjRiSco5VwcfY8BOfQV7gHBSjKqJldqpmZWaWKmZ1Wh7mG04_na0GFTUFqc2dsSRlPH2_4AfU2d5vg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</title><source>ScienceDirect Freedom Collection 2022-2024</source><creator>Shang, Junyuan ; Niu, Chang ; Tao, Xiyuan ; Zhou, Zhiheng ; Yang, Junmei</creator><creatorcontrib>Shang, Junyuan ; Niu, Chang ; Tao, Xiyuan ; Zhou, Zhiheng ; Yang, Junmei</creatorcontrib><description>Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.</description><identifier>ISSN: 0950-7051</identifier><identifier>DOI: 10.1016/j.knosys.2024.112283</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Action recognition ; Zero-shot action recognition ; Zero-shot learning</subject><ispartof>Knowledge-based systems, 2024-10, Vol.301, p.112283, Article 112283</ispartof><rights>2024 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3</cites><orcidid>0000-0002-9677-0768 ; 0000-0001-5075-0545 ; 0000-0002-7426-1479 ; 0000-0003-4040-0175 ; 0000-0003-4301-750X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Shang, Junyuan</creatorcontrib><creatorcontrib>Niu, Chang</creatorcontrib><creatorcontrib>Tao, Xiyuan</creatorcontrib><creatorcontrib>Zhou, Zhiheng</creatorcontrib><creatorcontrib>Yang, Junmei</creatorcontrib><title>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</title><title>Knowledge-based systems</title><description>Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.</description><subject>Action recognition</subject><subject>Zero-shot action recognition</subject><subject>Zero-shot learning</subject><issn>0950-7051</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhH0AiVJ4Aw55gQT_JE56QUIVFKRKXOBsbexN4tLayDaV2qcnIZw57WpWM9r5CLljtGCUyftd8el8PMWCU14WjHHeiAuyoKuK5jWt2BW5jnFHKeWcNQsSNugwwN6e0WRnDD6Pg08Z6GS9ywJq3zv7u6ch-O9-GLWI4QiTlrcQR1sPCTNwJot4AJesztEN4PR40t6lADHZI2Z7hOCs62_IZQf7iLd_c0k-np_e1y_59m3zun7c5po1VcqBywZx1fBG113NpJGtRN5WULdSCAmSCymMRslNV9OyklToUjAoK2bkqjRiSco5VwcfY8BOfQV7gHBSjKqJldqpmZWaWKmZ1Wh7mG04_na0GFTUFqc2dsSRlPH2_4AfU2d5vg</recordid><startdate>20241009</startdate><enddate>20241009</enddate><creator>Shang, Junyuan</creator><creator>Niu, Chang</creator><creator>Tao, Xiyuan</creator><creator>Zhou, Zhiheng</creator><creator>Yang, Junmei</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-9677-0768</orcidid><orcidid>https://orcid.org/0000-0001-5075-0545</orcidid><orcidid>https://orcid.org/0000-0002-7426-1479</orcidid><orcidid>https://orcid.org/0000-0003-4040-0175</orcidid><orcidid>https://orcid.org/0000-0003-4301-750X</orcidid></search><sort><creationdate>20241009</creationdate><title>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</title><author>Shang, Junyuan ; Niu, Chang ; Tao, Xiyuan ; Zhou, Zhiheng ; Yang, Junmei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Action recognition</topic><topic>Zero-shot action recognition</topic><topic>Zero-shot learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shang, Junyuan</creatorcontrib><creatorcontrib>Niu, Chang</creatorcontrib><creatorcontrib>Tao, Xiyuan</creatorcontrib><creatorcontrib>Zhou, Zhiheng</creatorcontrib><creatorcontrib>Yang, Junmei</creatorcontrib><collection>CrossRef</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shang, Junyuan</au><au>Niu, Chang</au><au>Tao, Xiyuan</au><au>Zhou, Zhiheng</au><au>Yang, Junmei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning</atitle><jtitle>Knowledge-based systems</jtitle><date>2024-10-09</date><risdate>2024</risdate><volume>301</volume><spage>112283</spage><pages>112283-</pages><artnum>112283</artnum><issn>0950-7051</issn><abstract>Generalized zero-shot action recognition (GZSAR) aims to classify actions from both the classes seen in the training phase and unseen classes for which no samples are available. Since all training samples are derived from seen classes, conducting classification directly in a combined space that encompasses both seen and unseen classes would introduce a competition between the predicted scores of seen and unseen classes, potentially resulting in misclassification of unseen test samples as seen ones. Besides, existing generative methods rely solely on the provided class-level semantic features and overlook the exploration of interrelations among the semantic features, thereby limiting the quality of the generated features. In this paper, we tackle GZSAR through a novel method known as reservation-based gate and semantic-enhanced contrastive learning (RGSCL). We introduce a reserved classifier and optimize it with constructed fictive samples to learn the reservation-based gate which avoids the competition and alleviates the impact of biased classification scores towards seen classes. Further, we propose to conduct contrastive learning based on the hypersphere-based enhanced semantic features, aiming to ensure the generated features maintain a consistent relationship with corresponding semantic features, thereby improving the comprehension of the generator to the semantic interrelations. RGSCL exhibits strong compatibility with existing generative GZSAR methods. Extensive experimental results on three datasets of both conventional zero-shot and generalized zero-shot settings showcase the effectiveness of the proposed RGSCL.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2024.112283</doi><orcidid>https://orcid.org/0000-0002-9677-0768</orcidid><orcidid>https://orcid.org/0000-0001-5075-0545</orcidid><orcidid>https://orcid.org/0000-0002-7426-1479</orcidid><orcidid>https://orcid.org/0000-0003-4040-0175</orcidid><orcidid>https://orcid.org/0000-0003-4301-750X</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0950-7051 |
ispartof | Knowledge-based systems, 2024-10, Vol.301, p.112283, Article 112283 |
issn | 0950-7051 |
language | eng |
recordid | cdi_crossref_primary_10_1016_j_knosys_2024_112283 |
source | ScienceDirect Freedom Collection 2022-2024 |
subjects | Action recognition Zero-shot action recognition Zero-shot learning |
title | Generalized zero-shot action recognition through reservation-based gate and semantic-enhanced contrastive learning |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T08%3A58%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Generalized%20zero-shot%20action%20recognition%20through%20reservation-based%20gate%20and%20semantic-enhanced%20contrastive%20learning&rft.jtitle=Knowledge-based%20systems&rft.au=Shang,%20Junyuan&rft.date=2024-10-09&rft.volume=301&rft.spage=112283&rft.pages=112283-&rft.artnum=112283&rft.issn=0950-7051&rft_id=info:doi/10.1016/j.knosys.2024.112283&rft_dat=%3Celsevier_cross%3ES0950705124009171%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c185t-a268ee9828c7f716d6b6e2b5a7b6336a62363dce62df7045603c431a451d694d3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |