Loading…

Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions from seen states and objects. The disparity between the manually labeled semantic information and its actual visual features causes a significant imbalance of visual deviation in the distribution of various object classes...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition 2024-08, Vol.152, p.110451, Article 110451
Main Authors: Jiang, Chenyi, Ye, Qiaolin, Wang, Shidong, Shen, Yuming, Zhang, Zheng, Zhang, Haofeng
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c301t-eb43f23c59ba87c67655e54a3cbdf1e63c2ebc553f6e48b0b11b6828f7f566a83
container_end_page
container_issue
container_start_page 110451
container_title Pattern recognition
container_volume 152
creator Jiang, Chenyi
Ye, Qiaolin
Wang, Shidong
Shen, Yuming
Zhang, Zheng
Zhang, Haofeng
description Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions from seen states and objects. The disparity between the manually labeled semantic information and its actual visual features causes a significant imbalance of visual deviation in the distribution of various object classes and state classes, which is ignored by existing methods. To ameliorate these issues, we consider the CZSL task as an unbalanced multi-label classification task and propose a novel method called MUtual balancing in STate-object components (MUST) for CZSL, which provides a balancing inductive bias for the model. In particular, we split the classification of the composition classes into two consecutive processes to analyze the entanglement of the two components to get additional knowledge in advance, which reflects the degree of visual deviation between the two components. We use the knowledge gained to modify the model’s training process in order to generate more distinct class borders for classes with significant visual deviations. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art on MIT-States, UT-Zappos, and C-GQA when combined with the basic CZSL frameworks, and it can improve various CZSL frameworks. Our code is available at https://github.com/LanchJL/MUST. •The method considers CZSL as an unbalanced multi-label classification, utilizing visual deviation of components to provide an inductive bias.•Component imbalance info is used to re-weight CZSL training, enabling the model to reconstruct inter-component balance.•The method outperforms SoTAs with base CZSL methods, and augments joint embedding function based approaches.
doi_str_mv 10.1016/j.patcog.2024.110451
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_patcog_2024_110451</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0031320324002024</els_id><sourcerecordid>S0031320324002024</sourcerecordid><originalsourceid>FETCH-LOGICAL-c301t-eb43f23c59ba87c67655e54a3cbdf1e63c2ebc553f6e48b0b11b6828f7f566a83</originalsourceid><addsrcrecordid>eNp9kMFKAzEQhoMoWKtv4GFfYNfMZpNdL4IWrUKlQvXiJSTppGZpk5Kkgm_vlvXsafgZvp-Zj5BroBVQEDd9tVfZhE1V07qpAGjD4YRMoGtZyaGpT8mEUgYlqyk7Jxcp9ZRCOywm5O31kA9qWzyorfLG-U3hfLHKKmO51D2aXMzCbh88-pwKG-IYk8su-AH7xBjK1VfIxQJV9AN_Sc6s2ia8-ptT8vH0-D57LhfL-cvsflEaRiGXqBtma2b4rVZda0QrOEfeKGb02gIKZmrUhnNmBTadphpAi67ubGu5EKpjU9KMvSaGlCJauY9up-KPBCqPVmQvRyvyaEWOVgbsbsRwuO3bYZTJOPQG1y4O38p1cP8X_ALlgm2j</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning</title><source>Elsevier</source><creator>Jiang, Chenyi ; Ye, Qiaolin ; Wang, Shidong ; Shen, Yuming ; Zhang, Zheng ; Zhang, Haofeng</creator><creatorcontrib>Jiang, Chenyi ; Ye, Qiaolin ; Wang, Shidong ; Shen, Yuming ; Zhang, Zheng ; Zhang, Haofeng</creatorcontrib><description>Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions from seen states and objects. The disparity between the manually labeled semantic information and its actual visual features causes a significant imbalance of visual deviation in the distribution of various object classes and state classes, which is ignored by existing methods. To ameliorate these issues, we consider the CZSL task as an unbalanced multi-label classification task and propose a novel method called MUtual balancing in STate-object components (MUST) for CZSL, which provides a balancing inductive bias for the model. In particular, we split the classification of the composition classes into two consecutive processes to analyze the entanglement of the two components to get additional knowledge in advance, which reflects the degree of visual deviation between the two components. We use the knowledge gained to modify the model’s training process in order to generate more distinct class borders for classes with significant visual deviations. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art on MIT-States, UT-Zappos, and C-GQA when combined with the basic CZSL frameworks, and it can improve various CZSL frameworks. Our code is available at https://github.com/LanchJL/MUST. •The method considers CZSL as an unbalanced multi-label classification, utilizing visual deviation of components to provide an inductive bias.•Component imbalance info is used to re-weight CZSL training, enabling the model to reconstruct inter-component balance.•The method outperforms SoTAs with base CZSL methods, and augments joint embedding function based approaches.</description><identifier>ISSN: 0031-3203</identifier><identifier>EISSN: 1873-5142</identifier><identifier>DOI: 10.1016/j.patcog.2024.110451</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Compositional Zero-Shot Learning ; Image classification ; Mutual Balancing ; Visual-attribute</subject><ispartof>Pattern recognition, 2024-08, Vol.152, p.110451, Article 110451</ispartof><rights>2024 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c301t-eb43f23c59ba87c67655e54a3cbdf1e63c2ebc553f6e48b0b11b6828f7f566a83</cites><orcidid>0000-0003-1023-1286 ; 0000-0002-4039-7618 ; 0000-0002-9031-4163</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Jiang, Chenyi</creatorcontrib><creatorcontrib>Ye, Qiaolin</creatorcontrib><creatorcontrib>Wang, Shidong</creatorcontrib><creatorcontrib>Shen, Yuming</creatorcontrib><creatorcontrib>Zhang, Zheng</creatorcontrib><creatorcontrib>Zhang, Haofeng</creatorcontrib><title>Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning</title><title>Pattern recognition</title><description>Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions from seen states and objects. The disparity between the manually labeled semantic information and its actual visual features causes a significant imbalance of visual deviation in the distribution of various object classes and state classes, which is ignored by existing methods. To ameliorate these issues, we consider the CZSL task as an unbalanced multi-label classification task and propose a novel method called MUtual balancing in STate-object components (MUST) for CZSL, which provides a balancing inductive bias for the model. In particular, we split the classification of the composition classes into two consecutive processes to analyze the entanglement of the two components to get additional knowledge in advance, which reflects the degree of visual deviation between the two components. We use the knowledge gained to modify the model’s training process in order to generate more distinct class borders for classes with significant visual deviations. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art on MIT-States, UT-Zappos, and C-GQA when combined with the basic CZSL frameworks, and it can improve various CZSL frameworks. Our code is available at https://github.com/LanchJL/MUST. •The method considers CZSL as an unbalanced multi-label classification, utilizing visual deviation of components to provide an inductive bias.•Component imbalance info is used to re-weight CZSL training, enabling the model to reconstruct inter-component balance.•The method outperforms SoTAs with base CZSL methods, and augments joint embedding function based approaches.</description><subject>Compositional Zero-Shot Learning</subject><subject>Image classification</subject><subject>Mutual Balancing</subject><subject>Visual-attribute</subject><issn>0031-3203</issn><issn>1873-5142</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kMFKAzEQhoMoWKtv4GFfYNfMZpNdL4IWrUKlQvXiJSTppGZpk5Kkgm_vlvXsafgZvp-Zj5BroBVQEDd9tVfZhE1V07qpAGjD4YRMoGtZyaGpT8mEUgYlqyk7Jxcp9ZRCOywm5O31kA9qWzyorfLG-U3hfLHKKmO51D2aXMzCbh88-pwKG-IYk8su-AH7xBjK1VfIxQJV9AN_Sc6s2ia8-ptT8vH0-D57LhfL-cvsflEaRiGXqBtma2b4rVZda0QrOEfeKGb02gIKZmrUhnNmBTadphpAi67ubGu5EKpjU9KMvSaGlCJauY9up-KPBCqPVmQvRyvyaEWOVgbsbsRwuO3bYZTJOPQG1y4O38p1cP8X_ALlgm2j</recordid><startdate>202408</startdate><enddate>202408</enddate><creator>Jiang, Chenyi</creator><creator>Ye, Qiaolin</creator><creator>Wang, Shidong</creator><creator>Shen, Yuming</creator><creator>Zhang, Zheng</creator><creator>Zhang, Haofeng</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-1023-1286</orcidid><orcidid>https://orcid.org/0000-0002-4039-7618</orcidid><orcidid>https://orcid.org/0000-0002-9031-4163</orcidid></search><sort><creationdate>202408</creationdate><title>Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning</title><author>Jiang, Chenyi ; Ye, Qiaolin ; Wang, Shidong ; Shen, Yuming ; Zhang, Zheng ; Zhang, Haofeng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c301t-eb43f23c59ba87c67655e54a3cbdf1e63c2ebc553f6e48b0b11b6828f7f566a83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Compositional Zero-Shot Learning</topic><topic>Image classification</topic><topic>Mutual Balancing</topic><topic>Visual-attribute</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jiang, Chenyi</creatorcontrib><creatorcontrib>Ye, Qiaolin</creatorcontrib><creatorcontrib>Wang, Shidong</creatorcontrib><creatorcontrib>Shen, Yuming</creatorcontrib><creatorcontrib>Zhang, Zheng</creatorcontrib><creatorcontrib>Zhang, Haofeng</creatorcontrib><collection>CrossRef</collection><jtitle>Pattern recognition</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jiang, Chenyi</au><au>Ye, Qiaolin</au><au>Wang, Shidong</au><au>Shen, Yuming</au><au>Zhang, Zheng</au><au>Zhang, Haofeng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning</atitle><jtitle>Pattern recognition</jtitle><date>2024-08</date><risdate>2024</risdate><volume>152</volume><spage>110451</spage><pages>110451-</pages><artnum>110451</artnum><issn>0031-3203</issn><eissn>1873-5142</eissn><abstract>Compositional Zero-Shot Learning (CZSL) aims to recognize unseen compositions from seen states and objects. The disparity between the manually labeled semantic information and its actual visual features causes a significant imbalance of visual deviation in the distribution of various object classes and state classes, which is ignored by existing methods. To ameliorate these issues, we consider the CZSL task as an unbalanced multi-label classification task and propose a novel method called MUtual balancing in STate-object components (MUST) for CZSL, which provides a balancing inductive bias for the model. In particular, we split the classification of the composition classes into two consecutive processes to analyze the entanglement of the two components to get additional knowledge in advance, which reflects the degree of visual deviation between the two components. We use the knowledge gained to modify the model’s training process in order to generate more distinct class borders for classes with significant visual deviations. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art on MIT-States, UT-Zappos, and C-GQA when combined with the basic CZSL frameworks, and it can improve various CZSL frameworks. Our code is available at https://github.com/LanchJL/MUST. •The method considers CZSL as an unbalanced multi-label classification, utilizing visual deviation of components to provide an inductive bias.•Component imbalance info is used to re-weight CZSL training, enabling the model to reconstruct inter-component balance.•The method outperforms SoTAs with base CZSL methods, and augments joint embedding function based approaches.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.patcog.2024.110451</doi><orcidid>https://orcid.org/0000-0003-1023-1286</orcidid><orcidid>https://orcid.org/0000-0002-4039-7618</orcidid><orcidid>https://orcid.org/0000-0002-9031-4163</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0031-3203
ispartof Pattern recognition, 2024-08, Vol.152, p.110451, Article 110451
issn 0031-3203
1873-5142
language eng
recordid cdi_crossref_primary_10_1016_j_patcog_2024_110451
source Elsevier
subjects Compositional Zero-Shot Learning
Image classification
Mutual Balancing
Visual-attribute
title Mutual Balancing in State-Object Components for Compositional Zero-Shot Learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T22%3A53%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mutual%20Balancing%20in%20State-Object%20Components%20for%20Compositional%20Zero-Shot%20Learning&rft.jtitle=Pattern%20recognition&rft.au=Jiang,%20Chenyi&rft.date=2024-08&rft.volume=152&rft.spage=110451&rft.pages=110451-&rft.artnum=110451&rft.issn=0031-3203&rft.eissn=1873-5142&rft_id=info:doi/10.1016/j.patcog.2024.110451&rft_dat=%3Celsevier_cross%3ES0031320324002024%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c301t-eb43f23c59ba87c67655e54a3cbdf1e63c2ebc553f6e48b0b11b6828f7f566a83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true