Loading…

Dual-perspective semantic-aware representation blending for multi-label image recognition with partial labels

Recently, multi-label image recognition with partial labels (MLR-PL) has attracted increasing attention, in which only some labels are known while others are unknown for each image. However, current algorithms rely on pre-trained image similarity models or iteratively updating the image classificati...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2024-09, Vol.249, p.123526, Article 123526
Main Authors: Pu, Tao, Chen, Tianshui, Wu, Hefeng, Shi, Yukai, Yang, Zhijing, Lin, Liang
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recently, multi-label image recognition with partial labels (MLR-PL) has attracted increasing attention, in which only some labels are known while others are unknown for each image. However, current algorithms rely on pre-trained image similarity models or iteratively updating the image classification models to generate pseudo labels for unknown labels. Thus, they depend on a certain amount of annotations and inevitably suffer from obvious performance drops. To address this dilemma, we propose a dual-perspective semantic-aware representation blending (DSRB) framework that blends multi-granularity category-specific semantic representation across different images, from an instance and prototype perspective, respectively, to transfer information of known labels to complement unknown labels. Specifically, an instance-perspective representation blending (IPRB) module is designed to blend the representations of the known labels in an image with the representations of the corresponding unknown labels in another image to complement these unknown labels. Meanwhile, a prototype-perspective representation blending (PPRB) module is introduced to learn more stable representation prototypes for each category and blends the representation of unknown labels with the prototypes of corresponding labels in a location-sensitive manner to complement these unknown labels. Extensive experiments on various datasets show that the proposed DSRB consistently outperforms current state-of-the-art algorithms on all known label proportion settings.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2024.123526