Adversarial Graph Convolutional Network for Cross-Modal Retrieval
Published in: IEEE Transactions on Circuits and Systems for Video Technology, 2022-03, Vol. 32 (3), pp. 1634-1645
Main Authors: , , , ,
Format: Article
Language: English
Summary: The completeness of semantic expression plays an important role in cross-modal retrieval, since it helps align cross-modal data and thus narrow the modality gap. However, because semantics are abstract, the same topic may be well described from several different aspects, so a single sample may express its semantics only incompletely. To obtain complementary semantic information and to strengthen the shared information among samples with the same semantics, we use a graph convolutional network (GCN) to reconstruct each sample's representation from the adjacency relationships between the sample and its neighborhood. We construct a local graph for each instance and propose a novel Graph Feature Generator, built from a GCN and a fully connected network, which reconstructs node features from the local graph and maps the features of the two modalities into a common space. The Graph Feature Generator and a Graph Feature Discriminator play a minimax game to produce modality-invariant graph feature representations. Experiments on three benchmark datasets demonstrate the superiority of the proposed model over several state-of-the-art methods.
ISSN: 1051-8215; 1558-2205
DOI: 10.1109/TCSVT.2021.3075242
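
To make the architecture described in the summary more concrete, below is a minimal PyTorch sketch of how a GCN-based Graph Feature Generator and a Graph Feature Discriminator of this kind could be structured. Everything here is an assumption made for illustration: the layer sizes, the symmetric adjacency normalization, and the names `normalize_adj`, `GraphFeatureGenerator`, and `GraphFeatureDiscriminator` come only from the abstract-level description, not from the paper's actual implementation.

```python
# Hedged sketch: a GCN-based generator that reconstructs node features from a
# local graph and maps them into a common space, plus a discriminator that
# tries to tell which modality a common-space feature came from. All sizes and
# names are hypothetical; the paper's real architecture may differ.
import torch
import torch.nn as nn
import torch.nn.functional as F


def normalize_adj(adj: torch.Tensor) -> torch.Tensor:
    """Symmetrically normalize an adjacency matrix with self-loops:
    D^{-1/2} (A + I) D^{-1/2}."""
    a_hat = adj + torch.eye(adj.size(0), device=adj.device)
    deg_inv_sqrt = a_hat.sum(dim=1).clamp(min=1e-12).pow(-0.5)
    return deg_inv_sqrt.unsqueeze(1) * a_hat * deg_inv_sqrt.unsqueeze(0)


class GraphFeatureGenerator(nn.Module):
    """Two GCN layers followed by a fully connected projection into a
    shared (common) embedding space."""

    def __init__(self, in_dim: int, hid_dim: int, common_dim: int):
        super().__init__()
        self.gcn1 = nn.Linear(in_dim, hid_dim, bias=False)
        self.gcn2 = nn.Linear(hid_dim, hid_dim, bias=False)
        self.project = nn.Linear(hid_dim, common_dim)

    def forward(self, x: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        a_norm = normalize_adj(adj)
        h = F.relu(a_norm @ self.gcn1(x))   # aggregate neighborhood information
        h = F.relu(a_norm @ self.gcn2(h))   # second round of aggregation
        return self.project(h)              # map into the common space


class GraphFeatureDiscriminator(nn.Module):
    """Small MLP that predicts the modality (image vs. text) of a
    common-space feature; the generators are trained to fool it."""

    def __init__(self, common_dim: int, hid_dim: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(common_dim, hid_dim), nn.ReLU(),
            nn.Linear(hid_dim, 1),
        )

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        return self.net(z)  # logit: positive = image-like, negative = text-like


if __name__ == "__main__":
    # Toy local graph with 5 nodes: one instance plus 4 neighbors.
    x_img = torch.randn(5, 4096)            # e.g. CNN image features
    x_txt = torch.randn(5, 300)             # e.g. bag-of-words text features
    adj = (torch.rand(5, 5) > 0.5).float()
    adj = ((adj + adj.t()) > 0).float()     # make the graph undirected

    gen_img = GraphFeatureGenerator(4096, 512, 256)
    gen_txt = GraphFeatureGenerator(300, 512, 256)
    disc = GraphFeatureDiscriminator(256)

    z_img, z_txt = gen_img(x_img, adj), gen_txt(x_txt, adj)
    # Discriminator loss: distinguish the two modalities. The generators would
    # minimize the opposite objective, i.e. the minimax game from the summary.
    d_loss = F.binary_cross_entropy_with_logits(
        disc(z_img), torch.ones(5, 1)
    ) + F.binary_cross_entropy_with_logits(disc(z_txt), torch.zeros(5, 1))
    print(z_img.shape, z_txt.shape, float(d_loss))
```

In this sketch each modality gets its own generator while a single discriminator tries to identify the source modality of a common-space feature; training the generators to fool it is one plausible reading of the minimax strategy mentioned in the summary.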