Loading…

Denoising cosine similarity: A theory-driven approach for efficient representation learning

Representation learning has been increasing its impact on the research and practice of machine learning, since it enables to learn representations that can apply to various downstream tasks efficiently. However, recent works pay little attention to the fact that real-world datasets used during the s...

Full description

Saved in:
Bibliographic Details
Published in:Neural networks 2024-01, Vol.169, p.226-241
Main Authors: Nakagawa, Takumi, Sanada, Yutaro, Waida, Hiroki, Zhang, Yuhui, Wada, Yuichiro, Takanashi, Kōsaku, Yamada, Tomonori, Kanamori, Takafumi
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Representation learning has been increasing its impact on the research and practice of machine learning, since it enables to learn representations that can apply to various downstream tasks efficiently. However, recent works pay little attention to the fact that real-world datasets used during the stage of representation learning are commonly contaminated by noise, which can degrade the quality of learned representations. This paper tackles the problem to learn robust representations against noise in a raw dataset. To this end, inspired by recent works on denoising and the success of the cosine-similarity-based objective functions in representation learning, we propose the denoising Cosine-Similarity (dCS) loss. The dCS loss is a modified cosine-similarity loss and incorporates a denoising property, which is supported by both our theoretical and empirical findings. To make the dCS loss implementable, we also construct the estimators of the dCS loss with statistical guarantees. Finally, we empirically show the efficiency of the dCS loss over the baseline objective functions in vision and speech domains. •A modified cosine-similarity loss with a denoising property is proposed.•The denoising property of the cosine similarity loss is theoretically investigated.•An estimator of the modified loss is introduced with statistical guarantees.•The quality enhancement of representations learned by the modified loss is observed.
ISSN:0893-6080
1879-2782
1879-2782
DOI:10.1016/j.neunet.2023.10.027