Self-supervised on-line cumulative learning from video streams

Bibliographic Details
Published in: Computer Vision and Image Understanding, 2020-08, Vol. 197-198, p. 102983, Article 102983
Main Authors: Pernici, Federico; Bruni, Matteo; Del Bimbo, Alberto
Format: Article
Language: English
Description
Summary: We present a novel online self-supervised method for face identity learning from video streams. The method exploits deep face feature descriptors together with a memory-based learning mechanism that takes advantage of the temporal coherence of visual data. Specifically, we introduce a discriminative descriptor matching solution based on Reverse Nearest Neighbor and a memory-based cumulative learning strategy that discards redundant descriptors as time progresses. This allows building a comprehensive and cumulative representation of all the visual information observed so far. It is shown that the proposed learning procedure is asymptotically stable and can be effectively used in relevant applications like multiple face identification and tracking from unconstrained video streams. Experimental results show that, compared with offline approaches that exploit future information, the proposed method achieves comparable results in multiple face tracking and better performance in face identification.

• We propose online identity learning from unconstrained video streams
• Video streams are infinitely long, so preserving past knowledge is required
• We avoid deleting identities after a fixed number of frames has passed
• We selectively remove observed features based on temporal locality in feature space
• We address very long-term object re-acquisition in online MOT processing mode
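
The Reverse Nearest Neighbor matching mentioned in the summary can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `reverse_nn_match`, the Euclidean distance, the distance-ratio test, and the `ratio` threshold are all assumptions chosen for illustration. The idea shown is that each stored memory descriptor looks for its nearest observed descriptor, rather than each observation searching the memory.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def reverse_nn_match(memory, observations, ratio=0.8):
    """Hypothetical reverse nearest neighbor matcher (illustrative only).

    Every memory descriptor searches the current observations for its
    nearest and second-nearest neighbor. The match is kept only when the
    nearest one is clearly closer (assumed ratio test).
    Returns {observation_index: [memory_index, ...]}.
    """
    matches = {}
    if len(observations) < 2:
        return matches  # the ratio test needs at least two candidates
    for m_idx, m in enumerate(memory):
        # Sort observations by distance from this memory descriptor.
        dists = sorted(
            (euclidean(m, o), o_idx) for o_idx, o in enumerate(observations)
        )
        (d1, best), (d2, _) = dists[0], dists[1]
        if d1 < ratio * d2:
            matches.setdefault(best, []).append(m_idx)
    return matches


# Toy 2-D descriptors; real face descriptors are high-dimensional.
memory = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0)]
observations = [(0.05, 0.0), (5.1, 5.0), (2.5, 2.5)]
matches = reverse_nn_match(memory, observations)
print(matches)
```

With these toy values the first two memory descriptors match observation 0 and the third matches observation 1, while observation 2 receives no match. In a sketch like this, an unmatched observation would be a natural candidate for a new memory entry, and the memory pruning described in the summary would be a separate step not shown here.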
ISSN: 1077-3142, 1090-235X
DOI: 10.1016/j.cviu.2020.102983