Loading…

Get to the Point: Content Classification of Animated Graphics Interchange Formats with Key-Frame Attention

Animated Graphics Interchange Formats (GIFS) are low-bandwidth short image sequences that can continuously display multiple frames without sound. In this paper, we focus on a new content classification task that is important in real-world applications. A key problem for this task is that some frames...

Full description

Saved in:
Bibliographic Details
Main Authors: Ma, Yongjuan, Wang, Yu, Zhu, Pengfei, Pan, Junwen, Shi, Hong
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Animated Graphics Interchange Formats (GIFS) are low-bandwidth short image sequences that can continuously display multiple frames without sound. In this paper, we focus on a new content classification task that is important in real-world applications. A key problem for this task is that some frames in an animated GF are irrelevant to the label, which may drastically reduce the classification performance. To this end, we first collect a new dataset of Web animated GIFS (WGF) that includes some typical samples in which only several key-frames are relevant to the ground truth. Then, an attention-based method is designed to learn to produce importance scores of the frames, and subsequently multi-frame predicted scores are merged to obtain the final prediction. Besides, an additional entropy loss is also used to sharpen the attention results to further emphasize the key-frames. Experimental results on WGF show that the proposed approach significantly outperforms various baseline methods.
ISSN:2381-8549
DOI:10.1109/ICIP42928.2021.9506230