Heterogeneous Feature Fusion for Improving Performance of Action Detection

Bibliographic Details
Published in: Journal of physics. Conference series 2024-05, Vol.2759 (1), p.012001
Main Authors: Babazaki, Yasunori; Iwamoto, Kota; Takahashi, Katsuhiko; Li, Kai; Patel, Deep; Kruus, Erik; Graf, Hans Peter
Format: Article
Language: English
Description
Summary: We present a novel framework for improving video action detection by integrating heterogeneous features. Conventional action detection methods, which focus on modeling the relationships between person/object instances, rely exclusively on video features and do not exploit valuable intra-instance heterogeneous features, such as person pose, positional information, or object category, that can support action recognition. Our proposed Heterogeneous Feature Fusion (HFF) framework addresses this limitation by integrating such intra-instance heterogeneous features for person/object instances, and it can be used to improve existing action detection methods. To efficiently exploit the heterogeneous features, whose importance varies with the action and/or scene, we introduce an attention mechanism that dynamically enhances the important heterogeneous features within an instance. Experiments on the JHMDB and AVA v2.2 datasets show that HFF significantly enhances the action detection performance of two existing methods.
ISSN: 1742-6588, 1742-6596
DOI: 10.1088/1742-6596/2759/1/012001
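
Illustration: The Summary describes an attention mechanism that weights the heterogeneous features of a single instance by their importance. The sketch below shows one generic way such a fusion could look; it is not the authors' implementation. It assumes that every feature (video, pose, and so on) has already been projected to a common dimension, and the function name fuse_features and the per-feature scoring vectors are hypothetical stand-ins for learned parameters.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_features(features, score_weights):
    """Attention-weighted sum of the heterogeneous features of one instance.

    features:      dict mapping a feature name to a list of floats; all lists
                   share the same length (assumed common projection).
    score_weights: dict mapping the same names to hypothetical scoring
                   vectors of that length (stand-ins for learned weights).
    Returns the fused vector and the attention weight given to each feature.
    """
    names = sorted(features)
    # One scalar relevance score per feature type (dot product with its scorer).
    scores = [
        sum(w * x for w, x in zip(score_weights[name], features[name]))
        for name in names
    ]
    attention = softmax(scores)
    dim = len(features[names[0]])
    # Weighted sum across feature types, one output dimension at a time.
    fused = [
        sum(a * features[name][i] for a, name in zip(attention, names))
        for i in range(dim)
    ]
    return fused, dict(zip(names, attention))


if __name__ == "__main__":
    instance = {"video": [1.0, 0.0], "pose": [0.0, 1.0]}
    scorers = {"video": [2.0, 0.0], "pose": [0.0, 0.0]}
    fused, attention = fuse_features(instance, scorers)
    print(attention)
    print(fused)
```

In this toy call the video feature receives the larger attention weight because its score is higher, so the fused vector leans toward it. A real model would compute the scores from learned layers and could condition them on the scene.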