Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications

Bibliographic Details
Published in: IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686
Main Authors: Munsif, Muhammad; Khan, Noman; Hussain, Altaf; Kim, Min Je; Baik, Sung Wook
Format: Article
Language: English
cited_by
cites cdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63
container_end_page 13686
container_issue 12
container_start_page 13676
container_title IEEE transactions on industrial informatics
container_volume 20
creator Munsif, Muhammad
Khan, Noman
Hussain, Altaf
Kim, Min Je
Baik, Sung Wook
description Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. Existing state-of-the-art RGB-based human action recognition (AR) models, however, exhibit limited scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these models with IR data is challenging because of the change in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes the refined tubelets through two pathways: the fast tubelet path operates at a high rate, and the slow tubelet path operates at a low rate. These pathways are interconnected through lateral connections, which facilitate mutual updates and enhance prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTU RGB+D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.
doi_str_mv 10.1109/TII.2024.3431070
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3141617252</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10636303</ieee_id><sourcerecordid>3141617252</sourcerecordid><originalsourceid>FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</originalsourceid><addsrcrecordid>eNpNkLFOwzAURSMEEqWwMzBYYk6x49hx2CJooVIFEpQ5cpznym2Ig-204u9J1A5M7w7n3iedKLoleEYIzh_Wy-UswUk6oyklOMNn0YTkKYkxZvh8yIyRmCaYXkZX3m8xphmm-STqnqXbteB9XNSyC2YPqFDB2BZ9gLKb1oz5Ea1gD05uTLtBc62NMtAGtO4raCCgz8Ye4oX0Ab1BOFi3Q9o6tGzr3gdnZIOKrmuMkuOUv44utGw83JzuNPpazNdPr_Hq_WX5VKxiRTIW4rqikOYMKkG55FwpqXQlRMU1E6kQQoosraTkdS4oq2jOqpoDYAJag2SK02l0f9ztnP3pwYdya3vXDi9LSlLCSZawZKDwkVLOeu9Al50z39L9lgSXo9dy8FqOXsuT16Fyd6wYAPiHc8oppvQP6jt2Nw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3141617252</pqid></control><display><type>article</type><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><source>IEEE Xplore (Online service)</source><creator>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</creator><creatorcontrib>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</creatorcontrib><description>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. 
Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. 
Our framework's promising results underscore its potential for direct deployment in real-world applications.</description><identifier>ISSN: 1551-3203</identifier><identifier>EISSN: 1941-0050</identifier><identifier>DOI: 10.1109/TII.2024.3431070</identifier><identifier>CODEN: ITIICH</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Accuracy ; Action recognition (AR) ; Autonomous cars ; Computational modeling ; Computer architecture ; Computer vision ; Dark adaptation ; Darkness ; Feature extraction ; Heuristic algorithms ; Human activity recognition ; Industrial applications ; Industry applications ; infrared (IR) vision ; Infrared imaging ; Large scale integration ; large-scale optimization ; Modules ; Robotics ; State of the art</subject><ispartof>IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</cites><orcidid>0000-0002-6678-7788 ; 0000-0001-9031-780X ; 0000-0001-7531-3827</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10636303$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,54775</link.rule.ids></links><search><creatorcontrib>Munsif, Muhammad</creatorcontrib><creatorcontrib>Khan, Noman</creatorcontrib><creatorcontrib>Hussain, Altaf</creatorcontrib><creatorcontrib>Kim, Min Je</creatorcontrib><creatorcontrib>Baik, Sung Wook</creatorcontrib><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><title>IEEE transactions on industrial 
informatics</title><addtitle>TII</addtitle><description>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. 
Our framework's promising results underscore its potential for direct deployment in real-world applications.</description><subject>Accuracy</subject><subject>Action recognition (AR)</subject><subject>Autonomous cars</subject><subject>Computational modeling</subject><subject>Computer architecture</subject><subject>Computer vision</subject><subject>Dark adaptation</subject><subject>Darkness</subject><subject>Feature extraction</subject><subject>Heuristic algorithms</subject><subject>Human activity recognition</subject><subject>Industrial applications</subject><subject>Industry applications</subject><subject>infrared (IR) vision</subject><subject>Infrared imaging</subject><subject>Large scale integration</subject><subject>large-scale optimization</subject><subject>Modules</subject><subject>Robotics</subject><subject>State of the art</subject><issn>1551-3203</issn><issn>1941-0050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkLFOwzAURSMEEqWwMzBYYk6x49hx2CJooVIFEpQ5cpznym2Ig-204u9J1A5M7w7n3iedKLoleEYIzh_Wy-UswUk6oyklOMNn0YTkKYkxZvh8yIyRmCaYXkZX3m8xphmm-STqnqXbteB9XNSyC2YPqFDB2BZ9gLKb1oz5Ea1gD05uTLtBc62NMtAGtO4raCCgz8Ye4oX0Ab1BOFi3Q9o6tGzr3gdnZIOKrmuMkuOUv44utGw83JzuNPpazNdPr_Hq_WX5VKxiRTIW4rqikOYMKkG55FwpqXQlRMU1E6kQQoosraTkdS4oq2jOqpoDYAJag2SK02l0f9ztnP3pwYdya3vXDi9LSlLCSZawZKDwkVLOeu9Al50z39L9lgSXo9dy8FqOXsuT16Fyd6wYAPiHc8oppvQP6jt2Nw</recordid><startdate>20241201</startdate><enddate>20241201</enddate><creator>Munsif, Muhammad</creator><creator>Khan, Noman</creator><creator>Hussain, Altaf</creator><creator>Kim, Min Je</creator><creator>Baik, Sung Wook</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6678-7788</orcidid><orcidid>https://orcid.org/0000-0001-9031-780X</orcidid><orcidid>https://orcid.org/0000-0001-7531-3827</orcidid></search><sort><creationdate>20241201</creationdate><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><author>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Action recognition (AR)</topic><topic>Autonomous cars</topic><topic>Computational modeling</topic><topic>Computer architecture</topic><topic>Computer vision</topic><topic>Dark adaptation</topic><topic>Darkness</topic><topic>Feature extraction</topic><topic>Heuristic algorithms</topic><topic>Human activity recognition</topic><topic>Industrial applications</topic><topic>Industry applications</topic><topic>infrared (IR) vision</topic><topic>Infrared imaging</topic><topic>Large scale integration</topic><topic>large-scale optimization</topic><topic>Modules</topic><topic>Robotics</topic><topic>State of the art</topic><toplevel>online_resources</toplevel><creatorcontrib>Munsif, Muhammad</creatorcontrib><creatorcontrib>Khan, Noman</creatorcontrib><creatorcontrib>Hussain, Altaf</creatorcontrib><creatorcontrib>Kim, Min Je</creatorcontrib><creatorcontrib>Baik, Sung Wook</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package 
(ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on industrial informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Munsif, Muhammad</au><au>Khan, Noman</au><au>Hussain, Altaf</au><au>Kim, Min Je</au><au>Baik, Sung Wook</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</atitle><jtitle>IEEE transactions on industrial informatics</jtitle><stitle>TII</stitle><date>2024-12-01</date><risdate>2024</risdate><volume>20</volume><issue>12</issue><spage>13676</spage><epage>13686</epage><pages>13676-13686</pages><issn>1551-3203</issn><eissn>1941-0050</eissn><coden>ITIICH</coden><abstract>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. 
Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TII.2024.3431070</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-6678-7788</orcidid><orcidid>https://orcid.org/0000-0001-9031-780X</orcidid><orcidid>https://orcid.org/0000-0001-7531-3827</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1551-3203
ispartof IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686
issn 1551-3203
1941-0050
language eng
recordid cdi_proquest_journals_3141617252
source IEEE Xplore (Online service)
subjects Accuracy
Action recognition (AR)
Autonomous cars
Computational modeling
Computer architecture
Computer vision
Dark adaptation
Darkness
Feature extraction
Heuristic algorithms
Human activity recognition
Industrial applications
Industry applications
infrared (IR) vision
Infrared imaging
Large scale integration
large-scale optimization
Modules
Robotics
State of the art
title Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T05%3A47%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Darkness-Adaptive%20Action%20Recognition:%20Leveraging%20Efficient%20Tubelet%20Slow-Fast%20Network%20for%20Industrial%20Applications&rft.jtitle=IEEE%20transactions%20on%20industrial%20informatics&rft.au=Munsif,%20Muhammad&rft.date=2024-12-01&rft.volume=20&rft.issue=12&rft.spage=13676&rft.epage=13686&rft.pages=13676-13686&rft.issn=1551-3203&rft.eissn=1941-0050&rft.coden=ITIICH&rft_id=info:doi/10.1109/TII.2024.3431070&rft_dat=%3Cproquest_cross%3E3141617252%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3141617252&rft_id=info:pmid/&rft_ieee_id=10636303&rfr_iscdi=true