Loading…

Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications

Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE transactions on industrial informatics 2024-12, Vol.20 (12), p.13676-13686
Main Authors:	Munsif, Muhammad, Khan, Noman, Hussain, Altaf, Kim, Min Je, Baik, Sung Wook
Format:	Article
Language:	English
Subjects:	Accuracy Action recognition (AR) Autonomous cars Computational modeling Computer architecture Computer vision Dark adaptation Darkness Feature extraction Heuristic algorithms Human activity recognition Industrial applications Industry applications infrared (IR) vision Infrared imaging Large scale integration large-scale optimization Modules Robotics State of the art
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites	cdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63
container_end_page	13686
container_issue	12
container_start_page	13676
container_title	IEEE transactions on industrial informatics
container_volume	20
creator	Munsif, Muhammad Khan, Noman Hussain, Altaf Kim, Min Je Baik, Sung Wook
description	Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.
doi_str_mv	10.1109/TII.2024.3431070
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3141617252</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10636303</ieee_id><sourcerecordid>3141617252</sourcerecordid><originalsourceid>FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</originalsourceid><addsrcrecordid>eNpNkLFOwzAURSMEEqWwMzBYYk6x49hx2CJooVIFEpQ5cpznym2Ig-204u9J1A5M7w7n3iedKLoleEYIzh_Wy-UswUk6oyklOMNn0YTkKYkxZvh8yIyRmCaYXkZX3m8xphmm-STqnqXbteB9XNSyC2YPqFDB2BZ9gLKb1oz5Ea1gD05uTLtBc62NMtAGtO4raCCgz8Ye4oX0Ab1BOFi3Q9o6tGzr3gdnZIOKrmuMkuOUv44utGw83JzuNPpazNdPr_Hq_WX5VKxiRTIW4rqikOYMKkG55FwpqXQlRMU1E6kQQoosraTkdS4oq2jOqpoDYAJag2SK02l0f9ztnP3pwYdya3vXDi9LSlLCSZawZKDwkVLOeu9Al50z39L9lgSXo9dy8FqOXsuT16Fyd6wYAPiHc8oppvQP6jt2Nw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3141617252</pqid></control><display><type>article</type><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><source>IEEE Xplore (Online service)</source><creator>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</creator><creatorcontrib>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</creatorcontrib><description>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</description><identifier>ISSN: 1551-3203</identifier><identifier>EISSN: 1941-0050</identifier><identifier>DOI: 10.1109/TII.2024.3431070</identifier><identifier>CODEN: ITIICH</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Accuracy ; Action recognition (AR) ; Autonomous cars ; Computational modeling ; Computer architecture ; Computer vision ; Dark adaptation ; Darkness ; Feature extraction ; Heuristic algorithms ; Human activity recognition ; Industrial applications ; Industry applications ; infrared (IR) vision ; Infrared imaging ; Large scale integration ; large-scale optimization ; Modules ; Robotics ; State of the art</subject><ispartof>IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</cites><orcidid>0000-0002-6678-7788 ; 0000-0001-9031-780X ; 0000-0001-7531-3827</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10636303$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,54775</link.rule.ids></links><search><creatorcontrib>Munsif, Muhammad</creatorcontrib><creatorcontrib>Khan, Noman</creatorcontrib><creatorcontrib>Hussain, Altaf</creatorcontrib><creatorcontrib>Kim, Min Je</creatorcontrib><creatorcontrib>Baik, Sung Wook</creatorcontrib><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><title>IEEE transactions on industrial informatics</title><addtitle>TII</addtitle><description>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</description><subject>Accuracy</subject><subject>Action recognition (AR)</subject><subject>Autonomous cars</subject><subject>Computational modeling</subject><subject>Computer architecture</subject><subject>Computer vision</subject><subject>Dark adaptation</subject><subject>Darkness</subject><subject>Feature extraction</subject><subject>Heuristic algorithms</subject><subject>Human activity recognition</subject><subject>Industrial applications</subject><subject>Industry applications</subject><subject>infrared (IR) vision</subject><subject>Infrared imaging</subject><subject>Large scale integration</subject><subject>large-scale optimization</subject><subject>Modules</subject><subject>Robotics</subject><subject>State of the art</subject><issn>1551-3203</issn><issn>1941-0050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkLFOwzAURSMEEqWwMzBYYk6x49hx2CJooVIFEpQ5cpznym2Ig-204u9J1A5M7w7n3iedKLoleEYIzh_Wy-UswUk6oyklOMNn0YTkKYkxZvh8yIyRmCaYXkZX3m8xphmm-STqnqXbteB9XNSyC2YPqFDB2BZ9gLKb1oz5Ea1gD05uTLtBc62NMtAGtO4raCCgz8Ye4oX0Ab1BOFi3Q9o6tGzr3gdnZIOKrmuMkuOUv44utGw83JzuNPpazNdPr_Hq_WX5VKxiRTIW4rqikOYMKkG55FwpqXQlRMU1E6kQQoosraTkdS4oq2jOqpoDYAJag2SK02l0f9ztnP3pwYdya3vXDi9LSlLCSZawZKDwkVLOeu9Al50z39L9lgSXo9dy8FqOXsuT16Fyd6wYAPiHc8oppvQP6jt2Nw</recordid><startdate>20241201</startdate><enddate>20241201</enddate><creator>Munsif, Muhammad</creator><creator>Khan, Noman</creator><creator>Hussain, Altaf</creator><creator>Kim, Min Je</creator><creator>Baik, Sung Wook</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6678-7788</orcidid><orcidid>https://orcid.org/0000-0001-9031-780X</orcidid><orcidid>https://orcid.org/0000-0001-7531-3827</orcidid></search><sort><creationdate>20241201</creationdate><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><author>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Action recognition (AR)</topic><topic>Autonomous cars</topic><topic>Computational modeling</topic><topic>Computer architecture</topic><topic>Computer vision</topic><topic>Dark adaptation</topic><topic>Darkness</topic><topic>Feature extraction</topic><topic>Heuristic algorithms</topic><topic>Human activity recognition</topic><topic>Industrial applications</topic><topic>Industry applications</topic><topic>infrared (IR) vision</topic><topic>Infrared imaging</topic><topic>Large scale integration</topic><topic>large-scale optimization</topic><topic>Modules</topic><topic>Robotics</topic><topic>State of the art</topic><toplevel>online_resources</toplevel><creatorcontrib>Munsif, Muhammad</creatorcontrib><creatorcontrib>Khan, Noman</creatorcontrib><creatorcontrib>Hussain, Altaf</creatorcontrib><creatorcontrib>Kim, Min Je</creatorcontrib><creatorcontrib>Baik, Sung Wook</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on industrial informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Munsif, Muhammad</au><au>Khan, Noman</au><au>Hussain, Altaf</au><au>Kim, Min Je</au><au>Baik, Sung Wook</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</atitle><jtitle>IEEE transactions on industrial informatics</jtitle><stitle>TII</stitle><date>2024-12-01</date><risdate>2024</risdate><volume>20</volume><issue>12</issue><spage>13676</spage><epage>13686</epage><pages>13676-13686</pages><issn>1551-3203</issn><eissn>1941-0050</eissn><coden>ITIICH</coden><abstract>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TII.2024.3431070</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-6678-7788</orcidid><orcidid>https://orcid.org/0000-0001-9031-780X</orcidid><orcidid>https://orcid.org/0000-0001-7531-3827</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1551-3203
ispartof	IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686
issn	1551-3203 1941-0050
language	eng
recordid	cdi_proquest_journals_3141617252
source	IEEE Xplore (Online service)
subjects	Accuracy Action recognition (AR) Autonomous cars Computational modeling Computer architecture Computer vision Dark adaptation Darkness Feature extraction Heuristic algorithms Human activity recognition Industrial applications Industry applications infrared (IR) vision Infrared imaging Large scale integration large-scale optimization Modules Robotics State of the art
title	Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T05%3A47%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Darkness-Adaptive%20Action%20Recognition:%20Leveraging%20Efficient%20Tubelet%20Slow-Fast%20Network%20for%20Industrial%20Applications&rft.jtitle=IEEE%20transactions%20on%20industrial%20informatics&rft.au=Munsif,%20Muhammad&rft.date=2024-12-01&rft.volume=20&rft.issue=12&rft.spage=13676&rft.epage=13686&rft.pages=13676-13686&rft.issn=1551-3203&rft.eissn=1941-0050&rft.coden=ITIICH&rft_id=info:doi/10.1109/TII.2024.3431070&rft_dat=%3Cproquest_cross%3E3141617252%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3141617252&rft_id=info:pmid/&rft_ieee_id=10636303&rfr_iscdi=true