Loading…
Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications
Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-...
Saved in:
Published in: | IEEE transactions on industrial informatics 2024-12, Vol.20 (12), p.13676-13686 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63 |
container_end_page | 13686 |
container_issue | 12 |
container_start_page | 13676 |
container_title | IEEE transactions on industrial informatics |
container_volume | 20 |
creator | Munsif, Muhammad Khan, Noman Hussain, Altaf Kim, Min Je Baik, Sung Wook |
description | Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications. |
doi_str_mv | 10.1109/TII.2024.3431070 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3141617252</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10636303</ieee_id><sourcerecordid>3141617252</sourcerecordid><originalsourceid>FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</originalsourceid><addsrcrecordid>eNpNkLFOwzAURSMEEqWwMzBYYk6x49hx2CJooVIFEpQ5cpznym2Ig-204u9J1A5M7w7n3iedKLoleEYIzh_Wy-UswUk6oyklOMNn0YTkKYkxZvh8yIyRmCaYXkZX3m8xphmm-STqnqXbteB9XNSyC2YPqFDB2BZ9gLKb1oz5Ea1gD05uTLtBc62NMtAGtO4raCCgz8Ye4oX0Ab1BOFi3Q9o6tGzr3gdnZIOKrmuMkuOUv44utGw83JzuNPpazNdPr_Hq_WX5VKxiRTIW4rqikOYMKkG55FwpqXQlRMU1E6kQQoosraTkdS4oq2jOqpoDYAJag2SK02l0f9ztnP3pwYdya3vXDi9LSlLCSZawZKDwkVLOeu9Al50z39L9lgSXo9dy8FqOXsuT16Fyd6wYAPiHc8oppvQP6jt2Nw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3141617252</pqid></control><display><type>article</type><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><source>IEEE Xplore (Online service)</source><creator>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</creator><creatorcontrib>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</creatorcontrib><description>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</description><identifier>ISSN: 1551-3203</identifier><identifier>EISSN: 1941-0050</identifier><identifier>DOI: 10.1109/TII.2024.3431070</identifier><identifier>CODEN: ITIICH</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Accuracy ; Action recognition (AR) ; Autonomous cars ; Computational modeling ; Computer architecture ; Computer vision ; Dark adaptation ; Darkness ; Feature extraction ; Heuristic algorithms ; Human activity recognition ; Industrial applications ; Industry applications ; infrared (IR) vision ; Infrared imaging ; Large scale integration ; large-scale optimization ; Modules ; Robotics ; State of the art</subject><ispartof>IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</cites><orcidid>0000-0002-6678-7788 ; 0000-0001-9031-780X ; 0000-0001-7531-3827</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10636303$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,54775</link.rule.ids></links><search><creatorcontrib>Munsif, Muhammad</creatorcontrib><creatorcontrib>Khan, Noman</creatorcontrib><creatorcontrib>Hussain, Altaf</creatorcontrib><creatorcontrib>Kim, Min Je</creatorcontrib><creatorcontrib>Baik, Sung Wook</creatorcontrib><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><title>IEEE transactions on industrial informatics</title><addtitle>TII</addtitle><description>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</description><subject>Accuracy</subject><subject>Action recognition (AR)</subject><subject>Autonomous cars</subject><subject>Computational modeling</subject><subject>Computer architecture</subject><subject>Computer vision</subject><subject>Dark adaptation</subject><subject>Darkness</subject><subject>Feature extraction</subject><subject>Heuristic algorithms</subject><subject>Human activity recognition</subject><subject>Industrial applications</subject><subject>Industry applications</subject><subject>infrared (IR) vision</subject><subject>Infrared imaging</subject><subject>Large scale integration</subject><subject>large-scale optimization</subject><subject>Modules</subject><subject>Robotics</subject><subject>State of the art</subject><issn>1551-3203</issn><issn>1941-0050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkLFOwzAURSMEEqWwMzBYYk6x49hx2CJooVIFEpQ5cpznym2Ig-204u9J1A5M7w7n3iedKLoleEYIzh_Wy-UswUk6oyklOMNn0YTkKYkxZvh8yIyRmCaYXkZX3m8xphmm-STqnqXbteB9XNSyC2YPqFDB2BZ9gLKb1oz5Ea1gD05uTLtBc62NMtAGtO4raCCgz8Ye4oX0Ab1BOFi3Q9o6tGzr3gdnZIOKrmuMkuOUv44utGw83JzuNPpazNdPr_Hq_WX5VKxiRTIW4rqikOYMKkG55FwpqXQlRMU1E6kQQoosraTkdS4oq2jOqpoDYAJag2SK02l0f9ztnP3pwYdya3vXDi9LSlLCSZawZKDwkVLOeu9Al50z39L9lgSXo9dy8FqOXsuT16Fyd6wYAPiHc8oppvQP6jt2Nw</recordid><startdate>20241201</startdate><enddate>20241201</enddate><creator>Munsif, Muhammad</creator><creator>Khan, Noman</creator><creator>Hussain, Altaf</creator><creator>Kim, Min Je</creator><creator>Baik, Sung Wook</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6678-7788</orcidid><orcidid>https://orcid.org/0000-0001-9031-780X</orcidid><orcidid>https://orcid.org/0000-0001-7531-3827</orcidid></search><sort><creationdate>20241201</creationdate><title>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</title><author>Munsif, Muhammad ; Khan, Noman ; Hussain, Altaf ; Kim, Min Je ; Baik, Sung Wook</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Action recognition (AR)</topic><topic>Autonomous cars</topic><topic>Computational modeling</topic><topic>Computer architecture</topic><topic>Computer vision</topic><topic>Dark adaptation</topic><topic>Darkness</topic><topic>Feature extraction</topic><topic>Heuristic algorithms</topic><topic>Human activity recognition</topic><topic>Industrial applications</topic><topic>Industry applications</topic><topic>infrared (IR) vision</topic><topic>Infrared imaging</topic><topic>Large scale integration</topic><topic>large-scale optimization</topic><topic>Modules</topic><topic>Robotics</topic><topic>State of the art</topic><toplevel>online_resources</toplevel><creatorcontrib>Munsif, Muhammad</creatorcontrib><creatorcontrib>Khan, Noman</creatorcontrib><creatorcontrib>Hussain, Altaf</creatorcontrib><creatorcontrib>Kim, Min Je</creatorcontrib><creatorcontrib>Baik, Sung Wook</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on industrial informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Munsif, Muhammad</au><au>Khan, Noman</au><au>Hussain, Altaf</au><au>Kim, Min Je</au><au>Baik, Sung Wook</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications</atitle><jtitle>IEEE transactions on industrial informatics</jtitle><stitle>TII</stitle><date>2024-12-01</date><risdate>2024</risdate><volume>20</volume><issue>12</issue><spage>13676</spage><epage>13686</epage><pages>13676-13686</pages><issn>1551-3203</issn><eissn>1941-0050</eissn><coden>ITIICH</coden><abstract>Infrared (IR) technology has emerged as a solution for monitoring dark environments. It offers resilience to shifting illumination, appearance changes, and shadows, with applications spanning self-driving cars, robotics, nighttime security, and many other fields. While existing state-of-the-art RGB-based human action recognition (AR) models exhibit limitations in scalability for action understanding under uncertain, low-light, or dark conditions. Integrating these with IR data faces challenges due to changes in modality, high resource demands, and strict latency requirements. Such issues hinder the deployment of these technologies in real-world settings. To overcome these challenges, we introduce a novel slow-fast tubelet (SFT) processing framework designed for efficient and accurate AR in IR-based scenarios. The SFT framework comprises three modules: tubelet preprocessing (TPP), feature extraction, and the feature lateral connection and recognition module (FELCM). The TPP module refines IR streams by extracting the region of interest, filtering detected objects, removing noise, and generating tubelets of refined frames. The FELCM processes refined tubelet through two pathways, where the fast tubelet path operates at a high rate and the slow tubelet path operates at a slow rate. These pathways interconnect through lateral connections, facilitating mutual updates, and enhancing the prediction efficiency. We conducted extensive experiments on benchmark datasets, including NTURGB-D 120 and infrared action recognition (InfAR). The results demonstrate that our proposed SFT framework surpasses state-of-the-art approaches in terms of accuracy (2.7% and 3.3% improvement, respectively), computational cost, and inference latency while maintaining the competitive recognition performance. Our framework's promising results underscore its potential for direct deployment in real-world applications.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TII.2024.3431070</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-6678-7788</orcidid><orcidid>https://orcid.org/0000-0001-9031-780X</orcidid><orcidid>https://orcid.org/0000-0001-7531-3827</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1551-3203 |
ispartof | IEEE transactions on industrial informatics, 2024-12, Vol.20 (12), p.13676-13686 |
issn | 1551-3203 1941-0050 |
language | eng |
recordid | cdi_proquest_journals_3141617252 |
source | IEEE Xplore (Online service) |
subjects | Accuracy Action recognition (AR) Autonomous cars Computational modeling Computer architecture Computer vision Dark adaptation Darkness Feature extraction Heuristic algorithms Human activity recognition Industrial applications Industry applications infrared (IR) vision Infrared imaging Large scale integration large-scale optimization Modules Robotics State of the art |
title | Darkness-Adaptive Action Recognition: Leveraging Efficient Tubelet Slow-Fast Network for Industrial Applications |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T05%3A47%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Darkness-Adaptive%20Action%20Recognition:%20Leveraging%20Efficient%20Tubelet%20Slow-Fast%20Network%20for%20Industrial%20Applications&rft.jtitle=IEEE%20transactions%20on%20industrial%20informatics&rft.au=Munsif,%20Muhammad&rft.date=2024-12-01&rft.volume=20&rft.issue=12&rft.spage=13676&rft.epage=13686&rft.pages=13676-13686&rft.issn=1551-3203&rft.eissn=1941-0050&rft.coden=ITIICH&rft_id=info:doi/10.1109/TII.2024.3431070&rft_dat=%3Cproquest_cross%3E3141617252%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c175t-db3e495eb836a66ccacfb88b6f584888a874baa6d9835b395bd6ee01effea5c63%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3141617252&rft_id=info:pmid/&rft_ieee_id=10636303&rfr_iscdi=true |