Sequential inter-hop graph convolution neural network (SIhGCN) for skeleton-based human action recognition
• A graph convolution model for skeleton-based action recognition is proposed.
• Normalized Laplacian Matrix is utilized to encode the graph information.
• An attention-based feature aggregation is proposed to extract the salient features.
• The proposed method achieves better results than the baseline mod...
Published in: | Expert systems with applications 2022-06, Vol.195, p.116566, Article 116566 |
---|---|
Main Authors: | Setiawan, Feri; Yahya, Bernardo Nugroho; Chun, Seok-Ju; Lee, Seok-Lyong |
Format: | Article |
Language: | English |
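Subjects: | Action recognition; Artificial neural networks; Attention mechanism; Color imagery; Feature aggregation; Graph convolutional neural network; Human activity recognition; Human motion; Invariants; Neural networks; Normalized Laplacian matrix |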
cited_by | cdi_FETCH-LOGICAL-c328t-d4469e8a92de56971822d34a4c6060d4b7778aa750f0f9262ba721e9a6abe2803 |
---|---|
cites | cdi_FETCH-LOGICAL-c328t-d4469e8a92de56971822d34a4c6060d4b7778aa750f0f9262ba721e9a6abe2803 |
container_end_page | |
container_issue | |
container_start_page | 116566 |
container_title | Expert systems with applications |
container_volume | 195 |
creator | Setiawan, Feri; Yahya, Bernardo Nugroho; Chun, Seok-Ju; Lee, Seok-Lyong |
description | • A graph convolution model for skeleton-based action recognition is proposed. • The normalized Laplacian matrix is used to encode the graph information. • An attention-based feature aggregation is proposed to extract the salient features. • The proposed method achieves better results than the baseline models.
Skeleton-based human action recognition has attracted considerable attention because skeleton data can provide more information than a sequence of RGB images alone. Graph Convolutional Neural Networks (GCNs) have become increasingly popular for this task because they model the human skeleton well. However, existing GCN architectures ignore the differing importance of each hop during feature aggregation and pass only the final-hop information to later computation, which causes considerable information loss. In addition, they encode the graph as a set of vectors using the standard Laplacian or adjacency matrix, which is limited with respect to graph invariants. In this work, we propose a Sequential Inter-hop Graph Convolution Neural Network (SIhGCN) that captures salient graph information from every hop rather than from the final hop only. It uses the normalized Laplacian matrix, which provides a better representation because it relates well to graph invariants. The proposed method is validated on two large datasets, NTU-RGB+D and Kinetics, to demonstrate its superiority. |
doi_str_mv | 10.1016/j.eswa.2022.116566 |
format | article |
publisher | New York: Elsevier Ltd |
rights | 2022 Elsevier Ltd; Copyright Elsevier BV Jun 1, 2022 |
date | 2022-06-01 |
toplevel | peer_reviewed; online_resources |
els_id | S0957417422000641 |
collection | CrossRef; Computer and Information Systems Abstracts; Technology Research Database; ProQuest Computer Science Collection; Advanced Technologies Database with Aerospace; Computer and Information Systems Abstracts Academic; Computer and Information Systems Abstracts Professional |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2022-06, Vol.195, p.116566, Article 116566 |
issn | 0957-4174 (ISSN); 1873-6793 (EISSN) |
language | eng |
recordid | cdi_proquest_journals_2647397260 |
source | ScienceDirect Freedom Collection |
subjects | Action recognition; Artificial neural networks; Attention mechanism; Color imagery; Feature aggregation; Graph convolutional neural network; Human activity recognition; Human motion; Invariants; Neural networks; Normalized Laplacian matrix |
title | Sequential inter-hop graph convolution neural network (SIhGCN) for skeleton-based human action recognition |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T14%3A45%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sequential%20inter-hop%20graph%20convolution%20neural%20network%20(SIhGCN)%20for%20skeleton-based%20human%20action%20recognition&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Setiawan,%20Feri&rft.date=2022-06-01&rft.volume=195&rft.spage=116566&rft.pages=116566-&rft.artnum=116566&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2022.116566&rft_dat=%3Cproquest_cross%3E2647397260%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c328t-d4469e8a92de56971822d34a4c6060d4b7778aa750f0f9262ba721e9a6abe2803%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2647397260&rft_id=info:pmid/&rfr_iscdi=true |
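
The abstract in this record names two ingredients, a normalized Laplacian matrix and an attention-weighted aggregation over every hop, without giving formulas. The sketch below is only an illustration of those two ideas under stated assumptions, not the authors' SIhGCN implementation: it assumes the symmetric normalized Laplacian L = I - D^{-1/2} A D^{-1/2}, and it stands in for the attention module with a softmax over one hypothetical score per hop. The function names, the `hop_scores` parameter, and the five-joint toy skeleton are all made up for the example.

```python
import numpy as np


def normalized_laplacian(adj):
    """Symmetric normalized Laplacian L = I - D^{-1/2} A D^{-1/2}."""
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)
    # Leave the scaling at 0 for isolated nodes to avoid dividing by zero.
    inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    inv_sqrt[nonzero] = deg[nonzero] ** -0.5
    norm_adj = inv_sqrt[:, None] * adj * inv_sqrt[None, :]
    return np.eye(adj.shape[0]) - norm_adj


def aggregate_hops(adj, feats, hop_scores):
    """Softmax-weighted sum of the features obtained at every hop.

    hop_scores is a hypothetical stand-in for learned attention scores,
    one per hop; the number of hops is len(hop_scores).
    """
    # The propagation operator is the normalized adjacency, I - L.
    norm_adj = np.eye(len(feats)) - normalized_laplacian(adj)
    scores = np.asarray(hop_scores, dtype=float)
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    hop_feats = np.asarray(feats, dtype=float)
    out = np.zeros_like(hop_feats)
    for w in weights:
        hop_feats = norm_adj @ hop_feats  # move one hop further out
        out += w * hop_feats              # keep every hop, not just the last
    return out


if __name__ == "__main__":
    # Toy skeleton: five joints connected in a chain (assumption for the demo).
    chain = np.diag(np.ones(4), k=1)
    skeleton = chain + chain.T
    joints = np.random.default_rng(0).normal(size=(5, 3))  # 3-D joint features
    print(aggregate_hops(skeleton, joints, hop_scores=[0.2, 0.1, -0.3]))
```

With equal scores the result is the plain average of the per-hop features; unequal scores let some hops contribute more than others, which is the behaviour the abstract contrasts with using the final hop only.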