
Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification

In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpo...


Bibliographic Details
Main Authors:	Wall, Conor; Zhang, Li; Yu, Yonghong; Mistry, Kamlesh
Format:	Conference Proceeding
Language:	English
Subjects:	attention mechanism; audio classification; bidirectional Recurrent Neural Network; COVID; COVID-19; Deep learning; Evolutionary computation; Long Short-Term Memory; Lung; lung disease; Pulmonary diseases; Recurrent neural networks

cited_by	cdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403
cites
container_end_page	8
container_issue
container_start_page	1
container_title
container_volume
creator	Wall, Conor; Zhang, Li; Yu, Yonghong; Mistry, Kamlesh
description	In recent years, a variety of deep learning techniques have been adopted to provide AI solutions to problems in the medical field, one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of layer structures implemented for audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Unit (GRU) networks, in conjunction with an attention mechanism, are implemented for the diagnosis of chronic and non-chronic lung disease and COVID-19. We employ two audio datasets, the Respiratory Sound Database and the Coswara dataset, to evaluate the proposed model architectures for lung disease classification. The Respiratory Sound Database contains audio data for lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After comprehensive evaluation and experimentation, the best-performing architecture, the proposed attention BiLSTM network (A-BiLSTM), achieves accuracy rates of 96.2% and 96.8% on the Respiratory Sound and Coswara datasets, respectively. Our research indicates that combining the BiLSTM with an attention mechanism was effective in improving audio classification performance across various lung condition diagnoses.
doi_str_mv	10.1109/IJCNN52387.2021.9533966
format	conference_proceeding
fullrecord	<record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_9533966</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9533966</ieee_id><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><source>IEEE Xplore All Conference Series</source><creator>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</creator><identifier>EISSN: 2161-4407</identifier><identifier>EISBN: 1665439009</identifier><identifier>EISBN: 9781665439008</identifier><identifier>DOI: 10.1109/IJCNN52387.2021.9533966</identifier><language>eng</language><publisher>IEEE</publisher><ispartof>2021 International Joint Conference on Neural Networks (IJCNN), 2021, p.1-8</ispartof><oa>free_for_read</oa></display><addata><genre>proceeding</genre><ristype>CONF</ristype><btitle>2021 International Joint Conference on Neural Networks (IJCNN)</btitle><stitle>IJCNN</stitle><date>2021-07-18</date><spage>1</spage><epage>8</epage><pages>1-8</pages><tpages>8</tpages><pub>IEEE</pub></addata></record>
fulltext	fulltext_linktorsrc
identifier	EISSN: 2161-4407
ispartof	2021 International Joint Conference on Neural Networks (IJCNN), 2021, p.1-8
issn	2161-4407
language	eng
recordid	cdi_ieee_primary_9533966
source	IEEE Xplore All Conference Series
subjects	attention mechanism; audio classification; bidirectional Recurrent Neural Network; COVID; COVID-19; Deep learning; Evolutionary computation; Long Short-Term Memory; Lung; lung disease; Pulmonary diseases; Recurrent neural networks
title	Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T18%3A05%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Deep%20Recurrent%20Neural%20Networks%20with%20Attention%20Mechanisms%20for%20Respiratory%20Anomaly%20Classification&rft.btitle=2021%20International%20Joint%20Conference%20on%20Neural%20Networks%20(IJCNN)&rft.au=Wall,%20Conor&rft.date=2021-07-18&rft.spage=1&rft.epage=8&rft.pages=1-8&rft.eissn=2161-4407&rft_id=info:doi/10.1109/IJCNN52387.2021.9533966&rft.eisbn=1665439009&rft.eisbn_list=9781665439008&rft_dat=%3Cieee_CHZPO%3E9533966%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9533966&rfr_iscdi=true