Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification

Bibliographic Details
Main Authors: Wall, Conor, Zhang, Li, Yu, Yonghong, Mistry, Kamlesh
Format: Conference Proceeding
Language: English
cited_by cdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403
cites
container_end_page 8
container_issue
container_start_page 1
container_title
container_volume
creator Wall, Conor
Zhang, Li
Yu, Yonghong
Mistry, Kamlesh
description In recent years, a variety of deep learning techniques have been adopted to provide AI solutions to problems in the medical field, one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of layer structures implemented for audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Unit (GRU) networks, in conjunction with an attention mechanism, are implemented for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e., the Respiratory Sound Database and the Coswara dataset, to evaluate the proposed model architectures for lung disease classification. The Respiratory Sound Database contains audio data for lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After comprehensive evaluation and experimentation, the proposed attention BiLSTM network (A-BiLSTM) is the most performant architecture, achieving accuracy rates of 96.2% and 96.8% on the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the BiLSTM and the attention mechanism were effective in improving audio classification performance across various lung condition diagnoses.
doi_str_mv 10.1109/IJCNN52387.2021.9533966
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_9533966</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9533966</ieee_id><sourcerecordid>9533966</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403</originalsourceid><addsrcrecordid>eNotkEtOwzAYhA0SEm3hBCzwBRJ-27EdL6MApagECcE6clxHNeRR2a6q3J4guhpp5ptZDEL3BFJCQD1sXsuq4pTlMqVASao4Y0qIC7QkQvCMKQB1iRaUCJJkGchrtAzhG4AypdgC1Y_WHvCHNUfv7RBxZY9ed7PE0-h_Aj65uMdFjHPmxgG_WbPXgwt9wO3o5144OK_j6CdcDGOvuwmXnQ7Btc7ov8YNump1F-ztWVfo6_nps3xJtu_rTVlsE8NAxIQ1uaECQO70DohqqWlMLmUmJVfCGC4booWi0sqGzgzjWhg-e2amLc2ArdDd_66z1tYH73rtp_p8BvsFUxxWWA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><source>IEEE Xplore All Conference Series</source><creator>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</creator><creatorcontrib>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</creatorcontrib><description>In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. 
the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses.</description><identifier>EISSN: 2161-4407</identifier><identifier>EISBN: 1665439009</identifier><identifier>EISBN: 9781665439008</identifier><identifier>DOI: 10.1109/IJCNN52387.2021.9533966</identifier><language>eng</language><publisher>IEEE</publisher><subject>attention mechanism ; audio classification ; bidirectional Recurrent Neural Network ; COVID ; COVID-19 ; Deep learning ; Evolutionary computation ; Long Short-Term Memory ; Lung ; lung disease ; Pulmonary diseases ; Recurrent neural networks</subject><ispartof>2021 International Joint Conference on Neural Networks (IJCNN), 2021, 
p.1-8</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9533966$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,23930,23931,25140,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9533966$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Wall, Conor</creatorcontrib><creatorcontrib>Zhang, Li</creatorcontrib><creatorcontrib>Yu, Yonghong</creatorcontrib><creatorcontrib>Mistry, Kamlesh</creatorcontrib><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><title>2021 International Joint Conference on Neural Networks (IJCNN)</title><addtitle>IJCNN</addtitle><description>In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. 
The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses.</description><subject>attention mechanism</subject><subject>audio classification</subject><subject>bidirectional Recurrent Neural Network</subject><subject>COVID</subject><subject>COVID-19</subject><subject>Deep learning</subject><subject>Evolutionary computation</subject><subject>Long Short-Term Memory</subject><subject>Lung</subject><subject>lung disease</subject><subject>Pulmonary diseases</subject><subject>Recurrent neural networks</subject><issn>2161-4407</issn><isbn>1665439009</isbn><isbn>9781665439008</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2021</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotkEtOwzAYhA0SEm3hBCzwBRJ-27EdL6MApagECcE6clxHNeRR2a6q3J4guhpp5ptZDEL3BFJCQD1sXsuq4pTlMqVASao4Y0qIC7QkQvCMKQB1iRaUCJJkGchrtAzhG4AypdgC1Y_WHvCHNUfv7RBxZY9ed7PE0-h_Aj65uMdFjHPmxgG_WbPXgwt9wO3o5144OK_j6CdcDGOvuwmXnQ7Btc7ov8YNump1F-ztWVfo6_nps3xJtu_rTVlsE8NAxIQ1uaECQO70DohqqWlMLmUmJVfCGC4booWi0sqGzgzjWhg-e2amLc2ArdDd_66z1tYH73rtp_p8BvsFUxxWWA</recordid><startdate>20210718</startdate><enddate>20210718</enddate><creator>Wall, Conor</creator><creator>Zhang, Li</creator><creator>Yu, Yonghong</creator><creator>Mistry, 
Kamlesh</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20210718</creationdate><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><author>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2021</creationdate><topic>attention mechanism</topic><topic>audio classification</topic><topic>bidirectional Recurrent Neural Network</topic><topic>COVID</topic><topic>COVID-19</topic><topic>Deep learning</topic><topic>Evolutionary computation</topic><topic>Long Short-Term Memory</topic><topic>Lung</topic><topic>lung disease</topic><topic>Pulmonary diseases</topic><topic>Recurrent neural networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Wall, Conor</creatorcontrib><creatorcontrib>Zhang, Li</creatorcontrib><creatorcontrib>Yu, Yonghong</creatorcontrib><creatorcontrib>Mistry, Kamlesh</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEL</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wall, Conor</au><au>Zhang, Li</au><au>Yu, Yonghong</au><au>Mistry, Kamlesh</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</atitle><btitle>2021 
International Joint Conference on Neural Networks (IJCNN)</btitle><stitle>IJCNN</stitle><date>2021-07-18</date><risdate>2021</risdate><spage>1</spage><epage>8</epage><pages>1-8</pages><eissn>2161-4407</eissn><eisbn>1665439009</eisbn><eisbn>9781665439008</eisbn><abstract>In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses.</abstract><pub>IEEE</pub><doi>10.1109/IJCNN52387.2021.9533966</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2161-4407
ispartof 2021 International Joint Conference on Neural Networks (IJCNN), 2021, p.1-8
issn 2161-4407
language eng
recordid cdi_ieee_primary_9533966
source IEEE Xplore All Conference Series
subjects attention mechanism
audio classification
bidirectional Recurrent Neural Network
COVID
COVID-19
Deep learning
Evolutionary computation
Long Short-Term Memory
Lung
lung disease
Pulmonary diseases
Recurrent neural networks
title Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T18%3A05%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Deep%20Recurrent%20Neural%20Networks%20with%20Attention%20Mechanisms%20for%20Respiratory%20Anomaly%20Classification&rft.btitle=2021%20International%20Joint%20Conference%20on%20Neural%20Networks%20(IJCNN)&rft.au=Wall,%20Conor&rft.date=2021-07-18&rft.spage=1&rft.epage=8&rft.pages=1-8&rft.eissn=2161-4407&rft_id=info:doi/10.1109/IJCNN52387.2021.9533966&rft.eisbn=1665439009&rft.eisbn_list=9781665439008&rft_dat=%3Cieee_CHZPO%3E9533966%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9533966&rfr_iscdi=true