Loading…
Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification
In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpo...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Citations: | Items that cite this one |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403 |
---|---|
cites | |
container_end_page | 8 |
container_issue | |
container_start_page | 1 |
container_title | |
container_volume | |
creator | Wall, Conor Zhang, Li Yu, Yonghong Mistry, Kamlesh |
description | In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses. |
doi_str_mv | 10.1109/IJCNN52387.2021.9533966 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_9533966</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9533966</ieee_id><sourcerecordid>9533966</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403</originalsourceid><addsrcrecordid>eNotkEtOwzAYhA0SEm3hBCzwBRJ-27EdL6MApagECcE6clxHNeRR2a6q3J4guhpp5ptZDEL3BFJCQD1sXsuq4pTlMqVASao4Y0qIC7QkQvCMKQB1iRaUCJJkGchrtAzhG4AypdgC1Y_WHvCHNUfv7RBxZY9ed7PE0-h_Aj65uMdFjHPmxgG_WbPXgwt9wO3o5144OK_j6CdcDGOvuwmXnQ7Btc7ov8YNump1F-ztWVfo6_nps3xJtu_rTVlsE8NAxIQ1uaECQO70DohqqWlMLmUmJVfCGC4booWi0sqGzgzjWhg-e2amLc2ArdDd_66z1tYH73rtp_p8BvsFUxxWWA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><source>IEEE Xplore All Conference Series</source><creator>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</creator><creatorcontrib>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</creatorcontrib><description>In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses.</description><identifier>EISSN: 2161-4407</identifier><identifier>EISBN: 1665439009</identifier><identifier>EISBN: 9781665439008</identifier><identifier>DOI: 10.1109/IJCNN52387.2021.9533966</identifier><language>eng</language><publisher>IEEE</publisher><subject>attention mechanism ; audio classification ; bidirectional Recurrent Neural Network ; COVID ; COVID-19 ; Deep learning ; Evolutionary computation ; Long Short-Term Memory ; Lung ; lung disease ; Pulmonary diseases ; Recurrent neural networks</subject><ispartof>2021 International Joint Conference on Neural Networks (IJCNN), 2021, p.1-8</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9533966$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,23930,23931,25140,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9533966$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Wall, Conor</creatorcontrib><creatorcontrib>Zhang, Li</creatorcontrib><creatorcontrib>Yu, Yonghong</creatorcontrib><creatorcontrib>Mistry, Kamlesh</creatorcontrib><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><title>2021 International Joint Conference on Neural Networks (IJCNN)</title><addtitle>IJCNN</addtitle><description>In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses.</description><subject>attention mechanism</subject><subject>audio classification</subject><subject>bidirectional Recurrent Neural Network</subject><subject>COVID</subject><subject>COVID-19</subject><subject>Deep learning</subject><subject>Evolutionary computation</subject><subject>Long Short-Term Memory</subject><subject>Lung</subject><subject>lung disease</subject><subject>Pulmonary diseases</subject><subject>Recurrent neural networks</subject><issn>2161-4407</issn><isbn>1665439009</isbn><isbn>9781665439008</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2021</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotkEtOwzAYhA0SEm3hBCzwBRJ-27EdL6MApagECcE6clxHNeRR2a6q3J4guhpp5ptZDEL3BFJCQD1sXsuq4pTlMqVASao4Y0qIC7QkQvCMKQB1iRaUCJJkGchrtAzhG4AypdgC1Y_WHvCHNUfv7RBxZY9ed7PE0-h_Aj65uMdFjHPmxgG_WbPXgwt9wO3o5144OK_j6CdcDGOvuwmXnQ7Btc7ov8YNump1F-ztWVfo6_nps3xJtu_rTVlsE8NAxIQ1uaECQO70DohqqWlMLmUmJVfCGC4booWi0sqGzgzjWhg-e2amLc2ArdDd_66z1tYH73rtp_p8BvsFUxxWWA</recordid><startdate>20210718</startdate><enddate>20210718</enddate><creator>Wall, Conor</creator><creator>Zhang, Li</creator><creator>Yu, Yonghong</creator><creator>Mistry, Kamlesh</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20210718</creationdate><title>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</title><author>Wall, Conor ; Zhang, Li ; Yu, Yonghong ; Mistry, Kamlesh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2021</creationdate><topic>attention mechanism</topic><topic>audio classification</topic><topic>bidirectional Recurrent Neural Network</topic><topic>COVID</topic><topic>COVID-19</topic><topic>Deep learning</topic><topic>Evolutionary computation</topic><topic>Long Short-Term Memory</topic><topic>Lung</topic><topic>lung disease</topic><topic>Pulmonary diseases</topic><topic>Recurrent neural networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Wall, Conor</creatorcontrib><creatorcontrib>Zhang, Li</creatorcontrib><creatorcontrib>Yu, Yonghong</creatorcontrib><creatorcontrib>Mistry, Kamlesh</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEL</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wall, Conor</au><au>Zhang, Li</au><au>Yu, Yonghong</au><au>Mistry, Kamlesh</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification</atitle><btitle>2021 International Joint Conference on Neural Networks (IJCNN)</btitle><stitle>IJCNN</stitle><date>2021-07-18</date><risdate>2021</risdate><spage>1</spage><epage>8</epage><pages>1-8</pages><eissn>2161-4407</eissn><eisbn>1665439009</eisbn><eisbn>9781665439008</eisbn><abstract>In recent years, a variety of deep learning techniques and methods have been adopted to provide AI solutions to issues within the medical field, with one specific area being audio-based classification of medical datasets. This research aims to create a novel deep learning architecture for this purpose, with a variety of different layer structures implemented for undertaking audio classification. Specifically, bidirectional Long Short-Term Memory (BiLSTM) and Gated Recurrent Units (GRU) networks in conjunction with an attention mechanism, are implemented in this research for chronic and non-chronic lung disease and COVID-19 diagnosis. We employ two audio datasets, i.e. the Respiratory Sound and the Coswara datasets, to evaluate the proposed model architectures pertaining to lung disease classification. The Respiratory Sound Database contains audio data with respect to lung conditions such as Chronic Obstructive Pulmonary Disease (COPD) and asthma, while the Coswara dataset contains coughing audio samples associated with COVID-19. After a comprehensive evaluation and experimentation process, as the most performant architecture, the proposed attention BiLSTM network (A-BiLSTM) achieves accuracy rates of 96.2% and 96.8% for the Respiratory Sound and the Coswara datasets, respectively. Our research indicates that the implementation of the BiLSTM and attention mechanism was effective in improving performance for undertaking audio classification with respect to various lung condition diagnoses.</abstract><pub>IEEE</pub><doi>10.1109/IJCNN52387.2021.9533966</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | EISSN: 2161-4407 |
ispartof | 2021 International Joint Conference on Neural Networks (IJCNN), 2021, p.1-8 |
issn | 2161-4407 |
language | eng |
recordid | cdi_ieee_primary_9533966 |
source | IEEE Xplore All Conference Series |
subjects | attention mechanism audio classification bidirectional Recurrent Neural Network COVID COVID-19 Deep learning Evolutionary computation Long Short-Term Memory Lung lung disease Pulmonary diseases Recurrent neural networks |
title | Deep Recurrent Neural Networks with Attention Mechanisms for Respiratory Anomaly Classification |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T18%3A05%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Deep%20Recurrent%20Neural%20Networks%20with%20Attention%20Mechanisms%20for%20Respiratory%20Anomaly%20Classification&rft.btitle=2021%20International%20Joint%20Conference%20on%20Neural%20Networks%20(IJCNN)&rft.au=Wall,%20Conor&rft.date=2021-07-18&rft.spage=1&rft.epage=8&rft.pages=1-8&rft.eissn=2161-4407&rft_id=info:doi/10.1109/IJCNN52387.2021.9533966&rft.eisbn=1665439009&rft.eisbn_list=9781665439008&rft_dat=%3Cieee_CHZPO%3E9533966%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c306t-3b8c26007dad019f2cbc877477596cc57b1a6927e7b27da35a6c5b1acd01e2403%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9533966&rfr_iscdi=true |