Loading…

The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA

This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, thi...

Full description

Saved in:
Bibliographic Details
Published in:International journal of environmental research and public health 2022-02, Vol.19 (3), p.1858
Main Authors: Tsan, Yu-Tse, Chen, Der-Yuan, Liu, Po-Yu, Kristiani, Endah, Nguyen, Kieu Lan Phuong, Yang, Chao-Tung
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33
cites cdi_FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33
container_end_page
container_issue 3
container_start_page 1858
container_title International journal of environmental research and public health
container_volume 19
creator Tsan, Yu-Tse
Chen, Der-Yuan
Liu, Po-Yu
Kristiani, Endah
Nguyen, Kieu Lan Phuong
Yang, Chao-Tung
description This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, this paper applied the ARIMA method, which trained based on the weekly number of disease cases in time series. Second, we implemented the Long short-term memory (LSTM) method, which trained based on the correlation between the weekly number of diseases and air pollutants. The models were also trained and evaluated based on five and ten years of historical data. Autoregressive integrated moving average (ARIMA) has an excellent model in the five-year dataset of ILI at 2564.9 compared to ten years at 8173.6 of RMSE value. This accuracy is similar to the Respiratory dataset, which gets 15,656.7 in the five-year dataset and 22,680.4 of RMSE value in the ten-year dataset. On the contrary, LSTM has better accuracy in the ten-year dataset than the five-year dataset. For example, on average of RMSE in the ILI dataset, LSTM has 720.2 RMSE value in five years and 517.0 in ten years dataset. Also, in the Respiratory disease dataset, LSTM gets 4768.6 of five years of data and 3254.3 of the ten-year dataset. These experiments revealed that the LSTM model generally outperforms ARIMA by three to seven times higher model performance.
doi_str_mv 10.3390/ijerph19031858
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8835266</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2627535573</sourcerecordid><originalsourceid>FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33</originalsourceid><addsrcrecordid>eNpdkc1rGzEQxUVJqZ201x6LIJde1pVWK632UjBukxgcUlL7LLTaUSx3LTnSbsH567POF0lOM8z85vGGh9BXSiaMVeSH20DcrWlFGJVcfkBjKgTJCkHo0at-hI5T2hDCZCGqT2jEOBW5LKsxWi3XgP9EaJzpXPA4WDz3tu3B3-msdf8Az9vWQ0pY-wZfQ9q5qLsQ9_iXS6AT4FVy_gYv_i4vH5Dp9fxy-hl9tLpN8OWpnqDV2e_l7CJbXJ3PZ9NFZgoqu0w3ugZZmqYshABh68I2vBa2KUpdm5xQK6UAng9DXlnDyVB0ZVijq7qoNWMn6Oej7q6vt9AY8F3UrdpFt9Vxr4J26u3Gu7W6Cf-VlIznQgwC358EYrjtIXVq65KBttUeQp9ULvKKCMbKfEBP36Gb0Ec_vHegSs44Lw-OJo-UiSGlCPbFDCXqkJh6m9hw8O31Cy_4c0TsHhivk6I</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627535573</pqid></control><display><type>article</type><title>The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA</title><source>Publicly Available Content Database</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><source>Coronavirus Research Database</source><creator>Tsan, Yu-Tse ; Chen, Der-Yuan ; Liu, Po-Yu ; Kristiani, Endah ; Nguyen, Kieu Lan Phuong ; Yang, Chao-Tung</creator><creatorcontrib>Tsan, Yu-Tse ; Chen, Der-Yuan ; Liu, Po-Yu ; Kristiani, Endah ; Nguyen, Kieu Lan Phuong ; Yang, Chao-Tung</creatorcontrib><description>This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, this paper applied the ARIMA method, which trained based on the weekly number of disease cases in time series. Second, we implemented the Long short-term memory (LSTM) method, which trained based on the correlation between the weekly number of diseases and air pollutants. The models were also trained and evaluated based on five and ten years of historical data. Autoregressive integrated moving average (ARIMA) has an excellent model in the five-year dataset of ILI at 2564.9 compared to ten years at 8173.6 of RMSE value. This accuracy is similar to the Respiratory dataset, which gets 15,656.7 in the five-year dataset and 22,680.4 of RMSE value in the ten-year dataset. On the contrary, LSTM has better accuracy in the ten-year dataset than the five-year dataset. For example, on average of RMSE in the ILI dataset, LSTM has 720.2 RMSE value in five years and 517.0 in ten years dataset. Also, in the Respiratory disease dataset, LSTM gets 4768.6 of five years of data and 3254.3 of the ten-year dataset. These experiments revealed that the LSTM model generally outperforms ARIMA by three to seven times higher model performance.</description><identifier>ISSN: 1660-4601</identifier><identifier>ISSN: 1661-7827</identifier><identifier>EISSN: 1660-4601</identifier><identifier>DOI: 10.3390/ijerph19031858</identifier><identifier>PMID: 35162879</identifier><language>eng</language><publisher>Switzerland: MDPI AG</publisher><subject>Air Pollutants ; Air pollution ; Autoregressive models ; Datasets ; Disease control ; Environmental protection ; Forecasting ; Historical account ; Humans ; Illnesses ; Influenza, Human - epidemiology ; Long short-term memory ; Neural networks ; Outdoor air quality ; Pollutants ; Respiration Disorders ; Respiratory diseases ; Taiwan - epidemiology</subject><ispartof>International journal of environmental research and public health, 2022-02, Vol.19 (3), p.1858</ispartof><rights>2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 by the authors. 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33</citedby><cites>FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33</cites><orcidid>0000-0001-8006-4917 ; 0000-0003-1266-1423 ; 0000-0003-2925-2992 ; 0000-0001-9148-8811 ; 0000-0002-4487-0185 ; 0000-0002-9579-4426</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2627535573/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2627535573?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,725,778,782,883,25740,27911,27912,36999,37000,38503,43882,44577,53778,53780,74169,74883</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35162879$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Tsan, Yu-Tse</creatorcontrib><creatorcontrib>Chen, Der-Yuan</creatorcontrib><creatorcontrib>Liu, Po-Yu</creatorcontrib><creatorcontrib>Kristiani, Endah</creatorcontrib><creatorcontrib>Nguyen, Kieu Lan Phuong</creatorcontrib><creatorcontrib>Yang, Chao-Tung</creatorcontrib><title>The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA</title><title>International journal of environmental research and public health</title><addtitle>Int J Environ Res Public Health</addtitle><description>This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, this paper applied the ARIMA method, which trained based on the weekly number of disease cases in time series. Second, we implemented the Long short-term memory (LSTM) method, which trained based on the correlation between the weekly number of diseases and air pollutants. The models were also trained and evaluated based on five and ten years of historical data. Autoregressive integrated moving average (ARIMA) has an excellent model in the five-year dataset of ILI at 2564.9 compared to ten years at 8173.6 of RMSE value. This accuracy is similar to the Respiratory dataset, which gets 15,656.7 in the five-year dataset and 22,680.4 of RMSE value in the ten-year dataset. On the contrary, LSTM has better accuracy in the ten-year dataset than the five-year dataset. For example, on average of RMSE in the ILI dataset, LSTM has 720.2 RMSE value in five years and 517.0 in ten years dataset. Also, in the Respiratory disease dataset, LSTM gets 4768.6 of five years of data and 3254.3 of the ten-year dataset. These experiments revealed that the LSTM model generally outperforms ARIMA by three to seven times higher model performance.</description><subject>Air Pollutants</subject><subject>Air pollution</subject><subject>Autoregressive models</subject><subject>Datasets</subject><subject>Disease control</subject><subject>Environmental protection</subject><subject>Forecasting</subject><subject>Historical account</subject><subject>Humans</subject><subject>Illnesses</subject><subject>Influenza, Human - epidemiology</subject><subject>Long short-term memory</subject><subject>Neural networks</subject><subject>Outdoor air quality</subject><subject>Pollutants</subject><subject>Respiration Disorders</subject><subject>Respiratory diseases</subject><subject>Taiwan - epidemiology</subject><issn>1660-4601</issn><issn>1661-7827</issn><issn>1660-4601</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><recordid>eNpdkc1rGzEQxUVJqZ201x6LIJde1pVWK632UjBukxgcUlL7LLTaUSx3LTnSbsH567POF0lOM8z85vGGh9BXSiaMVeSH20DcrWlFGJVcfkBjKgTJCkHo0at-hI5T2hDCZCGqT2jEOBW5LKsxWi3XgP9EaJzpXPA4WDz3tu3B3-msdf8Az9vWQ0pY-wZfQ9q5qLsQ9_iXS6AT4FVy_gYv_i4vH5Dp9fxy-hl9tLpN8OWpnqDV2e_l7CJbXJ3PZ9NFZgoqu0w3ugZZmqYshABh68I2vBa2KUpdm5xQK6UAng9DXlnDyVB0ZVijq7qoNWMn6Oej7q6vt9AY8F3UrdpFt9Vxr4J26u3Gu7W6Cf-VlIznQgwC358EYrjtIXVq65KBttUeQp9ULvKKCMbKfEBP36Gb0Ec_vHegSs44Lw-OJo-UiSGlCPbFDCXqkJh6m9hw8O31Cy_4c0TsHhivk6I</recordid><startdate>20220207</startdate><enddate>20220207</enddate><creator>Tsan, Yu-Tse</creator><creator>Chen, Der-Yuan</creator><creator>Liu, Po-Yu</creator><creator>Kristiani, Endah</creator><creator>Nguyen, Kieu Lan Phuong</creator><creator>Yang, Chao-Tung</creator><general>MDPI AG</general><general>MDPI</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8C1</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>K9.</scope><scope>M0S</scope><scope>M1P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-8006-4917</orcidid><orcidid>https://orcid.org/0000-0003-1266-1423</orcidid><orcidid>https://orcid.org/0000-0003-2925-2992</orcidid><orcidid>https://orcid.org/0000-0001-9148-8811</orcidid><orcidid>https://orcid.org/0000-0002-4487-0185</orcidid><orcidid>https://orcid.org/0000-0002-9579-4426</orcidid></search><sort><creationdate>20220207</creationdate><title>The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA</title><author>Tsan, Yu-Tse ; Chen, Der-Yuan ; Liu, Po-Yu ; Kristiani, Endah ; Nguyen, Kieu Lan Phuong ; Yang, Chao-Tung</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Air Pollutants</topic><topic>Air pollution</topic><topic>Autoregressive models</topic><topic>Datasets</topic><topic>Disease control</topic><topic>Environmental protection</topic><topic>Forecasting</topic><topic>Historical account</topic><topic>Humans</topic><topic>Illnesses</topic><topic>Influenza, Human - epidemiology</topic><topic>Long short-term memory</topic><topic>Neural networks</topic><topic>Outdoor air quality</topic><topic>Pollutants</topic><topic>Respiration Disorders</topic><topic>Respiratory diseases</topic><topic>Taiwan - epidemiology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tsan, Yu-Tse</creatorcontrib><creatorcontrib>Chen, Der-Yuan</creatorcontrib><creatorcontrib>Liu, Po-Yu</creatorcontrib><creatorcontrib>Kristiani, Endah</creatorcontrib><creatorcontrib>Nguyen, Kieu Lan Phuong</creatorcontrib><creatorcontrib>Yang, Chao-Tung</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Public Health Database</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Central</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>International journal of environmental research and public health</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tsan, Yu-Tse</au><au>Chen, Der-Yuan</au><au>Liu, Po-Yu</au><au>Kristiani, Endah</au><au>Nguyen, Kieu Lan Phuong</au><au>Yang, Chao-Tung</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA</atitle><jtitle>International journal of environmental research and public health</jtitle><addtitle>Int J Environ Res Public Health</addtitle><date>2022-02-07</date><risdate>2022</risdate><volume>19</volume><issue>3</issue><spage>1858</spage><pages>1858-</pages><issn>1660-4601</issn><issn>1661-7827</issn><eissn>1660-4601</eissn><abstract>This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, this paper applied the ARIMA method, which trained based on the weekly number of disease cases in time series. Second, we implemented the Long short-term memory (LSTM) method, which trained based on the correlation between the weekly number of diseases and air pollutants. The models were also trained and evaluated based on five and ten years of historical data. Autoregressive integrated moving average (ARIMA) has an excellent model in the five-year dataset of ILI at 2564.9 compared to ten years at 8173.6 of RMSE value. This accuracy is similar to the Respiratory dataset, which gets 15,656.7 in the five-year dataset and 22,680.4 of RMSE value in the ten-year dataset. On the contrary, LSTM has better accuracy in the ten-year dataset than the five-year dataset. For example, on average of RMSE in the ILI dataset, LSTM has 720.2 RMSE value in five years and 517.0 in ten years dataset. Also, in the Respiratory disease dataset, LSTM gets 4768.6 of five years of data and 3254.3 of the ten-year dataset. These experiments revealed that the LSTM model generally outperforms ARIMA by three to seven times higher model performance.</abstract><cop>Switzerland</cop><pub>MDPI AG</pub><pmid>35162879</pmid><doi>10.3390/ijerph19031858</doi><orcidid>https://orcid.org/0000-0001-8006-4917</orcidid><orcidid>https://orcid.org/0000-0003-1266-1423</orcidid><orcidid>https://orcid.org/0000-0003-2925-2992</orcidid><orcidid>https://orcid.org/0000-0001-9148-8811</orcidid><orcidid>https://orcid.org/0000-0002-4487-0185</orcidid><orcidid>https://orcid.org/0000-0002-9579-4426</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1660-4601
ispartof International journal of environmental research and public health, 2022-02, Vol.19 (3), p.1858
issn 1660-4601
1661-7827
1660-4601
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8835266
source Publicly Available Content Database; PubMed Central; Free Full-Text Journals in Chemistry; Coronavirus Research Database
subjects Air Pollutants
Air pollution
Autoregressive models
Datasets
Disease control
Environmental protection
Forecasting
Historical account
Humans
Illnesses
Influenza, Human - epidemiology
Long short-term memory
Neural networks
Outdoor air quality
Pollutants
Respiration Disorders
Respiratory diseases
Taiwan - epidemiology
title The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T09%3A38%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Prediction%20of%20Influenza-like%20Illness%20and%20Respiratory%20Disease%20Using%20LSTM%20and%20ARIMA&rft.jtitle=International%20journal%20of%20environmental%20research%20and%20public%20health&rft.au=Tsan,%20Yu-Tse&rft.date=2022-02-07&rft.volume=19&rft.issue=3&rft.spage=1858&rft.pages=1858-&rft.issn=1660-4601&rft.eissn=1660-4601&rft_id=info:doi/10.3390/ijerph19031858&rft_dat=%3Cproquest_pubme%3E2627535573%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c418t-adabe87cd7466e6fb4fd5b6fd47abc201f886e52d5b59fc50b59a9c3da9b4ba33%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2627535573&rft_id=info:pmid/35162879&rfr_iscdi=true