Loading…

A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers

COVID-19 pandemic has become a global major public health concern. Examining the meteorological risk factors and accurately predicting the incidence of the COVID-19 pandemic is an extremely important challenge. Therefore, in this study, we analyzed the relationship between meteorological factors and...

Full description

Saved in:
Bibliographic Details
Published in:PloS one 2022-09, Vol.17 (9), p.e0273319
Main Authors: Rahman, Md Siddikur, Chowdhury, Arman Hossain
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073
cites cdi_FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073
container_end_page
container_issue 9
container_start_page e0273319
container_title PloS one
container_volume 17
creator Rahman, Md Siddikur
Chowdhury, Arman Hossain
description COVID-19 pandemic has become a global major public health concern. Examining the meteorological risk factors and accurately predicting the incidence of the COVID-19 pandemic is an extremely important challenge. Therefore, in this study, we analyzed the relationship between meteorological factors and COVID-19 transmission in SAARC countries. We also compared the predictive accuracy of Autoregressive Integrated Moving Average (ARIMAX) and eXtreme Gradient Boosting (XGBoost) methods for precise modelling of COVID-19 incidence. We compiled a daily dataset including confirmed COVID-19 case counts, minimum and maximum temperature (°C), relative humidity (%), surface pressure (kPa), precipitation (mm/day) and maximum wind speed (m/s) from the onset of the disease to January 29, 2022, in each country. The data were divided into training and test sets. The training data were used to fit ARIMAX model for examining significant meteorological risk factors. All significant factors were then used as covariates in ARIMAX and XGBoost models to predict the COVID-19 confirmed cases. We found that maximum temperature had a positive impact on the COVID-19 transmission in Afghanistan (β = 11.91, 95% CI: 4.77, 19.05) and India (β = 0.18, 95% CI: 0.01, 0.35). Surface pressure had a positive influence in Pakistan (β = 25.77, 95% CI: 7.85, 43.69) and Sri Lanka (β = 411.63, 95% CI: 49.04, 774.23). We also found that the XGBoost model can help improve prediction of COVID-19 cases in SAARC countries over the ARIMAX model. The study findings will help the scientific communities and policymakers to establish a more accurate early warning system to control the spread of the pandemic.
doi_str_mv 10.1371/journal.pone.0273319
format article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2714022939</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A717391443</galeid><doaj_id>oai_doaj_org_article_ffcacf8529804f4898334e6e5ed6768c</doaj_id><sourcerecordid>A717391443</sourcerecordid><originalsourceid>FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073</originalsourceid><addsrcrecordid>eNqNk12LEzEUhgdR3LX6D0QDguhF6ySZyUxuhLJ-FRYKuizehTQ5M03JTGqSWRX_vGk7u7SyFzIXmZw85z3Jm5wse47zGaYVfrdxg--lnW1dD7OcVJRi_iA7x5ySKSM5fXj0f5Y9CWGT5yWtGXucnVGWc05Kep79mSMto5xqb26gR_A9eugAtV5qA31EK-dCNH2LOqnWpgdkQfp-H3AaLIoObT1ooyK6WF4vPkwxR9HLPnQmBON69NPENeoggvPOutYoadG-mA9Ps0eNtAGejeMku_r08eriy_Ry-XlxMb-cKsZJnBLCapClLjTHhSY1XpWMl1hitiIFpUVZ1WmxTNMm1xUrODDFVqyUUHGSV3SSvTzIbq0LYrQtCFLhIieEU56IxYHQTm7E1ptO-t_CSSP2AedbIX00yoJoGiVVU5eE13nRFDWv0w6AQQmaVaxWSev9WG1YdaBVMtFLeyJ6utKbtWjdjeAF47zKk8CbUcC7HwOEKJKVCqyVPbhh3DcjrMQJffUPev_pRqqV6QCmb1yqq3aiYl7hiiZXk42TbHYPlT4NnVHpjTUmxU8S3p4kJCbCr9jKIQSx-Pb1_9nl9Sn7-ohdg7RxHZwdYnpN4RQsDqDyLgQPzZ3JOBe7Frl1Q-xaRIwtktJeHF_QXdJtT9C_HtMLhQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2714022939</pqid></control><display><type>article</type><title>A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><source>PubMed Central</source><creator>Rahman, Md Siddikur ; Chowdhury, Arman Hossain</creator><contributor>Singh, Randeep</contributor><creatorcontrib>Rahman, Md Siddikur ; Chowdhury, Arman Hossain ; Singh, Randeep</creatorcontrib><description>COVID-19 pandemic has become a global major public health concern. Examining the meteorological risk factors and accurately predicting the incidence of the COVID-19 pandemic is an extremely important challenge. Therefore, in this study, we analyzed the relationship between meteorological factors and COVID-19 transmission in SAARC countries. We also compared the predictive accuracy of Autoregressive Integrated Moving Average (ARIMAX) and eXtreme Gradient Boosting (XGBoost) methods for precise modelling of COVID-19 incidence. We compiled a daily dataset including confirmed COVID-19 case counts, minimum and maximum temperature (°C), relative humidity (%), surface pressure (kPa), precipitation (mm/day) and maximum wind speed (m/s) from the onset of the disease to January 29, 2022, in each country. The data were divided into training and test sets. The training data were used to fit ARIMAX model for examining significant meteorological risk factors. All significant factors were then used as covariates in ARIMAX and XGBoost models to predict the COVID-19 confirmed cases. We found that maximum temperature had a positive impact on the COVID-19 transmission in Afghanistan (β = 11.91, 95% CI: 4.77, 19.05) and India (β = 0.18, 95% CI: 0.01, 0.35). Surface pressure had a positive influence in Pakistan (β = 25.77, 95% CI: 7.85, 43.69) and Sri Lanka (β = 411.63, 95% CI: 49.04, 774.23). We also found that the XGBoost model can help improve prediction of COVID-19 cases in SAARC countries over the ARIMAX model. The study findings will help the scientific communities and policymakers to establish a more accurate early warning system to control the spread of the pandemic.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0273319</identifier><identifier>PMID: 36099253</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Accuracy ; Analysis ; Computer and Information Sciences ; Coronaviruses ; COVID-19 ; COVID-19 - epidemiology ; Disease control ; Disease transmission ; Early warning systems ; Earth Sciences ; Fatalities ; Forecasting ; Generalized linear models ; Humans ; Humidity ; Machine Learning ; Maximum temperatures ; Maximum winds ; Medicine and Health Sciences ; Meteorological Concepts ; Meteorology ; Modelling ; Pandemics ; People and places ; Physical Sciences ; Precipitation ; Pressure ; Public health ; Relative humidity ; Research and Analysis Methods ; Respiratory diseases ; Risk analysis ; Risk factors ; Severe acute respiratory syndrome coronavirus 2 ; Surface pressure ; Time series ; Training ; Variables ; Wind speed</subject><ispartof>PloS one, 2022-09, Vol.17 (9), p.e0273319</ispartof><rights>COPYRIGHT 2022 Public Library of Science</rights><rights>2022 Rahman, Chowdhury. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 Rahman, Chowdhury 2022 Rahman, Chowdhury</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073</citedby><cites>FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073</cites><orcidid>0000-0001-8925-6544 ; 0000-0003-1498-287X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2714022939/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2714022939?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36099253$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Singh, Randeep</contributor><creatorcontrib>Rahman, Md Siddikur</creatorcontrib><creatorcontrib>Chowdhury, Arman Hossain</creatorcontrib><title>A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>COVID-19 pandemic has become a global major public health concern. Examining the meteorological risk factors and accurately predicting the incidence of the COVID-19 pandemic is an extremely important challenge. Therefore, in this study, we analyzed the relationship between meteorological factors and COVID-19 transmission in SAARC countries. We also compared the predictive accuracy of Autoregressive Integrated Moving Average (ARIMAX) and eXtreme Gradient Boosting (XGBoost) methods for precise modelling of COVID-19 incidence. We compiled a daily dataset including confirmed COVID-19 case counts, minimum and maximum temperature (°C), relative humidity (%), surface pressure (kPa), precipitation (mm/day) and maximum wind speed (m/s) from the onset of the disease to January 29, 2022, in each country. The data were divided into training and test sets. The training data were used to fit ARIMAX model for examining significant meteorological risk factors. All significant factors were then used as covariates in ARIMAX and XGBoost models to predict the COVID-19 confirmed cases. We found that maximum temperature had a positive impact on the COVID-19 transmission in Afghanistan (β = 11.91, 95% CI: 4.77, 19.05) and India (β = 0.18, 95% CI: 0.01, 0.35). Surface pressure had a positive influence in Pakistan (β = 25.77, 95% CI: 7.85, 43.69) and Sri Lanka (β = 411.63, 95% CI: 49.04, 774.23). We also found that the XGBoost model can help improve prediction of COVID-19 cases in SAARC countries over the ARIMAX model. The study findings will help the scientific communities and policymakers to establish a more accurate early warning system to control the spread of the pandemic.</description><subject>Accuracy</subject><subject>Analysis</subject><subject>Computer and Information Sciences</subject><subject>Coronaviruses</subject><subject>COVID-19</subject><subject>COVID-19 - epidemiology</subject><subject>Disease control</subject><subject>Disease transmission</subject><subject>Early warning systems</subject><subject>Earth Sciences</subject><subject>Fatalities</subject><subject>Forecasting</subject><subject>Generalized linear models</subject><subject>Humans</subject><subject>Humidity</subject><subject>Machine Learning</subject><subject>Maximum temperatures</subject><subject>Maximum winds</subject><subject>Medicine and Health Sciences</subject><subject>Meteorological Concepts</subject><subject>Meteorology</subject><subject>Modelling</subject><subject>Pandemics</subject><subject>People and places</subject><subject>Physical Sciences</subject><subject>Precipitation</subject><subject>Pressure</subject><subject>Public health</subject><subject>Relative humidity</subject><subject>Research and Analysis Methods</subject><subject>Respiratory diseases</subject><subject>Risk analysis</subject><subject>Risk factors</subject><subject>Severe acute respiratory syndrome coronavirus 2</subject><subject>Surface pressure</subject><subject>Time series</subject><subject>Training</subject><subject>Variables</subject><subject>Wind speed</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk12LEzEUhgdR3LX6D0QDguhF6ySZyUxuhLJ-FRYKuizehTQ5M03JTGqSWRX_vGk7u7SyFzIXmZw85z3Jm5wse47zGaYVfrdxg--lnW1dD7OcVJRi_iA7x5ySKSM5fXj0f5Y9CWGT5yWtGXucnVGWc05Kep79mSMto5xqb26gR_A9eugAtV5qA31EK-dCNH2LOqnWpgdkQfp-H3AaLIoObT1ooyK6WF4vPkwxR9HLPnQmBON69NPENeoggvPOutYoadG-mA9Ps0eNtAGejeMku_r08eriy_Ry-XlxMb-cKsZJnBLCapClLjTHhSY1XpWMl1hitiIFpUVZ1WmxTNMm1xUrODDFVqyUUHGSV3SSvTzIbq0LYrQtCFLhIieEU56IxYHQTm7E1ptO-t_CSSP2AedbIX00yoJoGiVVU5eE13nRFDWv0w6AQQmaVaxWSev9WG1YdaBVMtFLeyJ6utKbtWjdjeAF47zKk8CbUcC7HwOEKJKVCqyVPbhh3DcjrMQJffUPev_pRqqV6QCmb1yqq3aiYl7hiiZXk42TbHYPlT4NnVHpjTUmxU8S3p4kJCbCr9jKIQSx-Pb1_9nl9Sn7-ohdg7RxHZwdYnpN4RQsDqDyLgQPzZ3JOBe7Frl1Q-xaRIwtktJeHF_QXdJtT9C_HtMLhQ</recordid><startdate>20220913</startdate><enddate>20220913</enddate><creator>Rahman, Md Siddikur</creator><creator>Chowdhury, Arman Hossain</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-8925-6544</orcidid><orcidid>https://orcid.org/0000-0003-1498-287X</orcidid></search><sort><creationdate>20220913</creationdate><title>A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers</title><author>Rahman, Md Siddikur ; Chowdhury, Arman Hossain</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Analysis</topic><topic>Computer and Information Sciences</topic><topic>Coronaviruses</topic><topic>COVID-19</topic><topic>COVID-19 - epidemiology</topic><topic>Disease control</topic><topic>Disease transmission</topic><topic>Early warning systems</topic><topic>Earth Sciences</topic><topic>Fatalities</topic><topic>Forecasting</topic><topic>Generalized linear models</topic><topic>Humans</topic><topic>Humidity</topic><topic>Machine Learning</topic><topic>Maximum temperatures</topic><topic>Maximum winds</topic><topic>Medicine and Health Sciences</topic><topic>Meteorological Concepts</topic><topic>Meteorology</topic><topic>Modelling</topic><topic>Pandemics</topic><topic>People and places</topic><topic>Physical Sciences</topic><topic>Precipitation</topic><topic>Pressure</topic><topic>Public health</topic><topic>Relative humidity</topic><topic>Research and Analysis Methods</topic><topic>Respiratory diseases</topic><topic>Risk analysis</topic><topic>Risk factors</topic><topic>Severe acute respiratory syndrome coronavirus 2</topic><topic>Surface pressure</topic><topic>Time series</topic><topic>Training</topic><topic>Variables</topic><topic>Wind speed</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rahman, Md Siddikur</creatorcontrib><creatorcontrib>Chowdhury, Arman Hossain</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>ProQuest Nursing and Allied Health Source</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>Biological Sciences</collection><collection>Agricultural Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rahman, Md Siddikur</au><au>Chowdhury, Arman Hossain</au><au>Singh, Randeep</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2022-09-13</date><risdate>2022</risdate><volume>17</volume><issue>9</issue><spage>e0273319</spage><pages>e0273319-</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>COVID-19 pandemic has become a global major public health concern. Examining the meteorological risk factors and accurately predicting the incidence of the COVID-19 pandemic is an extremely important challenge. Therefore, in this study, we analyzed the relationship between meteorological factors and COVID-19 transmission in SAARC countries. We also compared the predictive accuracy of Autoregressive Integrated Moving Average (ARIMAX) and eXtreme Gradient Boosting (XGBoost) methods for precise modelling of COVID-19 incidence. We compiled a daily dataset including confirmed COVID-19 case counts, minimum and maximum temperature (°C), relative humidity (%), surface pressure (kPa), precipitation (mm/day) and maximum wind speed (m/s) from the onset of the disease to January 29, 2022, in each country. The data were divided into training and test sets. The training data were used to fit ARIMAX model for examining significant meteorological risk factors. All significant factors were then used as covariates in ARIMAX and XGBoost models to predict the COVID-19 confirmed cases. We found that maximum temperature had a positive impact on the COVID-19 transmission in Afghanistan (β = 11.91, 95% CI: 4.77, 19.05) and India (β = 0.18, 95% CI: 0.01, 0.35). Surface pressure had a positive influence in Pakistan (β = 25.77, 95% CI: 7.85, 43.69) and Sri Lanka (β = 411.63, 95% CI: 49.04, 774.23). We also found that the XGBoost model can help improve prediction of COVID-19 cases in SAARC countries over the ARIMAX model. The study findings will help the scientific communities and policymakers to establish a more accurate early warning system to control the spread of the pandemic.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>36099253</pmid><doi>10.1371/journal.pone.0273319</doi><tpages>e0273319</tpages><orcidid>https://orcid.org/0000-0001-8925-6544</orcidid><orcidid>https://orcid.org/0000-0003-1498-287X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2022-09, Vol.17 (9), p.e0273319
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_2714022939
source Publicly Available Content Database (Proquest) (PQ_SDU_P3); PubMed Central
subjects Accuracy
Analysis
Computer and Information Sciences
Coronaviruses
COVID-19
COVID-19 - epidemiology
Disease control
Disease transmission
Early warning systems
Earth Sciences
Fatalities
Forecasting
Generalized linear models
Humans
Humidity
Machine Learning
Maximum temperatures
Maximum winds
Medicine and Health Sciences
Meteorological Concepts
Meteorology
Modelling
Pandemics
People and places
Physical Sciences
Precipitation
Pressure
Public health
Relative humidity
Research and Analysis Methods
Respiratory diseases
Risk analysis
Risk factors
Severe acute respiratory syndrome coronavirus 2
Surface pressure
Time series
Training
Variables
Wind speed
title A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T09%3A27%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20data-driven%20eXtreme%20gradient%20boosting%20machine%20learning%20model%20to%20predict%20COVID-19%20transmission%20with%20meteorological%20drivers&rft.jtitle=PloS%20one&rft.au=Rahman,%20Md%20Siddikur&rft.date=2022-09-13&rft.volume=17&rft.issue=9&rft.spage=e0273319&rft.pages=e0273319-&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0273319&rft_dat=%3Cgale_plos_%3EA717391443%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c692t-2268ea5d4d914d281b56951a16b243345785d4516bf0d7649e6c6b65ae792073%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2714022939&rft_id=info:pmid/36099253&rft_galeid=A717391443&rfr_iscdi=true