Loading…

Sequential Data Processing for IMERG Satellite Rainfall Comparison and Improvement Using LSTM and ADAM Optimizer

This study introduces a systematic methodology whereby different technologies were utilized to download, pre-process, and interactively compare the rainfall datasets from the Integrated Multi-Satellite Retrievals for Global Precipitation Mission (IMERG) satellite and rain gauges. To efficiently hand...

Full description

Saved in:
Bibliographic Details
Published in:Applied sciences 2023-06, Vol.13 (12), p.7237
Main Authors: Toh, Seng Choon, Lai, Sai Hin, Mirzaei, Majid, Soo, Eugene Zhen Xiang, Teo, Fang Yenn
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033
cites cdi_FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033
container_end_page
container_issue 12
container_start_page 7237
container_title Applied sciences
container_volume 13
creator Toh, Seng Choon
Lai, Sai Hin
Mirzaei, Majid
Soo, Eugene Zhen Xiang
Teo, Fang Yenn
description This study introduces a systematic methodology whereby different technologies were utilized to download, pre-process, and interactively compare the rainfall datasets from the Integrated Multi-Satellite Retrievals for Global Precipitation Mission (IMERG) satellite and rain gauges. To efficiently handle the large volume of data, we developed automated shell scripts for downloading IMERG data and storing it, along with rain gauge data, in a relational database system. Hypertext pre-processor (pHp) programs were built to visualize the result for better analysis. In this study, the performance of IMERG estimations over the east coast of Peninsular Malaysia for the duration of 10 years (2011–2020) against rain gauge observation data is evaluated. Moreover, this study aimed to improve the daily IMERG estimations with long short-term memory (LSTM) developed with Python. Findings show that the LSTM with Adaptive Moment Estimation (ADAM) optimizer trained against the mean square error (MSE) loss enhances the accuracy of satellite estimations. At the point-to-pixel scale, the correlation between satellite estimations and ground observations was increased by about 15%. The bias was reduced by 81–118%, MAE was reduced by 18–59%, the root-mean-square error (RMSE) was reduced by 1–66%, and the Kling–Gupta efficiency (KGE) was increased by approximately 200%. The approach developed in this study establishes a comprehensive and scalable data processing and analysis pipeline that can be applied to diverse datasets and regions encountering similar domain-specific challenges.
doi_str_mv 10.3390/app13127237
format article
fullrecord <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_54ce4218f07a4faeb68b02cd14fd7fc5</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A754973869</galeid><doaj_id>oai_doaj_org_article_54ce4218f07a4faeb68b02cd14fd7fc5</doaj_id><sourcerecordid>A754973869</sourcerecordid><originalsourceid>FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033</originalsourceid><addsrcrecordid>eNpNkVtLIzEUxwdxQVGf9gsEfJS6uc1k8liqdgstitXncCaXkjIzmU3igvvpN7Yi5jzkcJL_79yq6ifBt4xJ_AumiTBCBWXipDqnWDQzxok4_eafVVcp7XE5krCW4PNq2to_b3bMHnp0BxnQUwzapuTHHXIhotXm_nmJtpBt3_ts0TP40UHfo0UYJog-hRHBaNBqmGL4a4eCQq8H9Xr7sjk8ze_mG_Q4ZT_4fzZeVj-KPtmrz_uien24f1n8nq0fl6vFfD3THLM8Y21DgVPeUqeZ08I2YLjRGlouaimhaQzXDSdGdkQw6lhrupp0nBpmtcGMXVSrI9cE2Ksp-gHiuwrg1SEQ4k5BzF73VtVcW05J67AA7sB2Tdthqg3hzgin68K6PrJKj2VaKat9eItjKV_RlkpBqGQfGW-Pv3ZQoGVMIUfQxYwdvA6jdb7E56LmUpT2ZBHcHAU6hpSidV9lEqw-Vqq-rZT9B-BFkuA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2829712933</pqid></control><display><type>article</type><title>Sequential Data Processing for IMERG Satellite Rainfall Comparison and Improvement Using LSTM and ADAM Optimizer</title><source>Publicly Available Content Database</source><creator>Toh, Seng Choon ; Lai, Sai Hin ; Mirzaei, Majid ; Soo, Eugene Zhen Xiang ; Teo, Fang Yenn</creator><creatorcontrib>Toh, Seng Choon ; Lai, Sai Hin ; Mirzaei, Majid ; Soo, Eugene Zhen Xiang ; Teo, Fang Yenn</creatorcontrib><description>This study introduces a systematic methodology whereby different technologies were utilized to download, pre-process, and interactively compare the rainfall datasets from the Integrated Multi-Satellite Retrievals for Global Precipitation Mission (IMERG) satellite and rain gauges. To efficiently handle the large volume of data, we developed automated shell scripts for downloading IMERG data and storing it, along with rain gauge data, in a relational database system. Hypertext pre-processor (pHp) programs were built to visualize the result for better analysis. In this study, the performance of IMERG estimations over the east coast of Peninsular Malaysia for the duration of 10 years (2011–2020) against rain gauge observation data is evaluated. Moreover, this study aimed to improve the daily IMERG estimations with long short-term memory (LSTM) developed with Python. Findings show that the LSTM with Adaptive Moment Estimation (ADAM) optimizer trained against the mean square error (MSE) loss enhances the accuracy of satellite estimations. At the point-to-pixel scale, the correlation between satellite estimations and ground observations was increased by about 15%. The bias was reduced by 81–118%, MAE was reduced by 18–59%, the root-mean-square error (RMSE) was reduced by 1–66%, and the Kling–Gupta efficiency (KGE) was increased by approximately 200%. The approach developed in this study establishes a comprehensive and scalable data processing and analysis pipeline that can be applied to diverse datasets and regions encountering similar domain-specific challenges.</description><identifier>ISSN: 2076-3417</identifier><identifier>EISSN: 2076-3417</identifier><identifier>DOI: 10.3390/app13127237</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Accuracy ; ADAM ; Computational linguistics ; Data processing ; Datasets ; Deep learning ; Electronic data processing ; Error reduction ; Gauges ; Hydrology ; Hypertext ; IMERG satellite rainfall ; Language processing ; Long short-term memory ; LSTM ; Machine learning ; Microprocessors ; Natural language interfaces ; Neural networks ; pHp ; Precipitation ; Python ; Rain ; Rain and rainfall ; Rain gauges ; Rainfall ; Relational data bases ; Remote sensing ; Root-mean-square errors ; Software ; SQL relational database ; Time series ; Wind</subject><ispartof>Applied sciences, 2023-06, Vol.13 (12), p.7237</ispartof><rights>COPYRIGHT 2023 MDPI AG</rights><rights>2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033</citedby><cites>FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033</cites><orcidid>0009-0006-3735-1666 ; 0000-0002-5529-1381 ; 0000-0002-7143-4805 ; 0000-0003-1978-1110</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2829712933/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2829712933?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,25731,27901,27902,36989,44566,75096</link.rule.ids></links><search><creatorcontrib>Toh, Seng Choon</creatorcontrib><creatorcontrib>Lai, Sai Hin</creatorcontrib><creatorcontrib>Mirzaei, Majid</creatorcontrib><creatorcontrib>Soo, Eugene Zhen Xiang</creatorcontrib><creatorcontrib>Teo, Fang Yenn</creatorcontrib><title>Sequential Data Processing for IMERG Satellite Rainfall Comparison and Improvement Using LSTM and ADAM Optimizer</title><title>Applied sciences</title><description>This study introduces a systematic methodology whereby different technologies were utilized to download, pre-process, and interactively compare the rainfall datasets from the Integrated Multi-Satellite Retrievals for Global Precipitation Mission (IMERG) satellite and rain gauges. To efficiently handle the large volume of data, we developed automated shell scripts for downloading IMERG data and storing it, along with rain gauge data, in a relational database system. Hypertext pre-processor (pHp) programs were built to visualize the result for better analysis. In this study, the performance of IMERG estimations over the east coast of Peninsular Malaysia for the duration of 10 years (2011–2020) against rain gauge observation data is evaluated. Moreover, this study aimed to improve the daily IMERG estimations with long short-term memory (LSTM) developed with Python. Findings show that the LSTM with Adaptive Moment Estimation (ADAM) optimizer trained against the mean square error (MSE) loss enhances the accuracy of satellite estimations. At the point-to-pixel scale, the correlation between satellite estimations and ground observations was increased by about 15%. The bias was reduced by 81–118%, MAE was reduced by 18–59%, the root-mean-square error (RMSE) was reduced by 1–66%, and the Kling–Gupta efficiency (KGE) was increased by approximately 200%. The approach developed in this study establishes a comprehensive and scalable data processing and analysis pipeline that can be applied to diverse datasets and regions encountering similar domain-specific challenges.</description><subject>Accuracy</subject><subject>ADAM</subject><subject>Computational linguistics</subject><subject>Data processing</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Electronic data processing</subject><subject>Error reduction</subject><subject>Gauges</subject><subject>Hydrology</subject><subject>Hypertext</subject><subject>IMERG satellite rainfall</subject><subject>Language processing</subject><subject>Long short-term memory</subject><subject>LSTM</subject><subject>Machine learning</subject><subject>Microprocessors</subject><subject>Natural language interfaces</subject><subject>Neural networks</subject><subject>pHp</subject><subject>Precipitation</subject><subject>Python</subject><subject>Rain</subject><subject>Rain and rainfall</subject><subject>Rain gauges</subject><subject>Rainfall</subject><subject>Relational data bases</subject><subject>Remote sensing</subject><subject>Root-mean-square errors</subject><subject>Software</subject><subject>SQL relational database</subject><subject>Time series</subject><subject>Wind</subject><issn>2076-3417</issn><issn>2076-3417</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpNkVtLIzEUxwdxQVGf9gsEfJS6uc1k8liqdgstitXncCaXkjIzmU3igvvpN7Yi5jzkcJL_79yq6ifBt4xJ_AumiTBCBWXipDqnWDQzxok4_eafVVcp7XE5krCW4PNq2to_b3bMHnp0BxnQUwzapuTHHXIhotXm_nmJtpBt3_ts0TP40UHfo0UYJog-hRHBaNBqmGL4a4eCQq8H9Xr7sjk8ze_mG_Q4ZT_4fzZeVj-KPtmrz_uien24f1n8nq0fl6vFfD3THLM8Y21DgVPeUqeZ08I2YLjRGlouaimhaQzXDSdGdkQw6lhrupp0nBpmtcGMXVSrI9cE2Ksp-gHiuwrg1SEQ4k5BzF73VtVcW05J67AA7sB2Tdthqg3hzgin68K6PrJKj2VaKat9eItjKV_RlkpBqGQfGW-Pv3ZQoGVMIUfQxYwdvA6jdb7E56LmUpT2ZBHcHAU6hpSidV9lEqw-Vqq-rZT9B-BFkuA</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Toh, Seng Choon</creator><creator>Lai, Sai Hin</creator><creator>Mirzaei, Majid</creator><creator>Soo, Eugene Zhen Xiang</creator><creator>Teo, Fang Yenn</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>DOA</scope><orcidid>https://orcid.org/0009-0006-3735-1666</orcidid><orcidid>https://orcid.org/0000-0002-5529-1381</orcidid><orcidid>https://orcid.org/0000-0002-7143-4805</orcidid><orcidid>https://orcid.org/0000-0003-1978-1110</orcidid></search><sort><creationdate>20230601</creationdate><title>Sequential Data Processing for IMERG Satellite Rainfall Comparison and Improvement Using LSTM and ADAM Optimizer</title><author>Toh, Seng Choon ; Lai, Sai Hin ; Mirzaei, Majid ; Soo, Eugene Zhen Xiang ; Teo, Fang Yenn</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>ADAM</topic><topic>Computational linguistics</topic><topic>Data processing</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Electronic data processing</topic><topic>Error reduction</topic><topic>Gauges</topic><topic>Hydrology</topic><topic>Hypertext</topic><topic>IMERG satellite rainfall</topic><topic>Language processing</topic><topic>Long short-term memory</topic><topic>LSTM</topic><topic>Machine learning</topic><topic>Microprocessors</topic><topic>Natural language interfaces</topic><topic>Neural networks</topic><topic>pHp</topic><topic>Precipitation</topic><topic>Python</topic><topic>Rain</topic><topic>Rain and rainfall</topic><topic>Rain gauges</topic><topic>Rainfall</topic><topic>Relational data bases</topic><topic>Remote sensing</topic><topic>Root-mean-square errors</topic><topic>Software</topic><topic>SQL relational database</topic><topic>Time series</topic><topic>Wind</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Toh, Seng Choon</creatorcontrib><creatorcontrib>Lai, Sai Hin</creatorcontrib><creatorcontrib>Mirzaei, Majid</creatorcontrib><creatorcontrib>Soo, Eugene Zhen Xiang</creatorcontrib><creatorcontrib>Teo, Fang Yenn</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Applied sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Toh, Seng Choon</au><au>Lai, Sai Hin</au><au>Mirzaei, Majid</au><au>Soo, Eugene Zhen Xiang</au><au>Teo, Fang Yenn</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sequential Data Processing for IMERG Satellite Rainfall Comparison and Improvement Using LSTM and ADAM Optimizer</atitle><jtitle>Applied sciences</jtitle><date>2023-06-01</date><risdate>2023</risdate><volume>13</volume><issue>12</issue><spage>7237</spage><pages>7237-</pages><issn>2076-3417</issn><eissn>2076-3417</eissn><abstract>This study introduces a systematic methodology whereby different technologies were utilized to download, pre-process, and interactively compare the rainfall datasets from the Integrated Multi-Satellite Retrievals for Global Precipitation Mission (IMERG) satellite and rain gauges. To efficiently handle the large volume of data, we developed automated shell scripts for downloading IMERG data and storing it, along with rain gauge data, in a relational database system. Hypertext pre-processor (pHp) programs were built to visualize the result for better analysis. In this study, the performance of IMERG estimations over the east coast of Peninsular Malaysia for the duration of 10 years (2011–2020) against rain gauge observation data is evaluated. Moreover, this study aimed to improve the daily IMERG estimations with long short-term memory (LSTM) developed with Python. Findings show that the LSTM with Adaptive Moment Estimation (ADAM) optimizer trained against the mean square error (MSE) loss enhances the accuracy of satellite estimations. At the point-to-pixel scale, the correlation between satellite estimations and ground observations was increased by about 15%. The bias was reduced by 81–118%, MAE was reduced by 18–59%, the root-mean-square error (RMSE) was reduced by 1–66%, and the Kling–Gupta efficiency (KGE) was increased by approximately 200%. The approach developed in this study establishes a comprehensive and scalable data processing and analysis pipeline that can be applied to diverse datasets and regions encountering similar domain-specific challenges.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/app13127237</doi><orcidid>https://orcid.org/0009-0006-3735-1666</orcidid><orcidid>https://orcid.org/0000-0002-5529-1381</orcidid><orcidid>https://orcid.org/0000-0002-7143-4805</orcidid><orcidid>https://orcid.org/0000-0003-1978-1110</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2076-3417
ispartof Applied sciences, 2023-06, Vol.13 (12), p.7237
issn 2076-3417
2076-3417
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_54ce4218f07a4faeb68b02cd14fd7fc5
source Publicly Available Content Database
subjects Accuracy
ADAM
Computational linguistics
Data processing
Datasets
Deep learning
Electronic data processing
Error reduction
Gauges
Hydrology
Hypertext
IMERG satellite rainfall
Language processing
Long short-term memory
LSTM
Machine learning
Microprocessors
Natural language interfaces
Neural networks
pHp
Precipitation
Python
Rain
Rain and rainfall
Rain gauges
Rainfall
Relational data bases
Remote sensing
Root-mean-square errors
Software
SQL relational database
Time series
Wind
title Sequential Data Processing for IMERG Satellite Rainfall Comparison and Improvement Using LSTM and ADAM Optimizer
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T20%3A35%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sequential%20Data%20Processing%20for%20IMERG%20Satellite%20Rainfall%20Comparison%20and%20Improvement%20Using%20LSTM%20and%20ADAM%20Optimizer&rft.jtitle=Applied%20sciences&rft.au=Toh,%20Seng%20Choon&rft.date=2023-06-01&rft.volume=13&rft.issue=12&rft.spage=7237&rft.pages=7237-&rft.issn=2076-3417&rft.eissn=2076-3417&rft_id=info:doi/10.3390/app13127237&rft_dat=%3Cgale_doaj_%3EA754973869%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c403t-3862a42482fc3fc7e6ad4dcca847599a66d4c641d9b1732f38db51b42d3ecd033%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2829712933&rft_id=info:pmid/&rft_galeid=A754973869&rfr_iscdi=true