Loading…

Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology

A comprehensive data driven modeling experiment is presented in a two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the...

Full description

Saved in:
Bibliographic Details
Published in:Hydrology and earth system sciences 2010-10, Vol.14 (10), p.1931-1941
Main Authors: Elshorbagy, A, Corzo, G, Srinivasulu, S, Solomatine, D P
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543
cites cdi_FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543
container_end_page 1941
container_issue 10
container_start_page 1931
container_title Hydrology and earth system sciences
container_volume 14
creator Elshorbagy, A
Corzo, G
Srinivasulu, S
Solomatine, D P
description A comprehensive data driven modeling experiment is presented in a two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the basis on which findings and conclusions were drawn are discussed. A concise review of key articles that presented comparisons among various DDM techniques is presented. Six DDM techniques, namely, neural networks, genetic programming, evolutionary polynomial regression, support vector machines, M5 model trees, and K-nearest neighbors are proposed and explained. Multiple linear regression and naïve models are also suggested as baseline for comparison with the various techniques. Five datasets from Canada and Europe representing evapotranspiration, upper and lower layer soil moisture content, and rainfall-runoff process are described and proposed, in the second paper, for the modeling experiment. Twelve different realizations (groups) from each dataset are created by a procedure involving random sampling. Each group contains three subsets; training, cross-validation, and testing. Each modeling technique is proposed to be applied to each of the 12 groups of each dataset. This way, both prediction accuracy and uncertainty of the modeling techniques can be evaluated. The description of the datasets, the implementation of the modeling techniques, results and analysis, and the findings of the modeling experiment are deferred to the second part of this paper.
doi_str_mv 10.5194/hess-14-1931-2010
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_85603d99dde044439960e90a0e3c11cd</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_85603d99dde044439960e90a0e3c11cd</doaj_id><sourcerecordid>817610083</sourcerecordid><originalsourceid>FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543</originalsourceid><addsrcrecordid>eNpdkUGLFDEUhBtRcF39Ad6CF0-t73XS6cSbDLu6sKAHPYdM8nomQ0_SJpnF-Q_-aLt3RMRTHqmPoopqmtcI73rU4v2eSmlRtKg5th0gPGmuUMLQDpqrp__cz5sXpRwAOqVkd9X8uvk5Uw5HitVOLMQHKjXsbA0psjSyuic2Z_LB1fBAzNnZbsMUaqCyyt5Wy3xepMiOydMU4o5VcvsYfpwWJES2P_ucprQ7s5Z9tbky_MA2KTqaa2E2enakuk_-EXnZPBvtVOjVn_e6-X57823zub3_8ulu8_G-dWLQtZXgLDrfCeU1wui8RDW6kTsliMNWSrflvJODxw7FVng9jIOCEVQHvXC94NfN3cXXJ3sw81Lf5rNJNpjHj5R3Zkka3ERG9RK419p7AiEE11oCabBA3OESYvF6e_Gac1o7V3MMxdE02UjpVIzCQSKA4gv55j_ykE45LkWNXnQcEPoFwgvkciol0_g3HoJZlzbr0gaFWZc269L8N6n6nfQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>908317105</pqid></control><display><type>article</type><title>Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology</title><source>Access via ProQuest (Open Access)</source><source>DOAJ Directory of Open Access Journals</source><creator>Elshorbagy, A ; Corzo, G ; Srinivasulu, S ; Solomatine, D P</creator><creatorcontrib>Elshorbagy, A ; Corzo, G ; Srinivasulu, S ; Solomatine, D P</creatorcontrib><description>A comprehensive data driven modeling experiment is presented in a two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the basis on which findings and conclusions were drawn are discussed. A concise review of key articles that presented comparisons among various DDM techniques is presented. Six DDM techniques, namely, neural networks, genetic programming, evolutionary polynomial regression, support vector machines, M5 model trees, and K-nearest neighbors are proposed and explained. Multiple linear regression and naïve models are also suggested as baseline for comparison with the various techniques. Five datasets from Canada and Europe representing evapotranspiration, upper and lower layer soil moisture content, and rainfall-runoff process are described and proposed, in the second paper, for the modeling experiment. Twelve different realizations (groups) from each dataset are created by a procedure involving random sampling. Each group contains three subsets; training, cross-validation, and testing. Each modeling technique is proposed to be applied to each of the 12 groups of each dataset. This way, both prediction accuracy and uncertainty of the modeling techniques can be evaluated. The description of the datasets, the implementation of the modeling techniques, results and analysis, and the findings of the modeling experiment are deferred to the second part of this paper.</description><identifier>ISSN: 1607-7938</identifier><identifier>ISSN: 1027-5606</identifier><identifier>EISSN: 1607-7938</identifier><identifier>DOI: 10.5194/hess-14-1931-2010</identifier><language>eng</language><publisher>Katlenburg-Lindau: Copernicus GmbH</publisher><ispartof>Hydrology and earth system sciences, 2010-10, Vol.14 (10), p.1931-1941</ispartof><rights>Hydrology and Earth System Sciences 2010</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543</citedby><cites>FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/908317105/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/908317105?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,25753,27924,27925,37012,37013,44590,75126</link.rule.ids></links><search><creatorcontrib>Elshorbagy, A</creatorcontrib><creatorcontrib>Corzo, G</creatorcontrib><creatorcontrib>Srinivasulu, S</creatorcontrib><creatorcontrib>Solomatine, D P</creatorcontrib><title>Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology</title><title>Hydrology and earth system sciences</title><description>A comprehensive data driven modeling experiment is presented in a two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the basis on which findings and conclusions were drawn are discussed. A concise review of key articles that presented comparisons among various DDM techniques is presented. Six DDM techniques, namely, neural networks, genetic programming, evolutionary polynomial regression, support vector machines, M5 model trees, and K-nearest neighbors are proposed and explained. Multiple linear regression and naïve models are also suggested as baseline for comparison with the various techniques. Five datasets from Canada and Europe representing evapotranspiration, upper and lower layer soil moisture content, and rainfall-runoff process are described and proposed, in the second paper, for the modeling experiment. Twelve different realizations (groups) from each dataset are created by a procedure involving random sampling. Each group contains three subsets; training, cross-validation, and testing. Each modeling technique is proposed to be applied to each of the 12 groups of each dataset. This way, both prediction accuracy and uncertainty of the modeling techniques can be evaluated. The description of the datasets, the implementation of the modeling techniques, results and analysis, and the findings of the modeling experiment are deferred to the second part of this paper.</description><issn>1607-7938</issn><issn>1027-5606</issn><issn>1607-7938</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpdkUGLFDEUhBtRcF39Ad6CF0-t73XS6cSbDLu6sKAHPYdM8nomQ0_SJpnF-Q_-aLt3RMRTHqmPoopqmtcI73rU4v2eSmlRtKg5th0gPGmuUMLQDpqrp__cz5sXpRwAOqVkd9X8uvk5Uw5HitVOLMQHKjXsbA0psjSyuic2Z_LB1fBAzNnZbsMUaqCyyt5Wy3xepMiOydMU4o5VcvsYfpwWJES2P_ucprQ7s5Z9tbky_MA2KTqaa2E2enakuk_-EXnZPBvtVOjVn_e6-X57823zub3_8ulu8_G-dWLQtZXgLDrfCeU1wui8RDW6kTsliMNWSrflvJODxw7FVng9jIOCEVQHvXC94NfN3cXXJ3sw81Lf5rNJNpjHj5R3Zkka3ERG9RK419p7AiEE11oCabBA3OESYvF6e_Gac1o7V3MMxdE02UjpVIzCQSKA4gv55j_ykE45LkWNXnQcEPoFwgvkciol0_g3HoJZlzbr0gaFWZc269L8N6n6nfQ</recordid><startdate>20101014</startdate><enddate>20101014</enddate><creator>Elshorbagy, A</creator><creator>Corzo, G</creator><creator>Srinivasulu, S</creator><creator>Solomatine, D P</creator><general>Copernicus GmbH</general><general>Copernicus Publications</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QH</scope><scope>7TG</scope><scope>7UA</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BFMQW</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>F1W</scope><scope>FR3</scope><scope>GNUQQ</scope><scope>H96</scope><scope>HCIFZ</scope><scope>KL.</scope><scope>KR7</scope><scope>L.G</scope><scope>L6V</scope><scope>M7S</scope><scope>PATMY</scope><scope>PCBAR</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>DOA</scope></search><sort><creationdate>20101014</creationdate><title>Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology</title><author>Elshorbagy, A ; Corzo, G ; Srinivasulu, S ; Solomatine, D P</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Elshorbagy, A</creatorcontrib><creatorcontrib>Corzo, G</creatorcontrib><creatorcontrib>Srinivasulu, S</creatorcontrib><creatorcontrib>Solomatine, D P</creatorcontrib><collection>CrossRef</collection><collection>Aqualine</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Water Resources Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Continental Europe Database</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Engineering Research Database</collection><collection>ProQuest Central Student</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>SciTech Premium Collection</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>Civil Engineering Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Environmental Science Database</collection><collection>Earth, Atmospheric &amp; Aquatic Science Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Hydrology and earth system sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Elshorbagy, A</au><au>Corzo, G</au><au>Srinivasulu, S</au><au>Solomatine, D P</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology</atitle><jtitle>Hydrology and earth system sciences</jtitle><date>2010-10-14</date><risdate>2010</risdate><volume>14</volume><issue>10</issue><spage>1931</spage><epage>1941</epage><pages>1931-1941</pages><issn>1607-7938</issn><issn>1027-5606</issn><eissn>1607-7938</eissn><abstract>A comprehensive data driven modeling experiment is presented in a two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the basis on which findings and conclusions were drawn are discussed. A concise review of key articles that presented comparisons among various DDM techniques is presented. Six DDM techniques, namely, neural networks, genetic programming, evolutionary polynomial regression, support vector machines, M5 model trees, and K-nearest neighbors are proposed and explained. Multiple linear regression and naïve models are also suggested as baseline for comparison with the various techniques. Five datasets from Canada and Europe representing evapotranspiration, upper and lower layer soil moisture content, and rainfall-runoff process are described and proposed, in the second paper, for the modeling experiment. Twelve different realizations (groups) from each dataset are created by a procedure involving random sampling. Each group contains three subsets; training, cross-validation, and testing. Each modeling technique is proposed to be applied to each of the 12 groups of each dataset. This way, both prediction accuracy and uncertainty of the modeling techniques can be evaluated. The description of the datasets, the implementation of the modeling techniques, results and analysis, and the findings of the modeling experiment are deferred to the second part of this paper.</abstract><cop>Katlenburg-Lindau</cop><pub>Copernicus GmbH</pub><doi>10.5194/hess-14-1931-2010</doi><tpages>11</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1607-7938
ispartof Hydrology and earth system sciences, 2010-10, Vol.14 (10), p.1931-1941
issn 1607-7938
1027-5606
1607-7938
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_85603d99dde044439960e90a0e3c11cd
source Access via ProQuest (Open Access); DOAJ Directory of Open Access Journals
title Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T08%3A05%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Experimental%20investigation%20of%20the%20predictive%20capabilities%20of%20data%20driven%20modeling%20techniques%20in%20hydrology%20-%20Part%201:%20Concepts%20and%20methodology&rft.jtitle=Hydrology%20and%20earth%20system%20sciences&rft.au=Elshorbagy,%20A&rft.date=2010-10-14&rft.volume=14&rft.issue=10&rft.spage=1931&rft.epage=1941&rft.pages=1931-1941&rft.issn=1607-7938&rft.eissn=1607-7938&rft_id=info:doi/10.5194/hess-14-1931-2010&rft_dat=%3Cproquest_doaj_%3E817610083%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c479t-60ca1cd248d910fcd618fcf3c84e30b66cb33267d1214b4d97f780f082054c543%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=908317105&rft_id=info:pmid/&rfr_iscdi=true