Loading…

A machine learning approach to water quality forecasts and sensor network expansion: Case study in the Wabash River Basin, United States

Midwestern cities require forecasts of surface nitrate loads to bring additional treatment processes online or activate alternative water supplies. Concurrently, networks of nitrate monitoring stations are being deployed in river basins, co‐locating water quality observations with established stream...

Full description

Saved in:
Bibliographic Details
Published in:Hydrological processes 2022-06, Vol.36 (6), p.n/a
Main Authors: Balson, Tyler, Ward, Adam S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Midwestern cities require forecasts of surface nitrate loads to bring additional treatment processes online or activate alternative water supplies. Concurrently, networks of nitrate monitoring stations are being deployed in river basins, co‐locating water quality observations with established stream gauges. However, tools to evaluate the future value of expanded networks to improve water quality forecasts remains challenging. Here, we construct a synthetic data set of stream discharge and nitrate for the Wabash River Basin—one of the United States’ most nutrient polluted basins—using the established Agro‐IBIS and THMB models. Synthetic data enables rapid, unbiased and low‐cost assessment of potential sensor placements to support management objectives, such as near‐term forecasting. Using the synthetic data, we established baseline 1‐day forecasts for surface water nitrate at 12 cities in the basin using support vector machine regression (SVMR; RMSE 0.48–3.3 ppm). Next, we used the SVMRs to evaluate the improvement in forecast performance associated with deployment of additional nitrate sensors. We identified the optimal sensor placement to improve forecasts at each city, and the relative value of sensors at each candidate location. Finally, we assessed the co‐benefit realized by other cities when a sensor is deployed to optimize a forecast at one city, finding significant positive externalities in all cases. Ultimately, our study explores the potential for machine learning to make near‐term predictions and critically evaluate the improvement realized by expanding a monitoring network. While we use nitrate pollution in the Wabash River Basin as a case study, this approach could be readily applied to any problem where the future value of sensors and network design are being evaluated. This study applies machine learning to: (1) produce near‐term water quality forecasts based on an existing monitoring network; and (2) evaluation of future network configurations to increase predictive power. Using the synthetic data, we established 1‐day forecasts for surface water nitrate at 12 cities using support vector machine regression (SVMR; RMSE 0.48–3.3 mg/L). We then showed how those cities could improve their forecast with new data (5%–82% reduction in RMSE).
ISSN:0885-6087
1099-1085
DOI:10.1002/hyp.14619