Loading…

Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks

Water infrastructure in the United States is beginning to show its age, particularly through water main breaks. Main breaks cause major disruptions in everyday life for residents and businesses. Water main failures in Syracuse, N.Y. (as in most cities) are handled reactively rather than proactively....

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2018-05
Main Authors: Kumar, Avishek, Syed Ali Asad Rizvi, Brooks, Benjamin, R Ali Vanderveld, Wilson, Kevin H, Kenney, Chad, Edelstein, Sam, Finch, Adria, Maxwell, Andrew, Zuckerbraun, Joe, Ghani, Rayid
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Kumar, Avishek
Syed Ali Asad Rizvi
Brooks, Benjamin
R Ali Vanderveld
Wilson, Kevin H
Kenney, Chad
Edelstein, Sam
Finch, Adria
Maxwell, Andrew
Zuckerbraun, Joe
Ghani, Rayid
description Water infrastructure in the United States is beginning to show its age, particularly through water main breaks. Main breaks cause major disruptions in everyday life for residents and businesses. Water main failures in Syracuse, N.Y. (as in most cities) are handled reactively rather than proactively. A barrier to proactive maintenance is the city's inability to predict the risk of failure on parts of its infrastructure. In response, we worked with the city to build a ML system to assess the risk of a water mains breaking. Using historical data on which mains have failed, descriptors of pipes, and other data sources, we evaluated several models' abilities to predict breaks three years into the future. Our results show that our system using gradient boosted decision trees performed the best out of several algorithms and expert heuristics, achieving precision at 1\% (P@1) of 0.62. Our model outperforms a random baseline (P@1 of 0.08) and expert heuristics such as water main age (P@1 of 0.10) and history of past main breaks (P@1 of 0.48). The model is deployed in the City of Syracuse. We are running a pilot by calculating the risk of failure for each city block over the period 2016-2018 using data up to the end of 2015 and, as of the end of 2017, there have been 33 breaks on our riskiest 52 mains. This has been a successful initiative for the city of Syracuse in improving their infrastructure and we believe this approach can be applied to other cities.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2072286878</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2072286878</sourcerecordid><originalsourceid>FETCH-proquest_journals_20722868783</originalsourceid><addsrcrecordid>eNqNi0EKwjAQRYMgWLR3GHBdiFPbZquiiCiIKC5L0KlNK4lmUs-vggdw9eG993siwjSdJGqKOBAxcyOlxLzALEsjsTmxsTfY6UttLMGWtLdfEBzMmIkZQk1wMNyCq0DbK-w9vcgGOOtA_nM0FuaedMsj0a_0nSn-7VCMV8vjYp08vHt2xKFsXOftR5UoC0SVq0Kl_1VvMAg7mQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2072286878</pqid></control><display><type>article</type><title>Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks</title><source>Publicly Available Content Database</source><creator>Kumar, Avishek ; Syed Ali Asad Rizvi ; Brooks, Benjamin ; R Ali Vanderveld ; Wilson, Kevin H ; Kenney, Chad ; Edelstein, Sam ; Finch, Adria ; Maxwell, Andrew ; Zuckerbraun, Joe ; Ghani, Rayid</creator><creatorcontrib>Kumar, Avishek ; Syed Ali Asad Rizvi ; Brooks, Benjamin ; R Ali Vanderveld ; Wilson, Kevin H ; Kenney, Chad ; Edelstein, Sam ; Finch, Adria ; Maxwell, Andrew ; Zuckerbraun, Joe ; Ghani, Rayid</creatorcontrib><description>Water infrastructure in the United States is beginning to show its age, particularly through water main breaks. Main breaks cause major disruptions in everyday life for residents and businesses. Water main failures in Syracuse, N.Y. (as in most cities) are handled reactively rather than proactively. A barrier to proactive maintenance is the city's inability to predict the risk of failure on parts of its infrastructure. In response, we worked with the city to build a ML system to assess the risk of a water mains breaking. Using historical data on which mains have failed, descriptors of pipes, and other data sources, we evaluated several models' abilities to predict breaks three years into the future. Our results show that our system using gradient boosted decision trees performed the best out of several algorithms and expert heuristics, achieving precision at 1\% (P@1) of 0.62. Our model outperforms a random baseline (P@1 of 0.08) and expert heuristics such as water main age (P@1 of 0.10) and history of past main breaks (P@1 of 0.48). The model is deployed in the City of Syracuse. We are running a pilot by calculating the risk of failure for each city block over the period 2016-2018 using data up to the end of 2015 and, as of the end of 2017, there have been 33 breaks on our riskiest 52 mains. This has been a successful initiative for the city of Syracuse in improving their infrastructure and we believe this approach can be applied to other cities.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Cities ; Decision trees ; Infrastructure ; Machine learning ; Mathematical models ; Risk assessment ; Water mains ; Water pipelines ; Water supply systems</subject><ispartof>arXiv.org, 2018-05</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2072286878?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25752,37011,44589</link.rule.ids></links><search><creatorcontrib>Kumar, Avishek</creatorcontrib><creatorcontrib>Syed Ali Asad Rizvi</creatorcontrib><creatorcontrib>Brooks, Benjamin</creatorcontrib><creatorcontrib>R Ali Vanderveld</creatorcontrib><creatorcontrib>Wilson, Kevin H</creatorcontrib><creatorcontrib>Kenney, Chad</creatorcontrib><creatorcontrib>Edelstein, Sam</creatorcontrib><creatorcontrib>Finch, Adria</creatorcontrib><creatorcontrib>Maxwell, Andrew</creatorcontrib><creatorcontrib>Zuckerbraun, Joe</creatorcontrib><creatorcontrib>Ghani, Rayid</creatorcontrib><title>Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks</title><title>arXiv.org</title><description>Water infrastructure in the United States is beginning to show its age, particularly through water main breaks. Main breaks cause major disruptions in everyday life for residents and businesses. Water main failures in Syracuse, N.Y. (as in most cities) are handled reactively rather than proactively. A barrier to proactive maintenance is the city's inability to predict the risk of failure on parts of its infrastructure. In response, we worked with the city to build a ML system to assess the risk of a water mains breaking. Using historical data on which mains have failed, descriptors of pipes, and other data sources, we evaluated several models' abilities to predict breaks three years into the future. Our results show that our system using gradient boosted decision trees performed the best out of several algorithms and expert heuristics, achieving precision at 1\% (P@1) of 0.62. Our model outperforms a random baseline (P@1 of 0.08) and expert heuristics such as water main age (P@1 of 0.10) and history of past main breaks (P@1 of 0.48). The model is deployed in the City of Syracuse. We are running a pilot by calculating the risk of failure for each city block over the period 2016-2018 using data up to the end of 2015 and, as of the end of 2017, there have been 33 breaks on our riskiest 52 mains. This has been a successful initiative for the city of Syracuse in improving their infrastructure and we believe this approach can be applied to other cities.</description><subject>Algorithms</subject><subject>Cities</subject><subject>Decision trees</subject><subject>Infrastructure</subject><subject>Machine learning</subject><subject>Mathematical models</subject><subject>Risk assessment</subject><subject>Water mains</subject><subject>Water pipelines</subject><subject>Water supply systems</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNi0EKwjAQRYMgWLR3GHBdiFPbZquiiCiIKC5L0KlNK4lmUs-vggdw9eG993siwjSdJGqKOBAxcyOlxLzALEsjsTmxsTfY6UttLMGWtLdfEBzMmIkZQk1wMNyCq0DbK-w9vcgGOOtA_nM0FuaedMsj0a_0nSn-7VCMV8vjYp08vHt2xKFsXOftR5UoC0SVq0Kl_1VvMAg7mQ</recordid><startdate>20180509</startdate><enddate>20180509</enddate><creator>Kumar, Avishek</creator><creator>Syed Ali Asad Rizvi</creator><creator>Brooks, Benjamin</creator><creator>R Ali Vanderveld</creator><creator>Wilson, Kevin H</creator><creator>Kenney, Chad</creator><creator>Edelstein, Sam</creator><creator>Finch, Adria</creator><creator>Maxwell, Andrew</creator><creator>Zuckerbraun, Joe</creator><creator>Ghani, Rayid</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180509</creationdate><title>Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks</title><author>Kumar, Avishek ; Syed Ali Asad Rizvi ; Brooks, Benjamin ; R Ali Vanderveld ; Wilson, Kevin H ; Kenney, Chad ; Edelstein, Sam ; Finch, Adria ; Maxwell, Andrew ; Zuckerbraun, Joe ; Ghani, Rayid</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20722868783</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Algorithms</topic><topic>Cities</topic><topic>Decision trees</topic><topic>Infrastructure</topic><topic>Machine learning</topic><topic>Mathematical models</topic><topic>Risk assessment</topic><topic>Water mains</topic><topic>Water pipelines</topic><topic>Water supply systems</topic><toplevel>online_resources</toplevel><creatorcontrib>Kumar, Avishek</creatorcontrib><creatorcontrib>Syed Ali Asad Rizvi</creatorcontrib><creatorcontrib>Brooks, Benjamin</creatorcontrib><creatorcontrib>R Ali Vanderveld</creatorcontrib><creatorcontrib>Wilson, Kevin H</creatorcontrib><creatorcontrib>Kenney, Chad</creatorcontrib><creatorcontrib>Edelstein, Sam</creatorcontrib><creatorcontrib>Finch, Adria</creatorcontrib><creatorcontrib>Maxwell, Andrew</creatorcontrib><creatorcontrib>Zuckerbraun, Joe</creatorcontrib><creatorcontrib>Ghani, Rayid</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kumar, Avishek</au><au>Syed Ali Asad Rizvi</au><au>Brooks, Benjamin</au><au>R Ali Vanderveld</au><au>Wilson, Kevin H</au><au>Kenney, Chad</au><au>Edelstein, Sam</au><au>Finch, Adria</au><au>Maxwell, Andrew</au><au>Zuckerbraun, Joe</au><au>Ghani, Rayid</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks</atitle><jtitle>arXiv.org</jtitle><date>2018-05-09</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>Water infrastructure in the United States is beginning to show its age, particularly through water main breaks. Main breaks cause major disruptions in everyday life for residents and businesses. Water main failures in Syracuse, N.Y. (as in most cities) are handled reactively rather than proactively. A barrier to proactive maintenance is the city's inability to predict the risk of failure on parts of its infrastructure. In response, we worked with the city to build a ML system to assess the risk of a water mains breaking. Using historical data on which mains have failed, descriptors of pipes, and other data sources, we evaluated several models' abilities to predict breaks three years into the future. Our results show that our system using gradient boosted decision trees performed the best out of several algorithms and expert heuristics, achieving precision at 1\% (P@1) of 0.62. Our model outperforms a random baseline (P@1 of 0.08) and expert heuristics such as water main age (P@1 of 0.10) and history of past main breaks (P@1 of 0.48). The model is deployed in the City of Syracuse. We are running a pilot by calculating the risk of failure for each city block over the period 2016-2018 using data up to the end of 2015 and, as of the end of 2017, there have been 33 breaks on our riskiest 52 mains. This has been a successful initiative for the city of Syracuse in improving their infrastructure and we believe this approach can be applied to other cities.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2018-05
issn 2331-8422
language eng
recordid cdi_proquest_journals_2072286878
source Publicly Available Content Database
subjects Algorithms
Cities
Decision trees
Infrastructure
Machine learning
Mathematical models
Risk assessment
Water mains
Water pipelines
Water supply systems
title Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T10%3A57%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Using%20Machine%20Learning%20to%20Assess%20the%20Risk%20of%20and%20Prevent%20Water%20Main%20Breaks&rft.jtitle=arXiv.org&rft.au=Kumar,%20Avishek&rft.date=2018-05-09&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2072286878%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_20722868783%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2072286878&rft_id=info:pmid/&rfr_iscdi=true