Loading…

Evaluation of E. coli in sediment for assessing irrigation water quality using machine learning

Fresh produce irrigated with contaminated water poses a substantial risk to human health. This study evaluated the impact of incorporating sediment information on improving the performance of machine learning models to quantify E. coli level in irrigation water. Field samples were collected from irr...

Full description

Saved in:
Bibliographic Details
Published in:The Science of the total environment 2021-12, Vol.799, p.149286, Article 149286
Main Authors: Tousi, Erfan Ghasemi, Duan, Jennifer G., Gundy, Patricia M., Bright, Kelly R., Gerba, Charles P.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Fresh produce irrigated with contaminated water poses a substantial risk to human health. This study evaluated the impact of incorporating sediment information on improving the performance of machine learning models to quantify E. coli level in irrigation water. Field samples were collected from irrigation canals in the Southwest U.S., for which meteorological, chemical, and physical water quality variables as well as three additional flow and sediment properties: the concentration of E. coli in sediment, sediment median size, and bed shear stress. Water quality was classified based on E. coli concentration exceeding two standard levels: 1 E. coli and 126 E. coli colony forming units (CFU) per 100 ml of irrigation water. Two series of features, including (FIS) and excluding (FES) sediment features, were selected using multi-variant filter feature selection. The correlation analysis revealed the inclusion of sediment features improves the correlation with the target standards for E. coli compared to the models excluding these features. Support vector machine, logistic regression, and ridge classifier were tested in this study. The support vector machine model performed the best for both targeted standards. Besides, incorporating sediment features improved all models' performance. Therefore, the concentration of E. coli in sediment and bed shear stress are major factors influencing E. coli concentration in irrigation water. [Display omitted] •Role of sediment on quantifying E. coli in water was assessed by Machine Learning.•Including sediment properties improves classification of irrigation water quality.•Principal component analysis provides insights of data pattern using reduced dimensions.•Utilizing non-linear kernels improves the performance of linear models.•Support vector machine model performs the best after including sediment data.
ISSN:0048-9697
1879-1026
DOI:10.1016/j.scitotenv.2021.149286