Loading…

Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles

Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen pote...

Full description

Saved in:
Bibliographic Details
Published in:Chinese chemical letters 2022-12, Vol.33 (12), p.5184-5188
Main Authors: Zeng, Ting, Liang, Yanshan, Dai, Qingyuan, Tian, Jinglin, Chen, Jinyao, Lei, Bo, Yang, Zhu, Cai, Zongwei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3
cites cdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3
container_end_page 5188
container_issue 12
container_start_page 5184
container_title Chinese chemical letters
container_volume 33
creator Zeng, Ting
Liang, Yanshan
Dai, Qingyuan
Tian, Jinglin
Chen, Jinyao
Lei, Bo
Yang, Zhu
Cai, Zongwei
description Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure. [Display omitted] On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.
doi_str_mv 10.1016/j.cclet.2022.03.020
format article
fullrecord <record><control><sourceid>wanfang_jour_cross</sourceid><recordid>TN_cdi_wanfang_journals_zghxkb202212040</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><wanfj_id>zghxkb202212040</wanfj_id><els_id>S1001841722002169</els_id><sourcerecordid>zghxkb202212040</sourcerecordid><originalsourceid>FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</originalsourceid><addsrcrecordid>eNp9kLtu3DAURIXABrJx_AVu2LmSwoceVOHCMJwHsECapCYuqatd7kqkQFKJ7S_wZ5ubTe3q3mLODGaK4obRilHWfjlUxkyYKk45r6ioKKcfig2TnSybvq0v8k8pK2XNuo_FpxgPlHIpRbspXu-XZbIGkvWO-JHMYPbWIZkQgrNuR2Da-WDTfo4keRJNQHRk8QldsjARbf0M4YghktUNGIiBYbbrTPBp8XENSDREHEh2368zOLKGk_2MCbTPwWQJfrQTxs_F5QhTxOv_96r4_fXx18P3cvvz24-H-21phGhSyVkPDE1Tg5bYy14AaqwNbyjrJALlLba6ERxBjtDJTvN67KHl2Ardm9aIq-L27PsX3Ahupw5-DS4nqpfd_umoTxMyTmualeKsNMHHGHBUS7C57LNiVJ1mVwf1b3Z1YhQVKs-eqbszhbnEH4tBRWPRGRxsQJPU4O27_BunSZBQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</title><source>Elsevier</source><creator>Zeng, Ting ; Liang, Yanshan ; Dai, Qingyuan ; Tian, Jinglin ; Chen, Jinyao ; Lei, Bo ; Yang, Zhu ; Cai, Zongwei</creator><creatorcontrib>Zeng, Ting ; Liang, Yanshan ; Dai, Qingyuan ; Tian, Jinglin ; Chen, Jinyao ; Lei, Bo ; Yang, Zhu ; Cai, Zongwei</creatorcontrib><description>Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure. [Display omitted] On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.</description><identifier>ISSN: 1001-8417</identifier><identifier>EISSN: 1878-5964</identifier><identifier>DOI: 10.1016/j.cclet.2022.03.020</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Cadmium exposure ; High-resolution mass spectrometry ; Human urine ; Machine learning ; Metabolic profiles</subject><ispartof>Chinese chemical letters, 2022-12, Vol.33 (12), p.5184-5188</ispartof><rights>2022</rights><rights>Copyright © Wanfang Data Co. Ltd. All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</citedby><cites>FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</cites><orcidid>0000-0001-5934-1617</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://www.wanfangdata.com.cn/images/PeriodicalImages/zghxkb/zghxkb.jpg</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Zeng, Ting</creatorcontrib><creatorcontrib>Liang, Yanshan</creatorcontrib><creatorcontrib>Dai, Qingyuan</creatorcontrib><creatorcontrib>Tian, Jinglin</creatorcontrib><creatorcontrib>Chen, Jinyao</creatorcontrib><creatorcontrib>Lei, Bo</creatorcontrib><creatorcontrib>Yang, Zhu</creatorcontrib><creatorcontrib>Cai, Zongwei</creatorcontrib><title>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</title><title>Chinese chemical letters</title><description>Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure. [Display omitted] On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.</description><subject>Cadmium exposure</subject><subject>High-resolution mass spectrometry</subject><subject>Human urine</subject><subject>Machine learning</subject><subject>Metabolic profiles</subject><issn>1001-8417</issn><issn>1878-5964</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kLtu3DAURIXABrJx_AVu2LmSwoceVOHCMJwHsECapCYuqatd7kqkQFKJ7S_wZ5ubTe3q3mLODGaK4obRilHWfjlUxkyYKk45r6ioKKcfig2TnSybvq0v8k8pK2XNuo_FpxgPlHIpRbspXu-XZbIGkvWO-JHMYPbWIZkQgrNuR2Da-WDTfo4keRJNQHRk8QldsjARbf0M4YghktUNGIiBYbbrTPBp8XENSDREHEh2368zOLKGk_2MCbTPwWQJfrQTxs_F5QhTxOv_96r4_fXx18P3cvvz24-H-21phGhSyVkPDE1Tg5bYy14AaqwNbyjrJALlLba6ERxBjtDJTvN67KHl2Ardm9aIq-L27PsX3Ahupw5-DS4nqpfd_umoTxMyTmualeKsNMHHGHBUS7C57LNiVJ1mVwf1b3Z1YhQVKs-eqbszhbnEH4tBRWPRGRxsQJPU4O27_BunSZBQ</recordid><startdate>20221201</startdate><enddate>20221201</enddate><creator>Zeng, Ting</creator><creator>Liang, Yanshan</creator><creator>Dai, Qingyuan</creator><creator>Tian, Jinglin</creator><creator>Chen, Jinyao</creator><creator>Lei, Bo</creator><creator>Yang, Zhu</creator><creator>Cai, Zongwei</creator><general>Elsevier B.V</general><general>Food Science and Technology Program,Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai 519087,China</general><general>State Key Laboratory of Environmental and Biological Analysis,Department of Chemistry,Hong Kong Baptist University,Hong Kong,China%Department of Nutrition,Food Safety and Toxicology,West China School of Public Health,Sichuan University,Chengdu 610041,China%Food Science and Technology Program,Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai 519087,China%State Key Laboratory of Environmental and Biological Analysis,Department of Chemistry,Hong Kong Baptist University,Hong Kong,China</general><scope>AAYXX</scope><scope>CITATION</scope><scope>2B.</scope><scope>4A8</scope><scope>92I</scope><scope>93N</scope><scope>PSX</scope><scope>TCJ</scope><orcidid>https://orcid.org/0000-0001-5934-1617</orcidid></search><sort><creationdate>20221201</creationdate><title>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</title><author>Zeng, Ting ; Liang, Yanshan ; Dai, Qingyuan ; Tian, Jinglin ; Chen, Jinyao ; Lei, Bo ; Yang, Zhu ; Cai, Zongwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Cadmium exposure</topic><topic>High-resolution mass spectrometry</topic><topic>Human urine</topic><topic>Machine learning</topic><topic>Metabolic profiles</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zeng, Ting</creatorcontrib><creatorcontrib>Liang, Yanshan</creatorcontrib><creatorcontrib>Dai, Qingyuan</creatorcontrib><creatorcontrib>Tian, Jinglin</creatorcontrib><creatorcontrib>Chen, Jinyao</creatorcontrib><creatorcontrib>Lei, Bo</creatorcontrib><creatorcontrib>Yang, Zhu</creatorcontrib><creatorcontrib>Cai, Zongwei</creatorcontrib><collection>CrossRef</collection><collection>Wanfang Data Journals - Hong Kong</collection><collection>WANFANG Data Centre</collection><collection>Wanfang Data Journals</collection><collection>万方数据期刊 - 香港版</collection><collection>China Online Journals (COJ)</collection><collection>China Online Journals (COJ)</collection><jtitle>Chinese chemical letters</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zeng, Ting</au><au>Liang, Yanshan</au><au>Dai, Qingyuan</au><au>Tian, Jinglin</au><au>Chen, Jinyao</au><au>Lei, Bo</au><au>Yang, Zhu</au><au>Cai, Zongwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</atitle><jtitle>Chinese chemical letters</jtitle><date>2022-12-01</date><risdate>2022</risdate><volume>33</volume><issue>12</issue><spage>5184</spage><epage>5188</epage><pages>5184-5188</pages><issn>1001-8417</issn><eissn>1878-5964</eissn><abstract>Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure. [Display omitted] On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.cclet.2022.03.020</doi><tpages>5</tpages><orcidid>https://orcid.org/0000-0001-5934-1617</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1001-8417
ispartof Chinese chemical letters, 2022-12, Vol.33 (12), p.5184-5188
issn 1001-8417
1878-5964
language eng
recordid cdi_wanfang_journals_zghxkb202212040
source Elsevier
subjects Cadmium exposure
High-resolution mass spectrometry
Human urine
Machine learning
Metabolic profiles
title Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T10%3A54%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wanfang_jour_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20machine%20learning%20algorithms%20to%20screen%20potential%20biomarkers%20under%20cadmium%20exposure%20based%20on%20human%20urine%20metabolic%20profiles&rft.jtitle=Chinese%20chemical%20letters&rft.au=Zeng,%20Ting&rft.date=2022-12-01&rft.volume=33&rft.issue=12&rft.spage=5184&rft.epage=5188&rft.pages=5184-5188&rft.issn=1001-8417&rft.eissn=1878-5964&rft_id=info:doi/10.1016/j.cclet.2022.03.020&rft_dat=%3Cwanfang_jour_cross%3Ezghxkb202212040%3C/wanfang_jour_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_wanfj_id=zghxkb202212040&rfr_iscdi=true