Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles
Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen pote...
Published in: | Chinese chemical letters 2022-12, Vol.33 (12), p.5184-5188 |
---|---|
Main Authors: | Zeng, Ting; Liang, Yanshan; Dai, Qingyuan; Tian, Jinglin; Chen, Jinyao; Lei, Bo; Yang, Zhu; Cai, Zongwei |
Format: | Article |
Language: | English |
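Subjects: | Cadmium exposure; High-resolution mass spectrometry; Human urine; Machine learning; Metabolic profiles |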
cited_by | cdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3 |
---|---|
cites | cdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3 |
container_end_page | 5188 |
container_issue | 12 |
container_start_page | 5184 |
container_title | Chinese chemical letters |
container_volume | 33 |
creator | Zeng, Ting; Liang, Yanshan; Dai, Qingyuan; Tian, Jinglin; Chen, Jinyao; Lei, Bo; Yang, Zhu; Cai, Zongwei |
description | Exposure to environmental cadmium increases health risks for residents. Early detection of urinary metabolic changes using high-resolution mass spectrometry and machine learning algorithms would help predict adverse health effects. Here, we applied machine learning approaches to screen potential biomarkers of cadmium exposure in 403 urine samples. In positive and negative ionization modes, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than the others. Following 5-fold cross-validation, the area under the curve (AUC) of the XGBoost classifier was 0.93 in both positive and negative ionization modes. For the RF classifier, the AUCs were 0.80 and 0.84 in positive and negative ionization modes, respectively. We then identified a biomarker panel based on the XGBoost and RF classifiers. Incorporating machine learning models into urine analysis by high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure.
In a cohort of 403 volunteers who had been exposed to cadmium, urine metabolic detection based on high-resolution mass spectrometry was conducted, seven machine learning algorithms were compared on the LC-HRMS data set, and a biomarker panel based on the selected machine learning models was identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than the others, which provides a new reference for selecting data-driven machine learning algorithms for metabolic analysis of urine under cadmium exposure. |
doi_str_mv | 10.1016/j.cclet.2022.03.020 |
format | article |
fullrecord | <record><control><sourceid>wanfang_jour_cross</sourceid><recordid>TN_cdi_wanfang_journals_zghxkb202212040</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><wanfj_id>zghxkb202212040</wanfj_id><els_id>S1001841722002169</els_id><sourcerecordid>zghxkb202212040</sourcerecordid><originalsourceid>FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</originalsourceid><addsrcrecordid>eNp9kLtu3DAURIXABrJx_AVu2LmSwoceVOHCMJwHsECapCYuqatd7kqkQFKJ7S_wZ5ubTe3q3mLODGaK4obRilHWfjlUxkyYKk45r6ioKKcfig2TnSybvq0v8k8pK2XNuo_FpxgPlHIpRbspXu-XZbIGkvWO-JHMYPbWIZkQgrNuR2Da-WDTfo4keRJNQHRk8QldsjARbf0M4YghktUNGIiBYbbrTPBp8XENSDREHEh2368zOLKGk_2MCbTPwWQJfrQTxs_F5QhTxOv_96r4_fXx18P3cvvz24-H-21phGhSyVkPDE1Tg5bYy14AaqwNbyjrJALlLba6ERxBjtDJTvN67KHl2Ardm9aIq-L27PsX3Ahupw5-DS4nqpfd_umoTxMyTmualeKsNMHHGHBUS7C57LNiVJ1mVwf1b3Z1YhQVKs-eqbszhbnEH4tBRWPRGRxsQJPU4O27_BunSZBQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</title><source>Elsevier</source><creator>Zeng, Ting ; Liang, Yanshan ; Dai, Qingyuan ; Tian, Jinglin ; Chen, Jinyao ; Lei, Bo ; Yang, Zhu ; Cai, Zongwei</creator><creatorcontrib>Zeng, Ting ; Liang, Yanshan ; Dai, Qingyuan ; Tian, Jinglin ; Chen, Jinyao ; Lei, Bo ; Yang, Zhu ; Cai, Zongwei</creatorcontrib><description>Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. 
We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure.
[Display omitted]
On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.</description><identifier>ISSN: 1001-8417</identifier><identifier>EISSN: 1878-5964</identifier><identifier>DOI: 10.1016/j.cclet.2022.03.020</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Cadmium exposure ; High-resolution mass spectrometry ; Human urine ; Machine learning ; Metabolic profiles</subject><ispartof>Chinese chemical letters, 2022-12, Vol.33 (12), p.5184-5188</ispartof><rights>2022</rights><rights>Copyright © Wanfang Data Co. Ltd. 
All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</citedby><cites>FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</cites><orcidid>0000-0001-5934-1617</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://www.wanfangdata.com.cn/images/PeriodicalImages/zghxkb/zghxkb.jpg</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Zeng, Ting</creatorcontrib><creatorcontrib>Liang, Yanshan</creatorcontrib><creatorcontrib>Dai, Qingyuan</creatorcontrib><creatorcontrib>Tian, Jinglin</creatorcontrib><creatorcontrib>Chen, Jinyao</creatorcontrib><creatorcontrib>Lei, Bo</creatorcontrib><creatorcontrib>Yang, Zhu</creatorcontrib><creatorcontrib>Cai, Zongwei</creatorcontrib><title>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</title><title>Chinese chemical letters</title><description>Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. 
Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure.
[Display omitted]
On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.</description><subject>Cadmium exposure</subject><subject>High-resolution mass spectrometry</subject><subject>Human urine</subject><subject>Machine learning</subject><subject>Metabolic profiles</subject><issn>1001-8417</issn><issn>1878-5964</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kLtu3DAURIXABrJx_AVu2LmSwoceVOHCMJwHsECapCYuqatd7kqkQFKJ7S_wZ5ubTe3q3mLODGaK4obRilHWfjlUxkyYKk45r6ioKKcfig2TnSybvq0v8k8pK2XNuo_FpxgPlHIpRbspXu-XZbIGkvWO-JHMYPbWIZkQgrNuR2Da-WDTfo4keRJNQHRk8QldsjARbf0M4YghktUNGIiBYbbrTPBp8XENSDREHEh2368zOLKGk_2MCbTPwWQJfrQTxs_F5QhTxOv_96r4_fXx18P3cvvz24-H-21phGhSyVkPDE1Tg5bYy14AaqwNbyjrJALlLba6ERxBjtDJTvN67KHl2Ardm9aIq-L27PsX3Ahupw5-DS4nqpfd_umoTxMyTmualeKsNMHHGHBUS7C57LNiVJ1mVwf1b3Z1YhQVKs-eqbszhbnEH4tBRWPRGRxsQJPU4O27_BunSZBQ</recordid><startdate>20221201</startdate><enddate>20221201</enddate><creator>Zeng, Ting</creator><creator>Liang, Yanshan</creator><creator>Dai, Qingyuan</creator><creator>Tian, Jinglin</creator><creator>Chen, Jinyao</creator><creator>Lei, Bo</creator><creator>Yang, Zhu</creator><creator>Cai, Zongwei</creator><general>Elsevier B.V</general><general>Food Science and Technology Program,Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai 519087,China</general><general>State Key Laboratory of Environmental and Biological 
Analysis,Department of Chemistry,Hong Kong Baptist University,Hong Kong,China%Department of Nutrition,Food Safety and Toxicology,West China School of Public Health,Sichuan University,Chengdu 610041,China%Food Science and Technology Program,Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai 519087,China%State Key Laboratory of Environmental and Biological Analysis,Department of Chemistry,Hong Kong Baptist University,Hong Kong,China</general><scope>AAYXX</scope><scope>CITATION</scope><scope>2B.</scope><scope>4A8</scope><scope>92I</scope><scope>93N</scope><scope>PSX</scope><scope>TCJ</scope><orcidid>https://orcid.org/0000-0001-5934-1617</orcidid></search><sort><creationdate>20221201</creationdate><title>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</title><author>Zeng, Ting ; Liang, Yanshan ; Dai, Qingyuan ; Tian, Jinglin ; Chen, Jinyao ; Lei, Bo ; Yang, Zhu ; Cai, Zongwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Cadmium exposure</topic><topic>High-resolution mass spectrometry</topic><topic>Human urine</topic><topic>Machine learning</topic><topic>Metabolic profiles</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zeng, Ting</creatorcontrib><creatorcontrib>Liang, Yanshan</creatorcontrib><creatorcontrib>Dai, Qingyuan</creatorcontrib><creatorcontrib>Tian, Jinglin</creatorcontrib><creatorcontrib>Chen, Jinyao</creatorcontrib><creatorcontrib>Lei, Bo</creatorcontrib><creatorcontrib>Yang, Zhu</creatorcontrib><creatorcontrib>Cai, Zongwei</creatorcontrib><collection>CrossRef</collection><collection>Wanfang Data Journals - Hong Kong</collection><collection>WANFANG 
Data Centre</collection><collection>Wanfang Data Journals</collection><collection>万方数据期刊 - 香港版</collection><collection>China Online Journals (COJ)</collection><collection>China Online Journals (COJ)</collection><jtitle>Chinese chemical letters</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zeng, Ting</au><au>Liang, Yanshan</au><au>Dai, Qingyuan</au><au>Tian, Jinglin</au><au>Chen, Jinyao</au><au>Lei, Bo</au><au>Yang, Zhu</au><au>Cai, Zongwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles</atitle><jtitle>Chinese chemical letters</jtitle><date>2022-12-01</date><risdate>2022</risdate><volume>33</volume><issue>12</issue><spage>5184</spage><epage>5188</epage><pages>5184-5188</pages><issn>1001-8417</issn><eissn>1878-5964</eissn><abstract>Exposure to environmental cadmium increases the health risk of residents. Early urine metabolic detection using high-resolution mass spectrometry and machine learning algorithms would be advantageous to predict the adverse health effects. Here, we conducted machine learning approaches to screen potential biomarkers under cadmium exposure in 403 urine samples. In positive and negative ionization mode, 4207 and 3558 features were extracted, respectively. We compared seven machine learning algorithms and found that the extreme gradient boosting (XGBoost) and random forest (RF) classifiers showed better accuracy and predictive performance than others. Following 5-fold cross-validation, the value of area under curve (AUC) was both 0.93 for positive and negative ionization modes in XGBoost classifier. In the RF classifier, AUC were 0.80 and 0.84 for positive and negative ionization modes, respectively. We then identified a biomarker panel based on XGBoost and RF classifiers. 
The incorporation of machine learning models into urine analysis using high-resolution mass spectrometry could allow a convenient assessment of cadmium exposure.
[Display omitted]
On a cohort of 403 volunteers who had been exposed to cadmium, high-resolution mass spectrometry-based urine metabolic detection was conducted, seven machine learning algorithms on the LCHRMS data set were compared, and a biomarker panel based on the selected machine learning mode were identified. The extreme gradient boosting and random forest classifiers showed better accuracy and predictive performance than others which indicates this study has added a new reference for selecting data-driven machine learning algorithms for a metabolic analysis of urine under cadmium exposure.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.cclet.2022.03.020</doi><tpages>5</tpages><orcidid>https://orcid.org/0000-0001-5934-1617</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1001-8417 |
ispartof | Chinese chemical letters, 2022-12, Vol.33 (12), p.5184-5188 |
issn | 1001-8417; 1878-5964 |
language | eng |
recordid | cdi_wanfang_journals_zghxkb202212040 |
source | Elsevier |
subjects | Cadmium exposure; High-resolution mass spectrometry; Human urine; Machine learning; Metabolic profiles |
title | Application of machine learning algorithms to screen potential biomarkers under cadmium exposure based on human urine metabolic profiles |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T10%3A54%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wanfang_jour_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20machine%20learning%20algorithms%20to%20screen%20potential%20biomarkers%20under%20cadmium%20exposure%20based%20on%20human%20urine%20metabolic%20profiles&rft.jtitle=Chinese%20chemical%20letters&rft.au=Zeng,%20Ting&rft.date=2022-12-01&rft.volume=33&rft.issue=12&rft.spage=5184&rft.epage=5188&rft.pages=5184-5188&rft.issn=1001-8417&rft.eissn=1878-5964&rft_id=info:doi/10.1016/j.cclet.2022.03.020&rft_dat=%3Cwanfang_jour_cross%3Ezghxkb202212040%3C/wanfang_jour_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c335t-219a1ec54ab8e9893aebe4c250178ea026e6b532ea8fa787b24f9a62e63b9c6c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_wanfj_id=zghxkb202212040&rfr_iscdi=true |
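
The description above reports AUC values obtained after 5-fold cross-validation. As a rough illustration of what those two terms mean, the sketch below computes an AUC from labels and classifier scores and partitions 403 samples into five folds. It is not the authors' pipeline: the function names `roc_auc` and `k_fold_indices` are hypothetical, the interleaved fold assignment is an assumption (the record does not say how folds were drawn), and the example scores are made up rather than taken from the study.

```python
from typing import List, Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a randomly chosen positive sample
    scores higher than a randomly chosen negative one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Return k (train, test) index pairs; every sample is tested exactly once.
    Folds are interleaved (0, k, 2k, ...), an illustrative choice only."""
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    splits = []
    for test in folds:
        held_out = set(test)
        train = [j for j in range(n_samples) if j not in held_out]
        splits.append((train, test))
    return splits


if __name__ == "__main__":
    # Made-up labels and scores, purely to exercise the function.
    print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
    # 403 matches the sample count in the description; fold sizes are printed.
    print([len(test) for _, test in k_fold_indices(403, 5)])
```

In a cross-validated evaluation, a classifier would be fitted on each `train` split and scored on the matching `test` split, and the per-fold AUCs would then be summarized; whether the study averaged per-fold values or pooled predictions is not stated in this record.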