Loading…

Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data

Cell transplantation is an effective method for compensating for the loss of liver function and improve patient survival. However, given that hepatocytes cultivated have diverse developmental processes and physiological features, obtaining hepatocytes that can properly function is difficult. In the...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in bioengineering and biotechnology 2022-05, Vol.10, p.916309
Main Authors: Li, ZhanDong, Huang, FeiMing, Chen, Lei, Huang, Tao, Cai, Yu-Dong
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3
cites cdi_FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3
container_end_page
container_issue
container_start_page 916309
container_title Frontiers in bioengineering and biotechnology
container_volume 10
creator Li, ZhanDong
Huang, FeiMing
Chen, Lei
Huang, Tao
Cai, Yu-Dong
description Cell transplantation is an effective method for compensating for the loss of liver function and improve patient survival. However, given that hepatocytes cultivated have diverse developmental processes and physiological features, obtaining hepatocytes that can properly function is difficult. In the present study, we present an advanced computational analysis on single-cell transcriptional profiling to resolve the heterogeneity of the hepatocyte differentiation process and to mine biomarkers at different periods of differentiation. We obtained a batch of compressed and effective classification features with the Boruta method and ranked them using the Max-Relevance and Min-Redundancy method. Some key genes were identified during the culture of hepatocytes, including , which not only regulates terminally differentiated cells in the liver but also affects cell differentiation. , which encodes a CD147 ligand, also appeared in the identified gene list, and the combination of the two proteins mediated multiple biological pathways. Other genes, such as , , and , which are involved in the maturation and differentiation of hepatocytes and assist different hepatic cell types in performing their roles were also identified. Then, several classifiers were trained and evaluated to obtain optimal classifiers and optimal feature subsets, using three classification algorithms (random forest, k-nearest neighbor, and decision tree) and the incremental feature selection method. The best random forest classifier with a 0.940 Matthews correlation coefficient was constructed to distinguish different hepatic cell types. Finally, classification rules were created for quantitatively describing hepatic cell types. In summary, This study provided potential targets for cell transplantation associated liver disease treatment strategies by elucidating the process and mechanism of hepatocyte development at both qualitative and quantitative levels.
doi_str_mv 10.3389/fbioe.2022.916309
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_ffd889ab316e4e0da4960b97605b8d77</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_ffd889ab316e4e0da4960b97605b8d77</doaj_id><sourcerecordid>2677575372</sourcerecordid><originalsourceid>FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3</originalsourceid><addsrcrecordid>eNpVkc1uEzEURkcIRKvSB2CDvGQzwf8eb5BKCiRSChIFtpbHc524TMapPUOVt8dpStWubF_f79i6p6reEjxjrNEffBsizCimdKaJZFi_qE4p1bLmpBEvn-xPqvOcbzDGhAolGvq6OmFCYSmwOK3ulh0MY_D7MKzRckC_w5gimk_9OCXo0GLa2gEtYGfH6PYjZHRl0x9IGd2FcVMObhMGQCuwaTgQrmDcxC6jTzaXdBzQdan2UM-h79GPbxf1NdyiSzvaN9Urb_sM5w_rWfXry-ef80W9-v51Ob9Y1Y5LMdZeSw9WCOuEZJ53mlDfNUqKllvvqbKSa-wwUYRzp2jLoGFMNNoBaM6UZWfV8sjtor0xuxS2Nu1NtMHcF2JaG5vG4HowvpAbbVtGJHDAneVa4lYriUXbdEoV1scjaze1W-hcGVyy_TPo85shbMw6_jWaNJo2vADePwBSvJ0gj2YbsiujsQPEKRsqlSqKmKKllRxbXYo5J_CPzxBsDv7NvX9z8G-O_kvm3dP_PSb-22b_AK9NrVw</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2677575372</pqid></control><display><type>article</type><title>Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data</title><source>PubMed Central</source><creator>Li, ZhanDong ; Huang, FeiMing ; Chen, Lei ; Huang, Tao ; Cai, Yu-Dong</creator><creatorcontrib>Li, ZhanDong ; Huang, FeiMing ; Chen, Lei ; Huang, Tao ; Cai, Yu-Dong</creatorcontrib><description>Cell transplantation is an effective method for compensating for the loss of liver function and improve patient survival. However, given that hepatocytes cultivated have diverse developmental processes and physiological features, obtaining hepatocytes that can properly function is difficult. In the present study, we present an advanced computational analysis on single-cell transcriptional profiling to resolve the heterogeneity of the hepatocyte differentiation process and to mine biomarkers at different periods of differentiation. We obtained a batch of compressed and effective classification features with the Boruta method and ranked them using the Max-Relevance and Min-Redundancy method. Some key genes were identified during the culture of hepatocytes, including , which not only regulates terminally differentiated cells in the liver but also affects cell differentiation. , which encodes a CD147 ligand, also appeared in the identified gene list, and the combination of the two proteins mediated multiple biological pathways. Other genes, such as , , and , which are involved in the maturation and differentiation of hepatocytes and assist different hepatic cell types in performing their roles were also identified. Then, several classifiers were trained and evaluated to obtain optimal classifiers and optimal feature subsets, using three classification algorithms (random forest, k-nearest neighbor, and decision tree) and the incremental feature selection method. The best random forest classifier with a 0.940 Matthews correlation coefficient was constructed to distinguish different hepatic cell types. Finally, classification rules were created for quantitatively describing hepatic cell types. In summary, This study provided potential targets for cell transplantation associated liver disease treatment strategies by elucidating the process and mechanism of hepatocyte development at both qualitative and quantitative levels.</description><identifier>ISSN: 2296-4185</identifier><identifier>EISSN: 2296-4185</identifier><identifier>DOI: 10.3389/fbioe.2022.916309</identifier><identifier>PMID: 35706505</identifier><language>eng</language><publisher>Switzerland: Frontiers Media S.A</publisher><subject>Bioengineering and Biotechnology ; boruta ; hepatocytes ; machine learning ; max-relevance ; min-redundancy and random forest ; single cell RNA sequencing</subject><ispartof>Frontiers in bioengineering and biotechnology, 2022-05, Vol.10, p.916309</ispartof><rights>Copyright © 2022 Li, Huang, Chen, Huang and Cai.</rights><rights>Copyright © 2022 Li, Huang, Chen, Huang and Cai. 2022 Li, Huang, Chen, Huang and Cai</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3</citedby><cites>FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9189284/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9189284/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27923,27924,53790,53792</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35706505$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Li, ZhanDong</creatorcontrib><creatorcontrib>Huang, FeiMing</creatorcontrib><creatorcontrib>Chen, Lei</creatorcontrib><creatorcontrib>Huang, Tao</creatorcontrib><creatorcontrib>Cai, Yu-Dong</creatorcontrib><title>Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data</title><title>Frontiers in bioengineering and biotechnology</title><addtitle>Front Bioeng Biotechnol</addtitle><description>Cell transplantation is an effective method for compensating for the loss of liver function and improve patient survival. However, given that hepatocytes cultivated have diverse developmental processes and physiological features, obtaining hepatocytes that can properly function is difficult. In the present study, we present an advanced computational analysis on single-cell transcriptional profiling to resolve the heterogeneity of the hepatocyte differentiation process and to mine biomarkers at different periods of differentiation. We obtained a batch of compressed and effective classification features with the Boruta method and ranked them using the Max-Relevance and Min-Redundancy method. Some key genes were identified during the culture of hepatocytes, including , which not only regulates terminally differentiated cells in the liver but also affects cell differentiation. , which encodes a CD147 ligand, also appeared in the identified gene list, and the combination of the two proteins mediated multiple biological pathways. Other genes, such as , , and , which are involved in the maturation and differentiation of hepatocytes and assist different hepatic cell types in performing their roles were also identified. Then, several classifiers were trained and evaluated to obtain optimal classifiers and optimal feature subsets, using three classification algorithms (random forest, k-nearest neighbor, and decision tree) and the incremental feature selection method. The best random forest classifier with a 0.940 Matthews correlation coefficient was constructed to distinguish different hepatic cell types. Finally, classification rules were created for quantitatively describing hepatic cell types. In summary, This study provided potential targets for cell transplantation associated liver disease treatment strategies by elucidating the process and mechanism of hepatocyte development at both qualitative and quantitative levels.</description><subject>Bioengineering and Biotechnology</subject><subject>boruta</subject><subject>hepatocytes</subject><subject>machine learning</subject><subject>max-relevance</subject><subject>min-redundancy and random forest</subject><subject>single cell RNA sequencing</subject><issn>2296-4185</issn><issn>2296-4185</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNpVkc1uEzEURkcIRKvSB2CDvGQzwf8eb5BKCiRSChIFtpbHc524TMapPUOVt8dpStWubF_f79i6p6reEjxjrNEffBsizCimdKaJZFi_qE4p1bLmpBEvn-xPqvOcbzDGhAolGvq6OmFCYSmwOK3ulh0MY_D7MKzRckC_w5gimk_9OCXo0GLa2gEtYGfH6PYjZHRl0x9IGd2FcVMObhMGQCuwaTgQrmDcxC6jTzaXdBzQdan2UM-h79GPbxf1NdyiSzvaN9Urb_sM5w_rWfXry-ef80W9-v51Ob9Y1Y5LMdZeSw9WCOuEZJ53mlDfNUqKllvvqbKSa-wwUYRzp2jLoGFMNNoBaM6UZWfV8sjtor0xuxS2Nu1NtMHcF2JaG5vG4HowvpAbbVtGJHDAneVa4lYriUXbdEoV1scjaze1W-hcGVyy_TPo85shbMw6_jWaNJo2vADePwBSvJ0gj2YbsiujsQPEKRsqlSqKmKKllRxbXYo5J_CPzxBsDv7NvX9z8G-O_kvm3dP_PSb-22b_AK9NrVw</recordid><startdate>20220530</startdate><enddate>20220530</enddate><creator>Li, ZhanDong</creator><creator>Huang, FeiMing</creator><creator>Chen, Lei</creator><creator>Huang, Tao</creator><creator>Cai, Yu-Dong</creator><general>Frontiers Media S.A</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20220530</creationdate><title>Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data</title><author>Li, ZhanDong ; Huang, FeiMing ; Chen, Lei ; Huang, Tao ; Cai, Yu-Dong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Bioengineering and Biotechnology</topic><topic>boruta</topic><topic>hepatocytes</topic><topic>machine learning</topic><topic>max-relevance</topic><topic>min-redundancy and random forest</topic><topic>single cell RNA sequencing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, ZhanDong</creatorcontrib><creatorcontrib>Huang, FeiMing</creatorcontrib><creatorcontrib>Chen, Lei</creatorcontrib><creatorcontrib>Huang, Tao</creatorcontrib><creatorcontrib>Cai, Yu-Dong</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>Frontiers in bioengineering and biotechnology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, ZhanDong</au><au>Huang, FeiMing</au><au>Chen, Lei</au><au>Huang, Tao</au><au>Cai, Yu-Dong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data</atitle><jtitle>Frontiers in bioengineering and biotechnology</jtitle><addtitle>Front Bioeng Biotechnol</addtitle><date>2022-05-30</date><risdate>2022</risdate><volume>10</volume><spage>916309</spage><pages>916309-</pages><issn>2296-4185</issn><eissn>2296-4185</eissn><abstract>Cell transplantation is an effective method for compensating for the loss of liver function and improve patient survival. However, given that hepatocytes cultivated have diverse developmental processes and physiological features, obtaining hepatocytes that can properly function is difficult. In the present study, we present an advanced computational analysis on single-cell transcriptional profiling to resolve the heterogeneity of the hepatocyte differentiation process and to mine biomarkers at different periods of differentiation. We obtained a batch of compressed and effective classification features with the Boruta method and ranked them using the Max-Relevance and Min-Redundancy method. Some key genes were identified during the culture of hepatocytes, including , which not only regulates terminally differentiated cells in the liver but also affects cell differentiation. , which encodes a CD147 ligand, also appeared in the identified gene list, and the combination of the two proteins mediated multiple biological pathways. Other genes, such as , , and , which are involved in the maturation and differentiation of hepatocytes and assist different hepatic cell types in performing their roles were also identified. Then, several classifiers were trained and evaluated to obtain optimal classifiers and optimal feature subsets, using three classification algorithms (random forest, k-nearest neighbor, and decision tree) and the incremental feature selection method. The best random forest classifier with a 0.940 Matthews correlation coefficient was constructed to distinguish different hepatic cell types. Finally, classification rules were created for quantitatively describing hepatic cell types. In summary, This study provided potential targets for cell transplantation associated liver disease treatment strategies by elucidating the process and mechanism of hepatocyte development at both qualitative and quantitative levels.</abstract><cop>Switzerland</cop><pub>Frontiers Media S.A</pub><pmid>35706505</pmid><doi>10.3389/fbioe.2022.916309</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2296-4185
ispartof Frontiers in bioengineering and biotechnology, 2022-05, Vol.10, p.916309
issn 2296-4185
2296-4185
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_ffd889ab316e4e0da4960b97605b8d77
source PubMed Central
subjects Bioengineering and Biotechnology
boruta
hepatocytes
machine learning
max-relevance
min-redundancy and random forest
single cell RNA sequencing
title Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T08%3A35%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identifying%20In%20Vitro%20Cultured%20Human%20Hepatocytes%20Markers%20with%20Machine%20Learning%20Methods%20Based%20on%20Single-Cell%20RNA-Seq%20Data&rft.jtitle=Frontiers%20in%20bioengineering%20and%20biotechnology&rft.au=Li,%20ZhanDong&rft.date=2022-05-30&rft.volume=10&rft.spage=916309&rft.pages=916309-&rft.issn=2296-4185&rft.eissn=2296-4185&rft_id=info:doi/10.3389/fbioe.2022.916309&rft_dat=%3Cproquest_doaj_%3E2677575372%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c465t-f96fea55ac563f4d912fd8765b4aff27a6490c017144c72b3e833589cee9437a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2677575372&rft_id=info:pmid/35706505&rfr_iscdi=true