Loading…

AtML: An Arabidopsis thaliana root cell identity recognition tool for medicinal ingredient accumulation

•A machine learning model, AtML, was constructed to predict the stages of Arabidopsis root cells and identify biomarkers.•Performance testing revealed that AtML achieved very high accuracy and recall.•The AtML model identified 160 important marker genes for annotating cell type. Arabidopsis thaliana...

Full description

Saved in:
Bibliographic Details
Published in:Methods (San Diego, Calif.) Calif.), 2024-11, Vol.231, p.61-69
Main Authors: Yu, Shicong, Liu, Lijia, Wang, Hao, Yan, Shen, Zheng, Shuqin, Ning, Jing, Luo, Ruxian, Fu, Xiangzheng, Deng, Xiaoshu
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•A machine learning model, AtML, was constructed to predict the stages of Arabidopsis root cells and identify biomarkers.•Performance testing revealed that AtML achieved very high accuracy and recall.•The AtML model identified 160 important marker genes for annotating cell type. Arabidopsis thaliana synthesizes various medicinal compounds, and serves as a model plant for medicinal plant research. Single-cell transcriptomics technologies are essential for understanding the developmental trajectory of plant roots, facilitating the analysis of synthesis and accumulation patterns of medicinal compounds in different cell subpopulations. Although methods for interpreting single-cell transcriptomics data are rapidly advancing in Arabidopsis, challenges remain in precisely annotating cell identity due to the lack of marker genes for certain cell types. In this work, we trained a machine learning system, AtML, using sequencing datasets from six cell subpopulations, comprising a total of 6000 cells, to predict Arabidopsis root cell stages and identify biomarkers through complete model interpretability. Performance testing using an external dataset revealed that AtML achieved 96.50% accuracy and 96.51% recall. Through the interpretability provided by AtML, our model identified 160 important marker genes, contributing to the understanding of cell type annotations. In conclusion, we trained AtML to efficiently identify Arabidopsis root cell stages, providing a new tool for elucidating the mechanisms of medicinal compound accumulation in Arabidopsis roots.
ISSN:1046-2023
1095-9130
1095-9130
DOI:10.1016/j.ymeth.2024.09.010