Loading…

PathogenTrack and Yeskit: tools for identifying intracellular pathogens from single-cell RNA-sequencing datasets as illustrated by application to COVID-19

Pathogenic microbes can induce cellular dysfunction, immune response, and cause infectious disease and other diseases including cancers. However, the cellular distributions of pathogens and their impact on host cells remain rarely explored due to the limited methods. Taking advantage of single-cell...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers of medicine 2022-04, Vol.16 (2), p.251-262
Main Authors: Zhang, Wei, Xu, Xiaoguang, Fu, Ziyu, Chen, Jian, Chen, Saijuan, Tan, Yun
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83
cites cdi_FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83
container_end_page 262
container_issue 2
container_start_page 251
container_title Frontiers of medicine
container_volume 16
creator Zhang, Wei
Xu, Xiaoguang
Fu, Ziyu
Chen, Jian
Chen, Saijuan
Tan, Yun
description Pathogenic microbes can induce cellular dysfunction, immune response, and cause infectious disease and other diseases including cancers. However, the cellular distributions of pathogens and their impact on host cells remain rarely explored due to the limited methods. Taking advantage of single-cell RNA-sequencing (scRNA-seq) analysis, we can assess the transcriptomic features at the single-cell level. Still, the tools used to interpret pathogens (such as viruses, bacteria, and fungi) at the single-cell level remain to be explored. Here, we introduced PathogenTrack, a python-based computational pipeline that uses unmapped scRNA-seq data to identify intracellular pathogens at the single-cell level. In addition, we established an R package named Yeskit to import, integrate, analyze, and interpret pathogen abundance and transcriptomic features in host cells. Robustness of these tools has been tested on various real and simulated scRNA-seq datasets. PathogenTrack is competitive to the state-of-the-art tools such as Viral-Track, and the first tools for identifying bacteria at the single-cell level. Using the raw data of bronchoalveolar lavage fluid samples (BALF) from COVID-19 patients in the SRA database, we found the SARS-CoV-2 virus exists in multiple cell types including epithelial cells and macrophages. SARS-CoV-2-positive neutrophils showed increased expression of genes related to type I interferon pathway and antigen presenting module. Additionally, we observed the Haemophilus parahaemolyticus in some macrophage and epithelial cells, indicating a co-infection of the bacterium in some severe cases of COVID-19. The PathogenTrack pipeline and the Yeskit package are publicly available at GitHub.
doi_str_mv 10.1007/s11684-021-0915-9
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8861993</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2656829195</sourcerecordid><originalsourceid>FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83</originalsourceid><addsrcrecordid>eNp9kktv1DAUhSMEolXpD2CDLLFhk2I7T7NAqoZXpYoiVJBYWTfJzYzbjJ3aDtL8lf5abpRheCyajSPf7xz75CRJngt-JjivXgchyjpPuRQpV6JI1aPkWHJV0I4sHh_eRXWUnIZww-nJS1Ep9TQ5ygqhpMir4-T-C8SNW6O99tDeMrAd-4Hh1sQ3LDo3BNY7z0yHNpp-Z-yaGRuJxGGYBvBs3KuJ827LAhEDpvOYff18nga8m9C2s66DCAFjYBCYIXUgm4gda3YMxnEwLUTjLB3KVlffL96lQj1LnvQwBDzdryfJtw_vr1ef0surjxer88u0pRQxzVSf96WqOizyvhFVV4GUWSZ6CYAcsOux7kReYpvlLSpoVdVwaGTDJQdo6-wkebv4jlOzxa7FOeGgR2-24HfagdH_TqzZ6LX7qeu6FEplZPBqb-Ad5Q1Rb02YvwFYdFPQssxEXWaqkIS-_A-9cZO3FI-ooqylEqogSixU610IHvvDZQTXc_l6KV9TvXouXyvSvPg7xUHxu2oC5AIEGtk1-j9HP-RaL6KNWW_QYzd6DEFT2fRDoH9I-gsVVtBO</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2656829195</pqid></control><display><type>article</type><title>PathogenTrack and Yeskit: tools for identifying intracellular pathogens from single-cell RNA-sequencing datasets as illustrated by application to COVID-19</title><source>Springer Link</source><creator>Zhang, Wei ; Xu, Xiaoguang ; Fu, Ziyu ; Chen, Jian ; Chen, Saijuan ; Tan, Yun</creator><creatorcontrib>Zhang, Wei ; Xu, Xiaoguang ; Fu, Ziyu ; Chen, Jian ; Chen, Saijuan ; Tan, Yun</creatorcontrib><description>Pathogenic microbes can induce cellular dysfunction, immune response, and cause infectious disease and other diseases including cancers. However, the cellular distributions of pathogens and their impact on host cells remain rarely explored due to the limited methods. Taking advantage of single-cell RNA-sequencing (scRNA-seq) analysis, we can assess the transcriptomic features at the single-cell level. Still, the tools used to interpret pathogens (such as viruses, bacteria, and fungi) at the single-cell level remain to be explored. Here, we introduced PathogenTrack, a python-based computational pipeline that uses unmapped scRNA-seq data to identify intracellular pathogens at the single-cell level. In addition, we established an R package named Yeskit to import, integrate, analyze, and interpret pathogen abundance and transcriptomic features in host cells. Robustness of these tools has been tested on various real and simulated scRNA-seq datasets. PathogenTrack is competitive to the state-of-the-art tools such as Viral-Track, and the first tools for identifying bacteria at the single-cell level. Using the raw data of bronchoalveolar lavage fluid samples (BALF) from COVID-19 patients in the SRA database, we found the SARS-CoV-2 virus exists in multiple cell types including epithelial cells and macrophages. SARS-CoV-2-positive neutrophils showed increased expression of genes related to type I interferon pathway and antigen presenting module. Additionally, we observed the Haemophilus parahaemolyticus in some macrophage and epithelial cells, indicating a co-infection of the bacterium in some severe cases of COVID-19. The PathogenTrack pipeline and the Yeskit package are publicly available at GitHub.</description><identifier>ISSN: 2095-0217</identifier><identifier>EISSN: 2095-0225</identifier><identifier>DOI: 10.1007/s11684-021-0915-9</identifier><identifier>PMID: 35192147</identifier><language>eng</language><publisher>Beijing: Higher Education Press</publisher><subject>Coronaviruses ; COVID-19 ; Humans ; intracellular pathogen ; Medicine ; Medicine &amp; Public Health ; microbe ; Pathogens ; Research Article ; RNA ; SARS-CoV-2 ; SARS-CoV-2 - genetics ; scRNA-seq ; Severe acute respiratory syndrome coronavirus 2 ; Single-Cell Analysis - methods ; Transcriptome</subject><ispartof>Frontiers of medicine, 2022-04, Vol.16 (2), p.251-262</ispartof><rights>Copyright reserved, 2022, Higher Education Press</rights><rights>Higher Education Press 2022</rights><rights>2022. Higher Education Press.</rights><rights>Higher Education Press 2022.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83</citedby><cites>FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35192147$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhang, Wei</creatorcontrib><creatorcontrib>Xu, Xiaoguang</creatorcontrib><creatorcontrib>Fu, Ziyu</creatorcontrib><creatorcontrib>Chen, Jian</creatorcontrib><creatorcontrib>Chen, Saijuan</creatorcontrib><creatorcontrib>Tan, Yun</creatorcontrib><title>PathogenTrack and Yeskit: tools for identifying intracellular pathogens from single-cell RNA-sequencing datasets as illustrated by application to COVID-19</title><title>Frontiers of medicine</title><addtitle>Front. Med</addtitle><addtitle>Front Med</addtitle><description>Pathogenic microbes can induce cellular dysfunction, immune response, and cause infectious disease and other diseases including cancers. However, the cellular distributions of pathogens and their impact on host cells remain rarely explored due to the limited methods. Taking advantage of single-cell RNA-sequencing (scRNA-seq) analysis, we can assess the transcriptomic features at the single-cell level. Still, the tools used to interpret pathogens (such as viruses, bacteria, and fungi) at the single-cell level remain to be explored. Here, we introduced PathogenTrack, a python-based computational pipeline that uses unmapped scRNA-seq data to identify intracellular pathogens at the single-cell level. In addition, we established an R package named Yeskit to import, integrate, analyze, and interpret pathogen abundance and transcriptomic features in host cells. Robustness of these tools has been tested on various real and simulated scRNA-seq datasets. PathogenTrack is competitive to the state-of-the-art tools such as Viral-Track, and the first tools for identifying bacteria at the single-cell level. Using the raw data of bronchoalveolar lavage fluid samples (BALF) from COVID-19 patients in the SRA database, we found the SARS-CoV-2 virus exists in multiple cell types including epithelial cells and macrophages. SARS-CoV-2-positive neutrophils showed increased expression of genes related to type I interferon pathway and antigen presenting module. Additionally, we observed the Haemophilus parahaemolyticus in some macrophage and epithelial cells, indicating a co-infection of the bacterium in some severe cases of COVID-19. The PathogenTrack pipeline and the Yeskit package are publicly available at GitHub.</description><subject>Coronaviruses</subject><subject>COVID-19</subject><subject>Humans</subject><subject>intracellular pathogen</subject><subject>Medicine</subject><subject>Medicine &amp; Public Health</subject><subject>microbe</subject><subject>Pathogens</subject><subject>Research Article</subject><subject>RNA</subject><subject>SARS-CoV-2</subject><subject>SARS-CoV-2 - genetics</subject><subject>scRNA-seq</subject><subject>Severe acute respiratory syndrome coronavirus 2</subject><subject>Single-Cell Analysis - methods</subject><subject>Transcriptome</subject><issn>2095-0217</issn><issn>2095-0225</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kktv1DAUhSMEolXpD2CDLLFhk2I7T7NAqoZXpYoiVJBYWTfJzYzbjJ3aDtL8lf5abpRheCyajSPf7xz75CRJngt-JjivXgchyjpPuRQpV6JI1aPkWHJV0I4sHh_eRXWUnIZww-nJS1Ep9TQ5ygqhpMir4-T-C8SNW6O99tDeMrAd-4Hh1sQ3LDo3BNY7z0yHNpp-Z-yaGRuJxGGYBvBs3KuJ827LAhEDpvOYff18nga8m9C2s66DCAFjYBCYIXUgm4gda3YMxnEwLUTjLB3KVlffL96lQj1LnvQwBDzdryfJtw_vr1ef0surjxer88u0pRQxzVSf96WqOizyvhFVV4GUWSZ6CYAcsOux7kReYpvlLSpoVdVwaGTDJQdo6-wkebv4jlOzxa7FOeGgR2-24HfagdH_TqzZ6LX7qeu6FEplZPBqb-Ad5Q1Rb02YvwFYdFPQssxEXWaqkIS-_A-9cZO3FI-ooqylEqogSixU610IHvvDZQTXc_l6KV9TvXouXyvSvPg7xUHxu2oC5AIEGtk1-j9HP-RaL6KNWW_QYzd6DEFT2fRDoH9I-gsVVtBO</recordid><startdate>20220401</startdate><enddate>20220401</enddate><creator>Zhang, Wei</creator><creator>Xu, Xiaoguang</creator><creator>Fu, Ziyu</creator><creator>Chen, Jian</creator><creator>Chen, Saijuan</creator><creator>Tan, Yun</creator><general>Higher Education Press</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>K9.</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20220401</creationdate><title>PathogenTrack and Yeskit: tools for identifying intracellular pathogens from single-cell RNA-sequencing datasets as illustrated by application to COVID-19</title><author>Zhang, Wei ; Xu, Xiaoguang ; Fu, Ziyu ; Chen, Jian ; Chen, Saijuan ; Tan, Yun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Coronaviruses</topic><topic>COVID-19</topic><topic>Humans</topic><topic>intracellular pathogen</topic><topic>Medicine</topic><topic>Medicine &amp; Public Health</topic><topic>microbe</topic><topic>Pathogens</topic><topic>Research Article</topic><topic>RNA</topic><topic>SARS-CoV-2</topic><topic>SARS-CoV-2 - genetics</topic><topic>scRNA-seq</topic><topic>Severe acute respiratory syndrome coronavirus 2</topic><topic>Single-Cell Analysis - methods</topic><topic>Transcriptome</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Wei</creatorcontrib><creatorcontrib>Xu, Xiaoguang</creatorcontrib><creatorcontrib>Fu, Ziyu</creatorcontrib><creatorcontrib>Chen, Jian</creatorcontrib><creatorcontrib>Chen, Saijuan</creatorcontrib><creatorcontrib>Tan, Yun</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Frontiers of medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Wei</au><au>Xu, Xiaoguang</au><au>Fu, Ziyu</au><au>Chen, Jian</au><au>Chen, Saijuan</au><au>Tan, Yun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PathogenTrack and Yeskit: tools for identifying intracellular pathogens from single-cell RNA-sequencing datasets as illustrated by application to COVID-19</atitle><jtitle>Frontiers of medicine</jtitle><stitle>Front. Med</stitle><addtitle>Front Med</addtitle><date>2022-04-01</date><risdate>2022</risdate><volume>16</volume><issue>2</issue><spage>251</spage><epage>262</epage><pages>251-262</pages><issn>2095-0217</issn><eissn>2095-0225</eissn><abstract>Pathogenic microbes can induce cellular dysfunction, immune response, and cause infectious disease and other diseases including cancers. However, the cellular distributions of pathogens and their impact on host cells remain rarely explored due to the limited methods. Taking advantage of single-cell RNA-sequencing (scRNA-seq) analysis, we can assess the transcriptomic features at the single-cell level. Still, the tools used to interpret pathogens (such as viruses, bacteria, and fungi) at the single-cell level remain to be explored. Here, we introduced PathogenTrack, a python-based computational pipeline that uses unmapped scRNA-seq data to identify intracellular pathogens at the single-cell level. In addition, we established an R package named Yeskit to import, integrate, analyze, and interpret pathogen abundance and transcriptomic features in host cells. Robustness of these tools has been tested on various real and simulated scRNA-seq datasets. PathogenTrack is competitive to the state-of-the-art tools such as Viral-Track, and the first tools for identifying bacteria at the single-cell level. Using the raw data of bronchoalveolar lavage fluid samples (BALF) from COVID-19 patients in the SRA database, we found the SARS-CoV-2 virus exists in multiple cell types including epithelial cells and macrophages. SARS-CoV-2-positive neutrophils showed increased expression of genes related to type I interferon pathway and antigen presenting module. Additionally, we observed the Haemophilus parahaemolyticus in some macrophage and epithelial cells, indicating a co-infection of the bacterium in some severe cases of COVID-19. The PathogenTrack pipeline and the Yeskit package are publicly available at GitHub.</abstract><cop>Beijing</cop><pub>Higher Education Press</pub><pmid>35192147</pmid><doi>10.1007/s11684-021-0915-9</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2095-0217
ispartof Frontiers of medicine, 2022-04, Vol.16 (2), p.251-262
issn 2095-0217
2095-0225
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8861993
source Springer Link
subjects Coronaviruses
COVID-19
Humans
intracellular pathogen
Medicine
Medicine & Public Health
microbe
Pathogens
Research Article
RNA
SARS-CoV-2
SARS-CoV-2 - genetics
scRNA-seq
Severe acute respiratory syndrome coronavirus 2
Single-Cell Analysis - methods
Transcriptome
title PathogenTrack and Yeskit: tools for identifying intracellular pathogens from single-cell RNA-sequencing datasets as illustrated by application to COVID-19
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T16%3A12%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PathogenTrack%20and%20Yeskit:%20tools%20for%20identifying%20intracellular%20pathogens%20from%20single-cell%20RNA-sequencing%20datasets%20as%20illustrated%20by%20application%20to%20COVID-19&rft.jtitle=Frontiers%20of%20medicine&rft.au=Zhang,%20Wei&rft.date=2022-04-01&rft.volume=16&rft.issue=2&rft.spage=251&rft.epage=262&rft.pages=251-262&rft.issn=2095-0217&rft.eissn=2095-0225&rft_id=info:doi/10.1007/s11684-021-0915-9&rft_dat=%3Cproquest_pubme%3E2656829195%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c519t-39f4f697de54fb17d7a22331f2aae0aedfe8d146ec34ce9ac97b0ab2b020aac83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2656829195&rft_id=info:pmid/35192147&rfr_iscdi=true