
Recovery of 197 eukaryotic bins reveals major challenges for eukaryote genome reconstruction from terrestrial metagenomes

Bibliographic Details
Published in: Molecular ecology resources 2023-07, Vol.23 (5), p.1066-1076
Main Authors: Saraiva, Joao Pedro, Bartholomäus, Alexander, Toscan, Rodolfo Brizola, Baldrian, Petr, Nunes da Rocha, Ulisses
Format: Article
Language: English
container_end_page 1076
container_issue 5
container_start_page 1066
container_title Molecular ecology resources
container_volume 23
creator Saraiva, Joao Pedro
Bartholomäus, Alexander
Toscan, Rodolfo Brizola
Baldrian, Petr
Nunes da Rocha, Ulisses
description As most eukaryotic genomes are yet to be sequenced, the mechanisms underlying their contribution to different ecosystem processes remain largely unexplored. Although approaches to recovering prokaryotic genomes have become common in genome biology, few studies have tackled the recovery of eukaryotic genomes from metagenomes. This study applied the EukRep pipeline to 6000 metagenomes from terrestrial and some transition environments to assess the reconstruction of microbial eukaryotic genomes. Only 215 metagenomic libraries yielded eukaryotic bins. Of the 447 eukaryotic bins recovered, 197 were classified at the phylum level. Streptophytes and fungi were the most represented clades, with 83 and 73 bins, respectively. More than 78% of the eukaryotic bins were recovered from samples whose biomes were classified as host‐associated, aquatic, or anthropogenic terrestrial. However, only 93 bins were taxonomically assigned at the genus level and 17 bins at the species level. Completeness and contamination estimates were obtained for 193 bins and averaged 44.64% (σ = 27.41%) and 3.97% (σ = 6.53%), respectively. Micromonas commoda was the most frequent taxon found, while Saccharomyces cerevisiae showed the highest completeness, probably because more reference genomes are available for it. Current measures of completeness are based on the presence of single‐copy genes. However, mapping the contigs of the recovered eukaryotic bins to the chromosomes of the reference genomes showed many gaps, suggesting that completeness measures should also include chromosome coverage. Recovering eukaryotic genomes will benefit significantly from long‐read sequencing, the development of tools for dealing with repeat‐rich genomes, and improved reference genome databases.
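The description contrasts two notions of completeness: the share of expected single‐copy genes present in a bin, and the share of a reference chromosome actually covered by mapped contigs. The sketch below illustrates how the two can diverge. It is a minimal illustration only: the function names, the marker identifiers, and the interval representation are hypothetical and are not taken from the EukRep pipeline or from the authors' workflow.

```python
from typing import Iterable, Set, Tuple


def marker_completeness(found: Set[str], expected: Set[str]) -> float:
    """Percent of expected single-copy marker genes present in a bin.

    Both sets hold hypothetical marker identifiers (assumption).
    """
    if not expected:
        raise ValueError("expected marker set is empty")
    return 100.0 * len(found & expected) / len(expected)


def chromosome_coverage(
    intervals: Iterable[Tuple[int, int]], chrom_length: int
) -> float:
    """Percent of a reference chromosome covered by mapped contigs.

    Intervals are half-open [start, end) reference coordinates (assumption).
    Overlapping or touching intervals are merged so no base is counted twice.
    """
    if chrom_length <= 0:
        raise ValueError("chrom_length must be positive")
    covered = 0
    cur_start = cur_end = None
    for start, end in sorted(intervals):
        if cur_end is None or start > cur_end:
            # Close the previous merged block before starting a new one.
            if cur_end is not None:
                covered += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            # Extend the current merged block.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        covered += cur_end - cur_start
    return 100.0 * covered / chrom_length
```

A bin can score well on the first measure while leaving most of a chromosome unmapped, which is the kind of gap the description reports.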
doi_str_mv 10.1111/1755-0998.13776
format article
identifier EISSN: 1755-0998
identifier PMID: 36847735
publisher England: Wiley Subscription Services, Inc
rights 2023 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. This article is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”).
orcidid 0000-0002-8983-2721 ; 0000-0001-6972-6692 ; 0000-0003-0970-7304
fulltext fulltext
identifier ISSN: 1755-098X
ispartof Molecular ecology resources, 2023-07, Vol.23 (5), p.1066-1076
issn 1755-098X
1755-0998
language eng
recordid cdi_proquest_miscellaneous_2780482625
source Wiley:Jisc Collections:Wiley Read and Publish Open Access 2024-2025 (reading list)
subjects Anthropogenic factors
Bins
Chromosomes
Completeness
Ecosystem
Eukaryota - genetics
eukaryotes
Fungi - genetics
Gene mapping
Genome, Microbial
Genomes
genome‐resolved metagenomics
Hypocreales
Mamiellales
Metagenome
Metagenomics
Microorganisms
Reconstruction
Recovery
Saccharomycetales
title Recovery of 197 eukaryotic bins reveals major challenges for eukaryote genome reconstruction from terrestrial metagenomes
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T19%3A15%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recovery%20of%20197%20eukaryotic%20bins%20reveals%20major%20challenges%20for%20eukaryote%20genome%20reconstruction%20from%20terrestrial%20metagenomes&rft.jtitle=Molecular%20ecology%20resources&rft.au=Saraiva,%20Joao%20Pedro&rft.date=2023-07&rft.volume=23&rft.issue=5&rft.spage=1066&rft.epage=1076&rft.pages=1066-1076&rft.issn=1755-098X&rft.eissn=1755-0998&rft_id=info:doi/10.1111/1755-0998.13776&rft_dat=%3Cproquest_cross%3E2780482625%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c4126-3c7b52bfcd92e586703539104512f30c12716d10d81a52f4ebb52c8ec8673b153%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2822624396&rft_id=info:pmid/36847735&rfr_iscdi=true