
Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)


Bibliographic Details
Published in: Systematic biology 2018-05, Vol.67 (3), p.367-383
Main Authors: Moore, Abigail J.; de Vos, Jurriaan M.; Hancock, Lillian P.; Goolsby, Eric; Edwards, Erika J.
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323
cites cdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323
container_end_page 383
container_issue 3
container_start_page 367
container_title Systematic biology
container_volume 67
creator Moore, Abigail J.
de Vos, Jurriaan M.
Hancock, Lillian P.
Goolsby, Eric
Edwards, Erika J.
description Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.
doi_str_mv 10.1093/sysbio/syx078
format article
fullrecord <record><control><sourceid>jstor_proqu</sourceid><recordid>TN_cdi_proquest_miscellaneous_1951417239</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>26581964</jstor_id><oup_id>10.1093/sysbio/syx078</oup_id><sourcerecordid>26581964</sourcerecordid><originalsourceid>FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</originalsourceid><addsrcrecordid>eNqFkMFLwzAUh4Mobk6PHpUevVSTJmmTo4xtCgM9TPBW0uRl62ibmbTg_ns7Ot3R0-_x3sfvwYfQLcGPBEv6FPahKF0f3zgTZ2hMcJbGgqaf54c5pTEnPBuhqxC2GBOScnKJRonEiaRUjtFspfwaWjDRrPGl3tTQtJGz0fKwjhbQQDRXdVmVECLrfPS-2Vdu3a_bUkevjQUPjYZrdGFVFeDmmBP0MZ-tpi_x8m3xOn1exppx2sZcp9Rgy41VoDAYQgwUVDCjlBZKqoRbC0yIzBLG-ivGTBBRGMmxFZomdIIeht6dd18dhDavy6ChqlQDrgs5kZwwkiVU9mg8oNq7EDzYfOfLWvl9TnB-MJcP5vLBXM_fH6u7ogbzR_-qOv123e7frrsB3YbW-VNVygWRKaM_aZmD_A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1951417239</pqid></control><display><type>article</type><title>Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)</title><source>JSTOR Archival Journals and Primary Sources Collection</source><source>Oxford University Press:Jisc Collections:OUP Read and Publish 2024-2025 (2024 collection) (Reading list)</source><creator>Moore, Abigail J. ; de Vos, Jurriaan M. ; Hancock, Lillian P. ; Goolsby, Eric ; Edwards, Erika J.</creator><contributor>Smith, Stephen</contributor><creatorcontrib>Moore, Abigail J. ; de Vos, Jurriaan M. ; Hancock, Lillian P. ; Goolsby, Eric ; Edwards, Erika J. ; Smith, Stephen</creatorcontrib><description>Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. 
The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. 
Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.</description><identifier>ISSN: 1063-5157</identifier><identifier>EISSN: 1076-836X</identifier><identifier>DOI: 10.1093/sysbio/syx078</identifier><identifier>PMID: 29029339</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Caryophyllales - classification ; Caryophyllales - genetics ; Evolution, Molecular ; Genome, Plant - genetics ; Photosynthesis - genetics ; Phylogeny ; REGULAR ARTICLES</subject><ispartof>Systematic biology, 2018-05, Vol.67 (3), p.367-383</ispartof><rights>The Author(s) 2017</rights><rights>The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</citedby><cites>FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/26581964$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/26581964$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,780,784,27922,27923,58236,58469</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29029339$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Smith, Stephen</contributor><creatorcontrib>Moore, Abigail J.</creatorcontrib><creatorcontrib>de Vos, Jurriaan M.</creatorcontrib><creatorcontrib>Hancock, Lillian P.</creatorcontrib><creatorcontrib>Goolsby, 
Eric</creatorcontrib><creatorcontrib>Edwards, Erika J.</creatorcontrib><title>Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)</title><title>Systematic biology</title><addtitle>Syst Biol</addtitle><description>Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. 
In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.</description><subject>Caryophyllales - classification</subject><subject>Caryophyllales - genetics</subject><subject>Evolution, Molecular</subject><subject>Genome, Plant - genetics</subject><subject>Photosynthesis - genetics</subject><subject>Phylogeny</subject><subject>REGULAR ARTICLES</subject><issn>1063-5157</issn><issn>1076-836X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNqFkMFLwzAUh4Mobk6PHpUevVSTJmmTo4xtCgM9TPBW0uRl62ibmbTg_ns7Ot3R0-_x3sfvwYfQLcGPBEv6FPahKF0f3zgTZ2hMcJbGgqaf54c5pTEnPBuhqxC2GBOScnKJRonEiaRUjtFspfwaWjDRrPGl3tTQtJGz0fKwjhbQQDRXdVmVECLrfPS-2Vdu3a_bUkevjQUPjYZrdGFVFeDmmBP0MZ-tpi_x8m3xOn1exppx2sZcp9Rgy41VoDAYQgwUVDCjlBZKqoRbC0yIzBLG-ivGTBBRGMmxFZomdIIeht6dd18dhDavy6ChqlQDrgs5kZwwkiVU9mg8oNq7EDzYfOfLWvl9TnB-MJcP5vLBXM_fH6u7ogbzR_-qOv123e7frrsB3YbW-VNVygWRKaM_aZmD_A</recordid><startdate>20180501</startdate><enddate>20180501</enddate><creator>Moore, Abigail J.</creator><creator>de Vos, Jurriaan M.</creator><creator>Hancock, Lillian P.</creator><creator>Goolsby, Eric</creator><creator>Edwards, Erika J.</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20180501</creationdate><title>Targeted Enrichment of Large Gene Families for Phylogenetic Inference</title><author>Moore, Abigail J. ; de Vos, Jurriaan M. ; Hancock, Lillian P. 
; Goolsby, Eric ; Edwards, Erika J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Caryophyllales - classification</topic><topic>Caryophyllales - genetics</topic><topic>Evolution, Molecular</topic><topic>Genome, Plant - genetics</topic><topic>Photosynthesis - genetics</topic><topic>Phylogeny</topic><topic>REGULAR ARTICLES</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Moore, Abigail J.</creatorcontrib><creatorcontrib>de Vos, Jurriaan M.</creatorcontrib><creatorcontrib>Hancock, Lillian P.</creatorcontrib><creatorcontrib>Goolsby, Eric</creatorcontrib><creatorcontrib>Edwards, Erika J.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Systematic biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Moore, Abigail J.</au><au>de Vos, Jurriaan M.</au><au>Hancock, Lillian P.</au><au>Goolsby, Eric</au><au>Edwards, Erika J.</au><au>Smith, Stephen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)</atitle><jtitle>Systematic biology</jtitle><addtitle>Syst Biol</addtitle><date>2018-05-01</date><risdate>2018</risdate><volume>67</volume><issue>3</issue><spage>367</spage><epage>383</epage><pages>367-383</pages><issn>1063-5157</issn><eissn>1076-836X</eissn><abstract>Hybrid 
enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. 
Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>29029339</pmid><doi>10.1093/sysbio/syx078</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1063-5157
ispartof Systematic biology, 2018-05, Vol.67 (3), p.367-383
issn 1063-5157
1076-836X
language eng
recordid cdi_proquest_miscellaneous_1951417239
source JSTOR Archival Journals and Primary Sources Collection; Oxford University Press:Jisc Collections:OUP Read and Publish 2024-2025 (2024 collection) (Reading list)
subjects Caryophyllales - classification
Caryophyllales - genetics
Evolution, Molecular
Genome, Plant - genetics
Photosynthesis - genetics
Phylogeny
REGULAR ARTICLES
title Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T11%3A07%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Targeted%20Enrichment%20of%20Large%20Gene%20Families%20for%20Phylogenetic%20Inference:%20Phylogeny%20and%20Molecular%20Evolution%20of%20Photosynthesis%20Genes%20in%20the%20Portullugo%20Clade%20(Caryophyllales)&rft.jtitle=Systematic%20biology&rft.au=Moore,%20Abigail%20J.&rft.date=2018-05-01&rft.volume=67&rft.issue=3&rft.spage=367&rft.epage=383&rft.pages=367-383&rft.issn=1063-5157&rft.eissn=1076-836X&rft_id=info:doi/10.1093/sysbio/syx078&rft_dat=%3Cjstor_proqu%3E26581964%3C/jstor_proqu%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1951417239&rft_id=info:pmid/29029339&rft_jstor_id=26581964&rft_oup_id=10.1093/sysbio/syx078&rfr_iscdi=true