Loading…
Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)
Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment pro...
Saved in:
Published in: | Systematic biology 2018-05, Vol.67 (3), p.367-383 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323 |
---|---|
cites | cdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323 |
container_end_page | 383 |
container_issue | 3 |
container_start_page | 367 |
container_title | Systematic biology |
container_volume | 67 |
creator | Moore, Abigail J. de Vos, Jurriaan M. Hancock, Lillian P. Goolsby, Eric Edwards, Erika J. |
description | Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data. |
doi_str_mv | 10.1093/sysbio/syx078 |
format | article |
fullrecord | <record><control><sourceid>jstor_proqu</sourceid><recordid>TN_cdi_proquest_miscellaneous_1951417239</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>26581964</jstor_id><oup_id>10.1093/sysbio/syx078</oup_id><sourcerecordid>26581964</sourcerecordid><originalsourceid>FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</originalsourceid><addsrcrecordid>eNqFkMFLwzAUh4Mobk6PHpUevVSTJmmTo4xtCgM9TPBW0uRl62ibmbTg_ns7Ot3R0-_x3sfvwYfQLcGPBEv6FPahKF0f3zgTZ2hMcJbGgqaf54c5pTEnPBuhqxC2GBOScnKJRonEiaRUjtFspfwaWjDRrPGl3tTQtJGz0fKwjhbQQDRXdVmVECLrfPS-2Vdu3a_bUkevjQUPjYZrdGFVFeDmmBP0MZ-tpi_x8m3xOn1exppx2sZcp9Rgy41VoDAYQgwUVDCjlBZKqoRbC0yIzBLG-ivGTBBRGMmxFZomdIIeht6dd18dhDavy6ChqlQDrgs5kZwwkiVU9mg8oNq7EDzYfOfLWvl9TnB-MJcP5vLBXM_fH6u7ogbzR_-qOv123e7frrsB3YbW-VNVygWRKaM_aZmD_A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1951417239</pqid></control><display><type>article</type><title>Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)</title><source>JSTOR Archival Journals and Primary Sources Collection</source><source>Oxford University Press:Jisc Collections:OUP Read and Publish 2024-2025 (2024 collection) (Reading list)</source><creator>Moore, Abigail J. ; de Vos, Jurriaan M. ; Hancock, Lillian P. ; Goolsby, Eric ; Edwards, Erika J.</creator><contributor>Smith, Stephen</contributor><creatorcontrib>Moore, Abigail J. ; de Vos, Jurriaan M. ; Hancock, Lillian P. ; Goolsby, Eric ; Edwards, Erika J. ; Smith, Stephen</creatorcontrib><description>Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.</description><identifier>ISSN: 1063-5157</identifier><identifier>EISSN: 1076-836X</identifier><identifier>DOI: 10.1093/sysbio/syx078</identifier><identifier>PMID: 29029339</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Caryophyllales - classification ; Caryophyllales - genetics ; Evolution, Molecular ; Genome, Plant - genetics ; Photosynthesis - genetics ; Phylogeny ; REGULAR ARTICLES</subject><ispartof>Systematic biology, 2018-05, Vol.67 (3), p.367-383</ispartof><rights>The Author(s) 2017</rights><rights>The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</citedby><cites>FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/26581964$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/26581964$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,780,784,27922,27923,58236,58469</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29029339$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Smith, Stephen</contributor><creatorcontrib>Moore, Abigail J.</creatorcontrib><creatorcontrib>de Vos, Jurriaan M.</creatorcontrib><creatorcontrib>Hancock, Lillian P.</creatorcontrib><creatorcontrib>Goolsby, Eric</creatorcontrib><creatorcontrib>Edwards, Erika J.</creatorcontrib><title>Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)</title><title>Systematic biology</title><addtitle>Syst Biol</addtitle><description>Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.</description><subject>Caryophyllales - classification</subject><subject>Caryophyllales - genetics</subject><subject>Evolution, Molecular</subject><subject>Genome, Plant - genetics</subject><subject>Photosynthesis - genetics</subject><subject>Phylogeny</subject><subject>REGULAR ARTICLES</subject><issn>1063-5157</issn><issn>1076-836X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNqFkMFLwzAUh4Mobk6PHpUevVSTJmmTo4xtCgM9TPBW0uRl62ibmbTg_ns7Ot3R0-_x3sfvwYfQLcGPBEv6FPahKF0f3zgTZ2hMcJbGgqaf54c5pTEnPBuhqxC2GBOScnKJRonEiaRUjtFspfwaWjDRrPGl3tTQtJGz0fKwjhbQQDRXdVmVECLrfPS-2Vdu3a_bUkevjQUPjYZrdGFVFeDmmBP0MZ-tpi_x8m3xOn1exppx2sZcp9Rgy41VoDAYQgwUVDCjlBZKqoRbC0yIzBLG-ivGTBBRGMmxFZomdIIeht6dd18dhDavy6ChqlQDrgs5kZwwkiVU9mg8oNq7EDzYfOfLWvl9TnB-MJcP5vLBXM_fH6u7ogbzR_-qOv123e7frrsB3YbW-VNVygWRKaM_aZmD_A</recordid><startdate>20180501</startdate><enddate>20180501</enddate><creator>Moore, Abigail J.</creator><creator>de Vos, Jurriaan M.</creator><creator>Hancock, Lillian P.</creator><creator>Goolsby, Eric</creator><creator>Edwards, Erika J.</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20180501</creationdate><title>Targeted Enrichment of Large Gene Families for Phylogenetic Inference</title><author>Moore, Abigail J. ; de Vos, Jurriaan M. ; Hancock, Lillian P. ; Goolsby, Eric ; Edwards, Erika J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Caryophyllales - classification</topic><topic>Caryophyllales - genetics</topic><topic>Evolution, Molecular</topic><topic>Genome, Plant - genetics</topic><topic>Photosynthesis - genetics</topic><topic>Phylogeny</topic><topic>REGULAR ARTICLES</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Moore, Abigail J.</creatorcontrib><creatorcontrib>de Vos, Jurriaan M.</creatorcontrib><creatorcontrib>Hancock, Lillian P.</creatorcontrib><creatorcontrib>Goolsby, Eric</creatorcontrib><creatorcontrib>Edwards, Erika J.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Systematic biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Moore, Abigail J.</au><au>de Vos, Jurriaan M.</au><au>Hancock, Lillian P.</au><au>Goolsby, Eric</au><au>Edwards, Erika J.</au><au>Smith, Stephen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales)</atitle><jtitle>Systematic biology</jtitle><addtitle>Syst Biol</addtitle><date>2018-05-01</date><risdate>2018</risdate><volume>67</volume><issue>3</issue><spage>367</spage><epage>383</epage><pages>367-383</pages><issn>1063-5157</issn><eissn>1076-836X</eissn><abstract>Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (∼2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ∼50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>29029339</pmid><doi>10.1093/sysbio/syx078</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1063-5157 |
ispartof | Systematic biology, 2018-05, Vol.67 (3), p.367-383 |
issn | 1063-5157 1076-836X |
language | eng |
recordid | cdi_proquest_miscellaneous_1951417239 |
source | JSTOR Archival Journals and Primary Sources Collection; Oxford University Press:Jisc Collections:OUP Read and Publish 2024-2025 (2024 collection) (Reading list) |
subjects | Caryophyllales - classification Caryophyllales - genetics Evolution, Molecular Genome, Plant - genetics Photosynthesis - genetics Phylogeny REGULAR ARTICLES |
title | Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales) |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T11%3A07%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Targeted%20Enrichment%20of%20Large%20Gene%20Families%20for%20Phylogenetic%20Inference:%20Phylogeny%20and%20Molecular%20Evolution%20of%20Photosynthesis%20Genes%20in%20the%20Portullugo%20Clade%20(Caryophyllales)&rft.jtitle=Systematic%20biology&rft.au=Moore,%20Abigail%20J.&rft.date=2018-05-01&rft.volume=67&rft.issue=3&rft.spage=367&rft.epage=383&rft.pages=367-383&rft.issn=1063-5157&rft.eissn=1076-836X&rft_id=info:doi/10.1093/sysbio/syx078&rft_dat=%3Cjstor_proqu%3E26581964%3C/jstor_proqu%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c453t-5c63d0f5dfaea0ed11deb384daac8a9a25ffe4887f144d11004818bd950f8c323%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1951417239&rft_id=info:pmid/29029339&rft_jstor_id=26581964&rft_oup_id=10.1093/sysbio/syx078&rfr_iscdi=true |