An algorithm for Morphological Phylogenetic Analysis with Inapplicable Data

Morphological data play a key role in the inference of biological relationships and evolutionary history and are essential for the interpretation of the fossil record. The hierarchical interdependence of many morphological characters, however, complicates phylogenetic analysis. In particular, many c...

Bibliographic Details
Published in: Systematic biology 2019-07, Vol.68 (4), p.619-631
Main Authors: Brazeau, Martin D.; Guillerme, Thomas; Smith, Martin R.
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3
cites cdi_FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3
container_end_page 631
container_issue 4
container_start_page 619
container_title Systematic biology
container_volume 68
creator Brazeau, Martin D.
Guillerme, Thomas
Smith, Martin R.
description Morphological data play a key role in the inference of biological relationships and evolutionary history and are essential for the interpretation of the fossil record. The hierarchical interdependence of many morphological characters, however, complicates phylogenetic analysis. In particular, many characters only apply to a subset of terminal taxa. The widely used “reductive coding” approach treats taxa in which a character is inapplicable as though the character’s state is simply missing (unknown). This approach has long been known to create spurious tree length estimates on certain topologies, potentially leading to erroneous results in phylogenetic searches, but practical solutions have yet to be proposed and implemented. Here, we present a single-character algorithm for reconstructing ancestral states in reductively coded data sets, following the theoretical guideline of minimizing homoplasy over all characters. Our algorithm uses up to three traversals to score a tree, and a fourth to fully resolve final states at each node within the tree. We use explicit criteria to resolve ambiguity in applicable/inapplicable dichotomies, and to optimize missing data. So that it can be applied to single characters, the algorithm employs local optimization; as such, the method provides a fast but approximate inference of ancestral states and tree score. The application of our method to published morphological data sets indicates that, compared to traditional methods, it identifies different trees as “optimal.” As such, the use of our algorithm to handle inapplicable data may significantly alter the outcome of tree searches, modifying the inferred placement of living and fossil taxa and potentially leading to major differences in reconstructions of evolutionary history.
doi_str_mv 10.1093/sysbio/syy083
format article
fullrecord <record><control><sourceid>jstor_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6568014</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>26804885</jstor_id><sourcerecordid>26804885</sourcerecordid><originalsourceid>FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3</originalsourceid><addsrcrecordid>eNqFkUtLxDAUhYMovpculS7dVJPm0XQjDOMTFV0ouAuZzO1MJNPUpKP03xupDrpydS_cj8M59yB0QPAJwRU9jX2cWJ9GjyVdQ9sElyKXVLysf-2C5pzwcgvtxPiKMSGCk020RTGnnJTFNrodNZl2Mx9sN19ktQ_ZvQ_t3Ds_s0a77HHepxUa6KzJRo12fbQx-0h0dtPotnWJmjjIznWn99BGrV2E_e-5i54vL57G1_ndw9XNeHSXG8aKLjc1g0lVGlkzCVpoMaUArCKG11VVUyaFqchUQoGxSDaBmILzWhsmCS0qCXQXnQ267XKygKmBpgvaqTbYhQ698tqqv5fGztXMvyvBhcSEJYHjb4Hg35YQO7Ww0YBzugG_jKqgJSOVLNNH_0UJTw8WUuKE5gNqgo8xQL1yRLD66koNXamhq8Qf_Y6xon_KScDhALzGzofVvUghmJScfgJuDZ05</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2155156880</pqid></control><display><type>article</type><title>An algorithm for Morphological Phylogenetic Analysis with Inapplicable Data</title><source>JSTOR Archival Journals and Primary Sources Collection</source><source>Oxford Journals Online</source><creator>Brazeau, Martin D. ; Guillerme, Thomas ; Smith, Martin R.</creator><contributor>Foster, Peter</contributor><creatorcontrib>Brazeau, Martin D. ; Guillerme, Thomas ; Smith, Martin R. ; Foster, Peter</creatorcontrib><description>Morphological data play a key role in the inference of biological relationships and evolutionary history and are essential for the interpretation of the fossil record. The hierarchical interdependence of many morphological characters, however, complicates phylogenetic analysis. In particular, many characters only apply to a subset of terminal taxa. The widely used “reductive coding” approach treats taxa in which a character is inapplicable as though the character’s state is simply missing (unknown). 
This approach has long been known to create spurious tree length estimates on certain topologies, potentially leading to erroneous results in phylogenetic searches, but practical solutions have yet to be proposed and implemented. Here, we present a single-character algorithm for reconstructing ancestral states in reductively coded data sets, following the theoretical guideline of minimizing homoplasy over all characters. Our algorithm uses up to three traversals to score a tree, and a fourth to fully resolve final states at each node within the tree. We use explicit criteria to resolve ambiguity in applicable/inapplicable dichotomies, and to optimize missing data. So that it can be applied to single characters, the algorithm employs local optimization; as such, the method provides a fast but approximate inference of ancestral states and tree score. The application of our method to published morphological data sets indicates that, compared to traditional methods, it identifies different trees as “optimal.” As such, the use of our algorithm to handle inapplicable data may significantly alter the outcome of tree searches, modifying the inferred placement of living and fossil taxa and potentially leading to major differences in reconstructions of evolutionary history.</description><identifier>ISSN: 1063-5157</identifier><identifier>ISSN: 1076-836X</identifier><identifier>EISSN: 1076-836X</identifier><identifier>DOI: 10.1093/sysbio/syy083</identifier><identifier>PMID: 30535172</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Algorithms ; Classification - methods ; data collection ; Fossils ; guidelines ; Phylogeny ; Regular ; REGULAR ARTICLES ; topology ; trees</subject><ispartof>Systematic biology, 2019-07, Vol.68 (4), p.619-631</ispartof><rights>The Author(s) 2018</rights><rights>The Author(s) 2018. 
Published by Oxford University Press, on behalf of the Society of Systematic Biologists.</rights><rights>The Author(s) 2018. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3</citedby><cites>FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/26804885$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/26804885$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>230,314,778,782,883,27907,27908,58221,58454</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30535172$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Foster, Peter</contributor><creatorcontrib>Brazeau, Martin D.</creatorcontrib><creatorcontrib>Guillerme, Thomas</creatorcontrib><creatorcontrib>Smith, Martin R.</creatorcontrib><title>An algorithm for Morphological Phylogenetic Analysis with Inapplicable Data</title><title>Systematic biology</title><addtitle>Syst Biol</addtitle><description>Morphological data play a key role in the inference of biological relationships and evolutionary history and are essential for the interpretation of the fossil record. The hierarchical interdependence of many morphological characters, however, complicates phylogenetic analysis. In particular, many characters only apply to a subset of terminal taxa. The widely used “reductive coding” approach treats taxa in which a character is inapplicable as though the character’s state is simply missing (unknown). 
This approach has long been known to create spurious tree length estimates on certain topologies, potentially leading to erroneous results in phylogenetic searches, but practical solutions have yet to be proposed and implemented. Here, we present a single-character algorithm for reconstructing ancestral states in reductively coded data sets, following the theoretical guideline of minimizing homoplasy over all characters. Our algorithm uses up to three traversals to score a tree, and a fourth to fully resolve final states at each node within the tree. We use explicit criteria to resolve ambiguity in applicable/inapplicable dichotomies, and to optimize missing data. So that it can be applied to single characters, the algorithm employs local optimization; as such, the method provides a fast but approximate inference of ancestral states and tree score. The application of our method to published morphological data sets indicates that, compared to traditional methods, it identifies different trees as “optimal.” As such, the use of our algorithm to handle inapplicable data may significantly alter the outcome of tree searches, modifying the inferred placement of living and fossil taxa and potentially leading to major differences in reconstructions of evolutionary history.</description><subject>Algorithms</subject><subject>Classification - methods</subject><subject>data collection</subject><subject>Fossils</subject><subject>guidelines</subject><subject>Phylogeny</subject><subject>Regular</subject><subject>REGULAR 
ARTICLES</subject><subject>topology</subject><subject>trees</subject><issn>1063-5157</issn><issn>1076-836X</issn><issn>1076-836X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNqFkUtLxDAUhYMovpculS7dVJPm0XQjDOMTFV0ouAuZzO1MJNPUpKP03xupDrpydS_cj8M59yB0QPAJwRU9jX2cWJ9GjyVdQ9sElyKXVLysf-2C5pzwcgvtxPiKMSGCk020RTGnnJTFNrodNZl2Mx9sN19ktQ_ZvQ_t3Ds_s0a77HHepxUa6KzJRo12fbQx-0h0dtPotnWJmjjIznWn99BGrV2E_e-5i54vL57G1_ndw9XNeHSXG8aKLjc1g0lVGlkzCVpoMaUArCKG11VVUyaFqchUQoGxSDaBmILzWhsmCS0qCXQXnQ267XKygKmBpgvaqTbYhQ698tqqv5fGztXMvyvBhcSEJYHjb4Hg35YQO7Ww0YBzugG_jKqgJSOVLNNH_0UJTw8WUuKE5gNqgo8xQL1yRLD66koNXamhq8Qf_Y6xon_KScDhALzGzofVvUghmJScfgJuDZ05</recordid><startdate>20190701</startdate><enddate>20190701</enddate><creator>Brazeau, Martin D.</creator><creator>Guillerme, Thomas</creator><creator>Smith, Martin R.</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7S9</scope><scope>L.6</scope><scope>5PM</scope></search><sort><creationdate>20190701</creationdate><title>An algorithm for Morphological Phylogenetic Analysis with Inapplicable Data</title><author>Brazeau, Martin D. 
; Guillerme, Thomas ; Smith, Martin R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Classification - methods</topic><topic>data collection</topic><topic>Fossils</topic><topic>guidelines</topic><topic>Phylogeny</topic><topic>Regular</topic><topic>REGULAR ARTICLES</topic><topic>topology</topic><topic>trees</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Brazeau, Martin D.</creatorcontrib><creatorcontrib>Guillerme, Thomas</creatorcontrib><creatorcontrib>Smith, Martin R.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Systematic biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Brazeau, Martin D.</au><au>Guillerme, Thomas</au><au>Smith, Martin R.</au><au>Foster, Peter</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An algorithm for Morphological Phylogenetic Analysis with Inapplicable Data</atitle><jtitle>Systematic biology</jtitle><addtitle>Syst Biol</addtitle><date>2019-07-01</date><risdate>2019</risdate><volume>68</volume><issue>4</issue><spage>619</spage><epage>631</epage><pages>619-631</pages><issn>1063-5157</issn><issn>1076-836X</issn><eissn>1076-836X</eissn><abstract>Morphological data play a key role in the inference of biological relationships and 
evolutionary history and are essential for the interpretation of the fossil record. The hierarchical interdependence of many morphological characters, however, complicates phylogenetic analysis. In particular, many characters only apply to a subset of terminal taxa. The widely used “reductive coding” approach treats taxa in which a character is inapplicable as though the character’s state is simply missing (unknown). This approach has long been known to create spurious tree length estimates on certain topologies, potentially leading to erroneous results in phylogenetic searches, but practical solutions have yet to be proposed and implemented. Here, we present a single-character algorithm for reconstructing ancestral states in reductively coded data sets, following the theoretical guideline of minimizing homoplasy over all characters. Our algorithm uses up to three traversals to score a tree, and a fourth to fully resolve final states at each node within the tree. We use explicit criteria to resolve ambiguity in applicable/inapplicable dichotomies, and to optimize missing data. So that it can be applied to single characters, the algorithm employs local optimization; as such, the method provides a fast but approximate inference of ancestral states and tree score. The application of our method to published morphological data sets indicates that, compared to traditional methods, it identifies different trees as “optimal.” As such, the use of our algorithm to handle inapplicable data may significantly alter the outcome of tree searches, modifying the inferred placement of living and fossil taxa and potentially leading to major differences in reconstructions of evolutionary history.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>30535172</pmid><doi>10.1093/sysbio/syy083</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1063-5157
ispartof Systematic biology, 2019-07, Vol.68 (4), p.619-631
issn 1063-5157
1076-836X
1076-836X
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6568014
source JSTOR Archival Journals and Primary Sources Collection; Oxford Journals Online
subjects Algorithms
Classification - methods
data collection
Fossils
guidelines
Phylogeny
Regular
REGULAR ARTICLES
topology
trees
title An algorithm for Morphological Phylogenetic Analysis with Inapplicable Data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T03%3A24%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20algorithm%20for%20Morphological%20Phylogenetic%20Analysis%20with%20Inapplicable%20Data&rft.jtitle=Systematic%20biology&rft.au=Brazeau,%20Martin%20D.&rft.date=2019-07-01&rft.volume=68&rft.issue=4&rft.spage=619&rft.epage=631&rft.pages=619-631&rft.issn=1063-5157&rft.eissn=1076-836X&rft_id=info:doi/10.1093/sysbio/syy083&rft_dat=%3Cjstor_pubme%3E26804885%3C/jstor_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c442t-cf4eb97c8f48ea6a6d3ee491c5f99f3486c91d8e2006535e1c255fac4813298e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2155156880&rft_id=info:pmid/30535172&rft_jstor_id=26804885&rfr_iscdi=true