Stretch coding and block coding: two new strategies to represent questionably aligned DNA sequences
Most coding strategies that address the problem of questionable alignment (elision, case sensitive, missing, polymorphic, gaps as presence/absence matrix) conflict with phylogenetic principles, particularly those relating to the concept of homology (shared similarity explained by common ancestry)....
Published in: | Journal of molecular evolution 2002-02, Vol.54 (2), p.191-199 |
---|---|
Main Author: | Geiger, Daniel L |
Format: | Article |
Language: | English |
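Subjects: | Animals; Base Sequence; Deoxyribonucleic acid; DNA; Evolutionary biology; Genetics; Molecular biology; Molecular Sequence Data; Polymorphism, Genetic; Sequence Alignment - methods |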
cited_by | cdi_FETCH-LOGICAL-c355t-b818b253821b316d499c6a90c728fa97631ef3b4d27ff06d7fc198c7e1f413843 |
---|---|
cites | |
container_end_page | 199 |
container_issue | 2 |
container_start_page | 191 |
container_title | Journal of molecular evolution |
container_volume | 54 |
creator | Geiger, Daniel L |
description | Most coding strategies that address the problem of questionable alignment (elision, case sensitive, missing, polymorphic, gaps as presence/absence matrix) conflict with phylogenetic principles, particularly those relating to the concept of homology (shared similarity explained by common ancestry). In some cases, they fail the test of conjunction. In other cases, characters that are coded ambiguously can lead to character-state optimization in the terminal taxa that conflicts with the original observations. Only data exclusion and contraction avoid these pitfalls. In highly dissimilar sequences, additional character states can represent the available information. Two new methods that accomplish this, block coding and stretch coding, are introduced here. These two new coding strategies do not conflict with the test of conjunction and do not contradict the original observations. They are comparable to coding practices with morphological data once the intrinsic differences due to character-state identity and topographical identity have been taken into account. It is suggested that, of the three recoding methods, the one selected should be the one that preserves the maximum potential phylogenetic information, as measured by the minimum number of steps required for the particular part of the data matrix. |
doi_str_mv | 10.1007/s00239-001-0001-5 |
format | article |
identifier | ISSN: 0022-2844 |
ispartof | Journal of molecular evolution, 2002-02, Vol.54 (2), p.191-199 |
issn | 0022-2844 1432-1432 |
language | eng |
recordid | cdi_proquest_miscellaneous_71424624 |
source | Springer Link |
subjects | Animals; Base Sequence; Deoxyribonucleic acid; DNA; Evolutionary biology; Genetics; Molecular biology; Molecular Sequence Data; Polymorphism, Genetic; Sequence Alignment - methods |
title | Stretch coding and block coding: two new strategies to represent questionably aligned DNA sequences |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T20%3A05%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stretch%20coding%20and%20block%20coding:%20two%20new%20strategies%20to%20represent%20questionably%20aligned%20DNA%20sequences&rft.jtitle=Journal%20of%20molecular%20evolution&rft.au=Geiger,%20Daniel%20L&rft.date=2002-02-01&rft.volume=54&rft.issue=2&rft.spage=191&rft.epage=199&rft.pages=191-199&rft.issn=0022-2844&rft.eissn=1432-1432&rft_id=info:doi/10.1007/s00239-001-0001-5&rft_dat=%3Cproquest_cross%3E18280418%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c355t-b818b253821b316d499c6a90c728fa97631ef3b4d27ff06d7fc198c7e1f413843%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=928741930&rft_id=info:pmid/11821912&rfr_iscdi=true |
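
The selection criterion stated at the end of the abstract (prefer the recoding that needs the larger minimum number of steps for a given part of the data matrix) can be illustrated with a small sketch. It relies on one standard fact: an unordered character with k observed states needs at least k - 1 steps on any tree. Everything else here is an assumption made for illustration only. The function names, the taxon names, and the toy sequences are hypothetical, and the two ways of cutting the region into characters (one character per aligned position versus the whole region as a single character) are generic stand-ins, not the article's definitions of contraction, block coding, or stretch coding, which the abstract does not spell out.

```python
from typing import Dict, List


def min_steps(states: List[str]) -> int:
    """Minimum steps for one unordered character: distinct states minus one.

    "?" is treated as missing data and ignored (an assumption of this sketch).
    """
    observed = {s for s in states if s != "?"}
    return max(len(observed) - 1, 0)


def min_steps_per_position(region: Dict[str, str]) -> int:
    """Treat every aligned position of the region as its own character."""
    columns = zip(*region.values())  # assumes equal-length sequences
    return sum(min_steps(list(col)) for col in columns)


def min_steps_whole_region(region: Dict[str, str]) -> int:
    """Treat the whole region as one character; each distinct string is a state."""
    return min_steps(list(region.values()))


# Hypothetical toy region for four taxa.
region = {
    "taxon_A": "ACGT",
    "taxon_B": "ACGA",
    "taxon_C": "TTGA",
    "taxon_D": "TTGA",
}

scores = {
    "per_position": min_steps_per_position(region),
    "whole_region": min_steps_whole_region(region),
}
# Under the abstract's criterion, the coding with the larger minimum
# number of steps would be preferred for this part of the matrix.
preferred = max(scores, key=scores.get)
print(scores, preferred)
```

In this toy case the per-position view needs at least 3 steps and the whole-region view at least 2, so the per-position view would be kept. With real data the comparison would be made among the actual recodings described in the article.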