Loading…
Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison
The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient an...
Saved in:
Published in: | Journal of molecular evolution 2023-02, Vol.91 (1), p.93-131 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063 |
---|---|
cites | cdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063 |
container_end_page | 131 |
container_issue | 1 |
container_start_page | 93 |
container_title | Journal of molecular evolution |
container_volume | 91 |
creator | Dey, Sudeshna Das, Subhram Bhattacharya, D. K. |
description | The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small. |
doi_str_mv | 10.1007/s00239-022-10082-0 |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9805373</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2759961515</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</originalsourceid><addsrcrecordid>eNp9kcFPFDEUxhuDkQX9BzyQJl64jL620-nUg8myUTQBJBHPTafzli2ZmQ7tLMh_T9dFRA9c2rx8v_e1732EvGXwngGoDwmAC10A50Wua17ACzJjpchlPnbIDDYSr8tyl-yldAXAlNTiFdkVlawVU_WMmCMf3Ap772xHz2MYMU539MgmbOl5SH7yYcjKqZ2i__WRzukZ3tL5OMZg3YpehFsb20SPcQg90h94vcbBIV2EfrTRpzC8Ji-Xtkv45uHeJz-_fL5YfC1Ovh9_W8xPCleqcip425baMtkKWDaNVJbZVopaO9lU0ECjXSu5wmWlXcMbjXUltJJQQ5ZAQCX2yaet77huemwdDlO0nRmj7228M8F6868y-JW5DDdG1yCFEtng8MEghjxFmkzvk8OuswOGdTI8r05XTDKZ0Xf_oVdhHfOaNlRVSVWWWmWKbykXQ0oRl4-fYWA2-ZltfiaHZH7nZyA3HTwd47HlT2AZEFsgZWm4xPj37Wds7wFPNKY_</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2766574497</pqid></control><display><type>article</type><title>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</title><source>Springer Nature</source><creator>Dey, Sudeshna ; Das, Subhram ; Bhattacharya, D. K.</creator><creatorcontrib>Dey, Sudeshna ; Das, Subhram ; Bhattacharya, D. K.</creatorcontrib><description>The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.</description><identifier>ISSN: 0022-2844</identifier><identifier>EISSN: 1432-1432</identifier><identifier>DOI: 10.1007/s00239-022-10082-0</identifier><identifier>PMID: 36587178</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Alignment ; Animal Genetics and Genomics ; Animals ; Bioinformatics ; Biomedical and Life Sciences ; Cell Biology ; Clustering ; Computational Biology - methods ; Evolutionary Biology ; Genome - genetics ; Genomes ; Graph theory ; Life Sciences ; Mammals - genetics ; Microbiology ; Nucleotide sequence ; Nucleotides ; Nucleotides - genetics ; Original ; Original Article ; Phylogeny ; Plant Genetics and Genomics ; Plant Sciences ; Sequence analysis ; Sequence Analysis, DNA - methods</subject><ispartof>Journal of molecular evolution, 2023-02, Vol.91 (1), p.93-131</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>2022. The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</citedby><cites>FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</cites><orcidid>0000-0003-2899-4433</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36587178$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Dey, Sudeshna</creatorcontrib><creatorcontrib>Das, Subhram</creatorcontrib><creatorcontrib>Bhattacharya, D. K.</creatorcontrib><title>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</title><title>Journal of molecular evolution</title><addtitle>J Mol Evol</addtitle><addtitle>J Mol Evol</addtitle><description>The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Animal Genetics and Genomics</subject><subject>Animals</subject><subject>Bioinformatics</subject><subject>Biomedical and Life Sciences</subject><subject>Cell Biology</subject><subject>Clustering</subject><subject>Computational Biology - methods</subject><subject>Evolutionary Biology</subject><subject>Genome - genetics</subject><subject>Genomes</subject><subject>Graph theory</subject><subject>Life Sciences</subject><subject>Mammals - genetics</subject><subject>Microbiology</subject><subject>Nucleotide sequence</subject><subject>Nucleotides</subject><subject>Nucleotides - genetics</subject><subject>Original</subject><subject>Original Article</subject><subject>Phylogeny</subject><subject>Plant Genetics and Genomics</subject><subject>Plant Sciences</subject><subject>Sequence analysis</subject><subject>Sequence Analysis, DNA - methods</subject><issn>0022-2844</issn><issn>1432-1432</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kcFPFDEUxhuDkQX9BzyQJl64jL620-nUg8myUTQBJBHPTafzli2ZmQ7tLMh_T9dFRA9c2rx8v_e1732EvGXwngGoDwmAC10A50Wua17ACzJjpchlPnbIDDYSr8tyl-yldAXAlNTiFdkVlawVU_WMmCMf3Ap772xHz2MYMU539MgmbOl5SH7yYcjKqZ2i__WRzukZ3tL5OMZg3YpehFsb20SPcQg90h94vcbBIV2EfrTRpzC8Ji-Xtkv45uHeJz-_fL5YfC1Ovh9_W8xPCleqcip425baMtkKWDaNVJbZVopaO9lU0ECjXSu5wmWlXcMbjXUltJJQQ5ZAQCX2yaet77huemwdDlO0nRmj7228M8F6868y-JW5DDdG1yCFEtng8MEghjxFmkzvk8OuswOGdTI8r05XTDKZ0Xf_oVdhHfOaNlRVSVWWWmWKbykXQ0oRl4-fYWA2-ZltfiaHZH7nZyA3HTwd47HlT2AZEFsgZWm4xPj37Wds7wFPNKY_</recordid><startdate>20230201</startdate><enddate>20230201</enddate><creator>Dey, Sudeshna</creator><creator>Das, Subhram</creator><creator>Bhattacharya, D. K.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QL</scope><scope>7QP</scope><scope>7QR</scope><scope>7T7</scope><scope>7TK</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2O</scope><scope>M7N</scope><scope>M7P</scope><scope>MBDVC</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-2899-4433</orcidid></search><sort><creationdate>20230201</creationdate><title>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</title><author>Dey, Sudeshna ; Das, Subhram ; Bhattacharya, D. K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Animal Genetics and Genomics</topic><topic>Animals</topic><topic>Bioinformatics</topic><topic>Biomedical and Life Sciences</topic><topic>Cell Biology</topic><topic>Clustering</topic><topic>Computational Biology - methods</topic><topic>Evolutionary Biology</topic><topic>Genome - genetics</topic><topic>Genomes</topic><topic>Graph theory</topic><topic>Life Sciences</topic><topic>Mammals - genetics</topic><topic>Microbiology</topic><topic>Nucleotide sequence</topic><topic>Nucleotides</topic><topic>Nucleotides - genetics</topic><topic>Original</topic><topic>Original Article</topic><topic>Phylogeny</topic><topic>Plant Genetics and Genomics</topic><topic>Plant Sciences</topic><topic>Sequence analysis</topic><topic>Sequence Analysis, DNA - methods</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dey, Sudeshna</creatorcontrib><creatorcontrib>Das, Subhram</creatorcontrib><creatorcontrib>Bhattacharya, D. K.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Neurosciences Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Research Library</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Research Library (Corporate)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of molecular evolution</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dey, Sudeshna</au><au>Das, Subhram</au><au>Bhattacharya, D. K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</atitle><jtitle>Journal of molecular evolution</jtitle><stitle>J Mol Evol</stitle><addtitle>J Mol Evol</addtitle><date>2023-02-01</date><risdate>2023</risdate><volume>91</volume><issue>1</issue><spage>93</spage><epage>131</epage><pages>93-131</pages><issn>0022-2844</issn><eissn>1432-1432</eissn><abstract>The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.</abstract><cop>New York</cop><pub>Springer US</pub><pmid>36587178</pmid><doi>10.1007/s00239-022-10082-0</doi><tpages>39</tpages><orcidid>https://orcid.org/0000-0003-2899-4433</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0022-2844 |
ispartof | Journal of molecular evolution, 2023-02, Vol.91 (1), p.93-131 |
issn | 0022-2844 1432-1432 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9805373 |
source | Springer Nature |
subjects | Algorithms Alignment Animal Genetics and Genomics Animals Bioinformatics Biomedical and Life Sciences Cell Biology Clustering Computational Biology - methods Evolutionary Biology Genome - genetics Genomes Graph theory Life Sciences Mammals - genetics Microbiology Nucleotide sequence Nucleotides Nucleotides - genetics Original Original Article Phylogeny Plant Genetics and Genomics Plant Sciences Sequence analysis Sequence Analysis, DNA - methods |
title | Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T21%3A40%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Biochemical%20Property%20Based%20Positional%20Matrix:%20A%20New%20Approach%20Towards%20Genome%20Sequence%20Comparison&rft.jtitle=Journal%20of%20molecular%20evolution&rft.au=Dey,%20Sudeshna&rft.date=2023-02-01&rft.volume=91&rft.issue=1&rft.spage=93&rft.epage=131&rft.pages=93-131&rft.issn=0022-2844&rft.eissn=1432-1432&rft_id=info:doi/10.1007/s00239-022-10082-0&rft_dat=%3Cproquest_pubme%3E2759961515%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2766574497&rft_id=info:pmid/36587178&rfr_iscdi=true |