Loading…

Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison

The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient an...

Full description

Saved in:
Bibliographic Details
Published in:Journal of molecular evolution 2023-02, Vol.91 (1), p.93-131
Main Authors: Dey, Sudeshna, Das, Subhram, Bhattacharya, D. K.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063
cites cdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063
container_end_page 131
container_issue 1
container_start_page 93
container_title Journal of molecular evolution
container_volume 91
creator Dey, Sudeshna
Das, Subhram
Bhattacharya, D. K.
description The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.
doi_str_mv 10.1007/s00239-022-10082-0
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9805373</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2759961515</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</originalsourceid><addsrcrecordid>eNp9kcFPFDEUxhuDkQX9BzyQJl64jL620-nUg8myUTQBJBHPTafzli2ZmQ7tLMh_T9dFRA9c2rx8v_e1732EvGXwngGoDwmAC10A50Wua17ACzJjpchlPnbIDDYSr8tyl-yldAXAlNTiFdkVlawVU_WMmCMf3Ap772xHz2MYMU539MgmbOl5SH7yYcjKqZ2i__WRzukZ3tL5OMZg3YpehFsb20SPcQg90h94vcbBIV2EfrTRpzC8Ji-Xtkv45uHeJz-_fL5YfC1Ovh9_W8xPCleqcip425baMtkKWDaNVJbZVopaO9lU0ECjXSu5wmWlXcMbjXUltJJQQ5ZAQCX2yaet77huemwdDlO0nRmj7228M8F6868y-JW5DDdG1yCFEtng8MEghjxFmkzvk8OuswOGdTI8r05XTDKZ0Xf_oVdhHfOaNlRVSVWWWmWKbykXQ0oRl4-fYWA2-ZltfiaHZH7nZyA3HTwd47HlT2AZEFsgZWm4xPj37Wds7wFPNKY_</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2766574497</pqid></control><display><type>article</type><title>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</title><source>Springer Nature</source><creator>Dey, Sudeshna ; Das, Subhram ; Bhattacharya, D. K.</creator><creatorcontrib>Dey, Sudeshna ; Das, Subhram ; Bhattacharya, D. K.</creatorcontrib><description>The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.</description><identifier>ISSN: 0022-2844</identifier><identifier>EISSN: 1432-1432</identifier><identifier>DOI: 10.1007/s00239-022-10082-0</identifier><identifier>PMID: 36587178</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Alignment ; Animal Genetics and Genomics ; Animals ; Bioinformatics ; Biomedical and Life Sciences ; Cell Biology ; Clustering ; Computational Biology - methods ; Evolutionary Biology ; Genome - genetics ; Genomes ; Graph theory ; Life Sciences ; Mammals - genetics ; Microbiology ; Nucleotide sequence ; Nucleotides ; Nucleotides - genetics ; Original ; Original Article ; Phylogeny ; Plant Genetics and Genomics ; Plant Sciences ; Sequence analysis ; Sequence Analysis, DNA - methods</subject><ispartof>Journal of molecular evolution, 2023-02, Vol.91 (1), p.93-131</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>2022. The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</citedby><cites>FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</cites><orcidid>0000-0003-2899-4433</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36587178$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Dey, Sudeshna</creatorcontrib><creatorcontrib>Das, Subhram</creatorcontrib><creatorcontrib>Bhattacharya, D. K.</creatorcontrib><title>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</title><title>Journal of molecular evolution</title><addtitle>J Mol Evol</addtitle><addtitle>J Mol Evol</addtitle><description>The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Animal Genetics and Genomics</subject><subject>Animals</subject><subject>Bioinformatics</subject><subject>Biomedical and Life Sciences</subject><subject>Cell Biology</subject><subject>Clustering</subject><subject>Computational Biology - methods</subject><subject>Evolutionary Biology</subject><subject>Genome - genetics</subject><subject>Genomes</subject><subject>Graph theory</subject><subject>Life Sciences</subject><subject>Mammals - genetics</subject><subject>Microbiology</subject><subject>Nucleotide sequence</subject><subject>Nucleotides</subject><subject>Nucleotides - genetics</subject><subject>Original</subject><subject>Original Article</subject><subject>Phylogeny</subject><subject>Plant Genetics and Genomics</subject><subject>Plant Sciences</subject><subject>Sequence analysis</subject><subject>Sequence Analysis, DNA - methods</subject><issn>0022-2844</issn><issn>1432-1432</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kcFPFDEUxhuDkQX9BzyQJl64jL620-nUg8myUTQBJBHPTafzli2ZmQ7tLMh_T9dFRA9c2rx8v_e1732EvGXwngGoDwmAC10A50Wua17ACzJjpchlPnbIDDYSr8tyl-yldAXAlNTiFdkVlawVU_WMmCMf3Ap772xHz2MYMU539MgmbOl5SH7yYcjKqZ2i__WRzukZ3tL5OMZg3YpehFsb20SPcQg90h94vcbBIV2EfrTRpzC8Ji-Xtkv45uHeJz-_fL5YfC1Ovh9_W8xPCleqcip425baMtkKWDaNVJbZVopaO9lU0ECjXSu5wmWlXcMbjXUltJJQQ5ZAQCX2yaet77huemwdDlO0nRmj7228M8F6868y-JW5DDdG1yCFEtng8MEghjxFmkzvk8OuswOGdTI8r05XTDKZ0Xf_oVdhHfOaNlRVSVWWWmWKbykXQ0oRl4-fYWA2-ZltfiaHZH7nZyA3HTwd47HlT2AZEFsgZWm4xPj37Wds7wFPNKY_</recordid><startdate>20230201</startdate><enddate>20230201</enddate><creator>Dey, Sudeshna</creator><creator>Das, Subhram</creator><creator>Bhattacharya, D. K.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QL</scope><scope>7QP</scope><scope>7QR</scope><scope>7T7</scope><scope>7TK</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2O</scope><scope>M7N</scope><scope>M7P</scope><scope>MBDVC</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-2899-4433</orcidid></search><sort><creationdate>20230201</creationdate><title>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</title><author>Dey, Sudeshna ; Das, Subhram ; Bhattacharya, D. K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Animal Genetics and Genomics</topic><topic>Animals</topic><topic>Bioinformatics</topic><topic>Biomedical and Life Sciences</topic><topic>Cell Biology</topic><topic>Clustering</topic><topic>Computational Biology - methods</topic><topic>Evolutionary Biology</topic><topic>Genome - genetics</topic><topic>Genomes</topic><topic>Graph theory</topic><topic>Life Sciences</topic><topic>Mammals - genetics</topic><topic>Microbiology</topic><topic>Nucleotide sequence</topic><topic>Nucleotides</topic><topic>Nucleotides - genetics</topic><topic>Original</topic><topic>Original Article</topic><topic>Phylogeny</topic><topic>Plant Genetics and Genomics</topic><topic>Plant Sciences</topic><topic>Sequence analysis</topic><topic>Sequence Analysis, DNA - methods</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dey, Sudeshna</creatorcontrib><creatorcontrib>Das, Subhram</creatorcontrib><creatorcontrib>Bhattacharya, D. K.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Neurosciences Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Research Library</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Research Library (Corporate)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of molecular evolution</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dey, Sudeshna</au><au>Das, Subhram</au><au>Bhattacharya, D. K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison</atitle><jtitle>Journal of molecular evolution</jtitle><stitle>J Mol Evol</stitle><addtitle>J Mol Evol</addtitle><date>2023-02-01</date><risdate>2023</risdate><volume>91</volume><issue>1</issue><spage>93</spage><epage>131</epage><pages>93-131</pages><issn>0022-2844</issn><eissn>1432-1432</eissn><abstract>The growth of the genome sequence has become one of the emerging areas in the study of bioinformatics. It has led to an excessive demand for researchers to develop advanced methodologies for evolutionary relationships among species. The alignment-free methods have been proved to be more efficient and appropriate related to time and space than existing alignment-based methods for sequence analysis. In this study, a new alignment-free genome sequence comparison technique is proposed based on the biochemical properties of nucleotides. Each genome sequence can be distributed in four parameters to represent a 21-dimensional numerical descriptor using the Positional Matrix. To substantiate the proposed method, phylogenetic trees are constructed on the viral and mammalian datasets by applying the UPGMA/NJ clustering method. Further, the results of this method are compared with the results of the Feature Frequency Profiles method, the Positional Correlation Natural Vector method, the Graph-theoretic method, the Multiple Encoding Vector method, and the Fuzzy Integral Similarity method. In most cases, it is found that the present method produces more accurate results than the prior methods. Also, in the present method, the execution time for computation is comparatively small.</abstract><cop>New York</cop><pub>Springer US</pub><pmid>36587178</pmid><doi>10.1007/s00239-022-10082-0</doi><tpages>39</tpages><orcidid>https://orcid.org/0000-0003-2899-4433</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0022-2844
ispartof Journal of molecular evolution, 2023-02, Vol.91 (1), p.93-131
issn 0022-2844
1432-1432
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9805373
source Springer Nature
subjects Algorithms
Alignment
Animal Genetics and Genomics
Animals
Bioinformatics
Biomedical and Life Sciences
Cell Biology
Clustering
Computational Biology - methods
Evolutionary Biology
Genome - genetics
Genomes
Graph theory
Life Sciences
Mammals - genetics
Microbiology
Nucleotide sequence
Nucleotides
Nucleotides - genetics
Original
Original Article
Phylogeny
Plant Genetics and Genomics
Plant Sciences
Sequence analysis
Sequence Analysis, DNA - methods
title Biochemical Property Based Positional Matrix: A New Approach Towards Genome Sequence Comparison
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T21%3A40%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Biochemical%20Property%20Based%20Positional%20Matrix:%20A%20New%20Approach%20Towards%20Genome%20Sequence%20Comparison&rft.jtitle=Journal%20of%20molecular%20evolution&rft.au=Dey,%20Sudeshna&rft.date=2023-02-01&rft.volume=91&rft.issue=1&rft.spage=93&rft.epage=131&rft.pages=93-131&rft.issn=0022-2844&rft.eissn=1432-1432&rft_id=info:doi/10.1007/s00239-022-10082-0&rft_dat=%3Cproquest_pubme%3E2759961515%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c474t-2dd49a15d30fbb57a1ad5389c5b60b0b9cd527ef69cb2b9e8639750800b903063%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2766574497&rft_id=info:pmid/36587178&rfr_iscdi=true