Loading…

RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes

Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of...

Full description

Saved in:
Bibliographic Details
Published in:International journal of genomics 2015-01, Vol.2015 (2015), p.1-10
Main Authors: Buza, Krisztian, Dojer, Norbert, Wilczynski, Bartek
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343
cites cdi_FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343
container_end_page 10
container_issue 2015
container_start_page 1
container_title International journal of genomics
container_volume 2015
creator Buza, Krisztian
Dojer, Norbert
Wilczynski, Bartek
description Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software.
doi_str_mv 10.1155/2015/563482
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_78e3fa63cbd04c82ae6ed7397497c1ef</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_78e3fa63cbd04c82ae6ed7397497c1ef</doaj_id><sourcerecordid>3853491911</sourcerecordid><originalsourceid>FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343</originalsourceid><addsrcrecordid>eNqFkU1LHTEUhkOpVFFX3ZeBbkrL1JPvTBcFuVUrCMKlhe5CJnOic5mZ2GRui_--sSO2ujGbnCQPzznkJeQ1hY-USnnEgMojqbgw7AXZY5yKWnBtXj7U6scuOcx5A2U1vDFSvSK7TElpmJR75HR9srpcf_lUrTFgwsljfZxzn2fsqjOc4ohVOePYDrdViKlaDTFjqdc4uH9MPiA7wQ0ZD-_3ffL99OTb6mt9cXl2vjq-qL1Uaq6D9BCMbloEIZVpjfNMGQDkbSe8D7SjygHDQFGh08ZBmSp4oNAGEFzwfXK-eLvoNvYm9aNLtza63v69iOnKujT3fkCrDfLgFPdtB8Ib5oqy07zRotGeYiiuz4vrZtuO2Hmc5uSGR9LHL1N_ba_iLysUa4CrInh3L0jx5xbzbMc-exwGN2HcZks1lw1l2tCCvn2CbuI2TeWrCsWMAsZAF-rDQvkUc04YHoahYO_itndx2yXuQr9f6Ot-6tzv_hn4zQJjQTC4_2BVumv-B9KksQA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1728602207</pqid></control><display><type>article</type><title>RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><source>Open Access: Wiley-Blackwell Open Access Journals</source><source>PubMed Central</source><creator>Buza, Krisztian ; Dojer, Norbert ; Wilczynski, Bartek</creator><contributor>Lin, Chun-Yuan</contributor><creatorcontrib>Buza, Krisztian ; Dojer, Norbert ; Wilczynski, Bartek ; Lin, Chun-Yuan</creatorcontrib><description>Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software.</description><identifier>ISSN: 2314-436X</identifier><identifier>EISSN: 2314-4378</identifier><identifier>DOI: 10.1155/2015/563482</identifier><identifier>PMID: 26558255</identifier><language>eng</language><publisher>Cairo, Egypt: Hindawi Publishing Corporation</publisher><subject>Deoxyribonucleic acid ; DNA ; Experiments ; Genomes ; R&amp;D ; Research &amp; development ; Studies</subject><ispartof>International journal of genomics, 2015-01, Vol.2015 (2015), p.1-10</ispartof><rights>Copyright © 2015 Krisztian Buza et al.</rights><rights>Copyright © 2015 Krisztian Buza et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</rights><rights>Copyright © 2015 Krisztian Buza et al. 2015</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343</citedby><cites>FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343</cites><orcidid>0000-0002-7111-6452</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1728602207/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1728602207?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25730,27900,27901,36988,36989,44565,53765,53767,75095</link.rule.ids></links><search><contributor>Lin, Chun-Yuan</contributor><creatorcontrib>Buza, Krisztian</creatorcontrib><creatorcontrib>Dojer, Norbert</creatorcontrib><creatorcontrib>Wilczynski, Bartek</creatorcontrib><title>RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes</title><title>International journal of genomics</title><description>Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software.</description><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>Experiments</subject><subject>Genomes</subject><subject>R&amp;D</subject><subject>Research &amp; development</subject><subject>Studies</subject><issn>2314-436X</issn><issn>2314-4378</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqFkU1LHTEUhkOpVFFX3ZeBbkrL1JPvTBcFuVUrCMKlhe5CJnOic5mZ2GRui_--sSO2ujGbnCQPzznkJeQ1hY-USnnEgMojqbgw7AXZY5yKWnBtXj7U6scuOcx5A2U1vDFSvSK7TElpmJR75HR9srpcf_lUrTFgwsljfZxzn2fsqjOc4ohVOePYDrdViKlaDTFjqdc4uH9MPiA7wQ0ZD-_3ffL99OTb6mt9cXl2vjq-qL1Uaq6D9BCMbloEIZVpjfNMGQDkbSe8D7SjygHDQFGh08ZBmSp4oNAGEFzwfXK-eLvoNvYm9aNLtza63v69iOnKujT3fkCrDfLgFPdtB8Ib5oqy07zRotGeYiiuz4vrZtuO2Hmc5uSGR9LHL1N_ba_iLysUa4CrInh3L0jx5xbzbMc-exwGN2HcZks1lw1l2tCCvn2CbuI2TeWrCsWMAsZAF-rDQvkUc04YHoahYO_itndx2yXuQr9f6Ot-6tzv_hn4zQJjQTC4_2BVumv-B9KksQA</recordid><startdate>20150101</startdate><enddate>20150101</enddate><creator>Buza, Krisztian</creator><creator>Dojer, Norbert</creator><creator>Wilczynski, Bartek</creator><general>Hindawi Publishing Corporation</general><general>Hindawi Limited</general><scope>ADJCN</scope><scope>AHFXO</scope><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>CWDGH</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M7P</scope><scope>P64</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PIMPY</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>RC3</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-7111-6452</orcidid></search><sort><creationdate>20150101</creationdate><title>RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes</title><author>Buza, Krisztian ; Dojer, Norbert ; Wilczynski, Bartek</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>Experiments</topic><topic>Genomes</topic><topic>R&amp;D</topic><topic>Research &amp; development</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Buza, Krisztian</creatorcontrib><creatorcontrib>Dojer, Norbert</creatorcontrib><creatorcontrib>Wilczynski, Bartek</creatorcontrib><collection>الدوريات العلمية والإحصائية - e-Marefa Academic and Statistical Periodicals</collection><collection>معرفة - المحتوى العربي الأكاديمي المتكامل - e-Marefa Academic Complete</collection><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Nucleic Acids Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>Middle East &amp; Africa Database</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied &amp; Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Genetics Abstracts</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>International journal of genomics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Buza, Krisztian</au><au>Dojer, Norbert</au><au>Wilczynski, Bartek</au><au>Lin, Chun-Yuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes</atitle><jtitle>International journal of genomics</jtitle><date>2015-01-01</date><risdate>2015</risdate><volume>2015</volume><issue>2015</issue><spage>1</spage><epage>10</epage><pages>1-10</pages><issn>2314-436X</issn><eissn>2314-4378</eissn><abstract>Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software.</abstract><cop>Cairo, Egypt</cop><pub>Hindawi Publishing Corporation</pub><pmid>26558255</pmid><doi>10.1155/2015/563482</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-7111-6452</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2314-436X
ispartof International journal of genomics, 2015-01, Vol.2015 (2015), p.1-10
issn 2314-436X
2314-4378
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_78e3fa63cbd04c82ae6ed7397497c1ef
source Publicly Available Content Database (Proquest) (PQ_SDU_P3); Open Access: Wiley-Blackwell Open Access Journals; PubMed Central
subjects Deoxyribonucleic acid
DNA
Experiments
Genomes
R&D
Research & development
Studies
title RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-25T09%3A39%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RECORD:%20Reference-Assisted%20Genome%20Assembly%20for%20Closely%20Related%20Genomes&rft.jtitle=International%20journal%20of%20genomics&rft.au=Buza,%20Krisztian&rft.date=2015-01-01&rft.volume=2015&rft.issue=2015&rft.spage=1&rft.epage=10&rft.pages=1-10&rft.issn=2314-436X&rft.eissn=2314-4378&rft_id=info:doi/10.1155/2015/563482&rft_dat=%3Cproquest_doaj_%3E3853491911%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c566t-f5c0f879be04568b8ac26800e3bd4ccf1d16a02ef1e6ea78a0efefc010bf04343%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1728602207&rft_id=info:pmid/26558255&rfr_iscdi=true