PacBio sequencing output increased through uniform and directional fivefold concatenation

Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less effic...

Bibliographic Details
Published in: Scientific reports 2021-09, Vol.11 (1), p.18065-18065, Article 18065
Main Authors: Kanwar, Nisha; Blanco, Celia; Chen, Irene A.; Seelig, Burckhard
Format: Article
Language: English
Subjects:
cited_by cdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3
cites cdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3
container_end_page 18065
container_issue 1
container_start_page 18065
container_title Scientific reports
container_volume 11
creator Kanwar, Nisha
Blanco, Celia
Chen, Irene A.
Seelig, Burckhard
description Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~870 bp five times and then sequenced the resulting DNA of ~5,000 bp by PacBio SMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.
doi_str_mv 10.1038/s41598-021-96829-z
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_e15146f13e9d454ea27128ae5e18d288</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_e15146f13e9d454ea27128ae5e18d288</doaj_id><sourcerecordid>2571922954</sourcerecordid><originalsourceid>FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</originalsourceid><addsrcrecordid>eNp9kk1vFSEUhifGxjZt_4ALM4kbN6N8zsDGRBs_mjSxi7pwRRg4zOVmLlyBaWJ_vbRTa-tCNhDe9zzA4W2alxi9xYiKd5lhLkWHCO5kL4jsbp41RwQx3hFKyPNH68PmNOctqoMTybB80RxSxpHAeDhqflxq89HHNsPPBYLxYWrjUvZLaX0wCXQG25ZNisu0aZfgXUy7VgfbWp_AFB-Dnlvnr8HF2bYmBqMLBH0rnDQHTs8ZTu_n4-b7509XZ1-7i29fzs8-XHSGM1Q6IfhoHeuRo9KIUTDHgA4MMzkKjCSMw8h7TeiAdC9HLK2lVOBa1cveigHocXO-cm3UW7VPfqfTLxW1V3cbMU1Kp-LNDAowx6x3mIK0jDPQZMBEaOCAhSVCVNb7lbVfxh1YA6EkPT-BPlWC36gpXivBKKVoqIA394AUa0NzUTufDcyzDhCXrAgfsCREclatr_-xbuOSaj9XF2KIU15dZHWZFHNO4B4ug5G6DYJag6BqENRdENRNLXr1-BkPJX--vRroashVChOkv2f_B_sbhle_FA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2571040535</pqid></control><display><type>article</type><title>PacBio sequencing output increased through uniform and directional fivefold concatenation</title><source>Open Access: PubMed Central</source><source>Publicly Available Content (ProQuest)</source><source>Free Full-Text Journals in Chemistry</source><source>Springer Nature - nature.com Journals - Fully Open Access</source><creator>Kanwar, Nisha ; Blanco, Celia ; Chen, Irene A. ; Seelig, Burckhard</creator><creatorcontrib>Kanwar, Nisha ; Blanco, Celia ; Chen, Irene A. ; Seelig, Burckhard</creatorcontrib><description>Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. 
Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-021-96829-z</identifier><identifier>PMID: 34508117</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/1647/48 ; 631/1647/514/1948 ; 631/1647/514/1949 ; 631/1647/514/2254 ; 631/181/2475 ; 631/181/735 ; 631/45 ; 631/61 ; 631/61/514 ; Accuracy ; Acids ; Animals ; Base Sequence ; Computational Biology - methods ; Computational Biology - standards ; Deoxyribonucleic acid ; DNA ; DNA sequencing ; Engineering ; Gene Library ; Genes ; High-Throughput Nucleotide Sequencing - methods ; Humanities and Social Sciences ; Methods ; Mice ; Molecular Sequence Annotation ; multidisciplinary ; Mutation ; Nucleotide sequence ; Protein engineering ; Proteins ; Remakes &amp; sequels ; Sample preparation ; Science ; Science (multidisciplinary) ; Sequence Analysis, DNA - methods ; Sequence Analysis, DNA - standards ; Sequence Analysis, 
Protein</subject><ispartof>Scientific reports, 2021-09, Vol.11 (1), p.18065-18065, Article 18065</ispartof><rights>The Author(s) 2021</rights><rights>2021. The Author(s).</rights><rights>The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</citedby><cites>FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2571040535/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2571040535?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25731,27901,27902,36989,36990,44566,53766,53768,74869</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34508117$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kanwar, Nisha</creatorcontrib><creatorcontrib>Blanco, Celia</creatorcontrib><creatorcontrib>Chen, Irene A.</creatorcontrib><creatorcontrib>Seelig, Burckhard</creatorcontrib><title>PacBio sequencing output increased through uniform and directional fivefold concatenation</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. 
Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.</description><subject>631/1647/48</subject><subject>631/1647/514/1948</subject><subject>631/1647/514/1949</subject><subject>631/1647/514/2254</subject><subject>631/181/2475</subject><subject>631/181/735</subject><subject>631/45</subject><subject>631/61</subject><subject>631/61/514</subject><subject>Accuracy</subject><subject>Acids</subject><subject>Animals</subject><subject>Base Sequence</subject><subject>Computational Biology - methods</subject><subject>Computational Biology - standards</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA sequencing</subject><subject>Engineering</subject><subject>Gene Library</subject><subject>Genes</subject><subject>High-Throughput Nucleotide Sequencing - methods</subject><subject>Humanities and Social Sciences</subject><subject>Methods</subject><subject>Mice</subject><subject>Molecular Sequence 
Annotation</subject><subject>multidisciplinary</subject><subject>Mutation</subject><subject>Nucleotide sequence</subject><subject>Protein engineering</subject><subject>Proteins</subject><subject>Remakes &amp; sequels</subject><subject>Sample preparation</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Sequence Analysis, DNA - standards</subject><subject>Sequence Analysis, Protein</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNp9kk1vFSEUhifGxjZt_4ALM4kbN6N8zsDGRBs_mjSxi7pwRRg4zOVmLlyBaWJ_vbRTa-tCNhDe9zzA4W2alxi9xYiKd5lhLkWHCO5kL4jsbp41RwQx3hFKyPNH68PmNOctqoMTybB80RxSxpHAeDhqflxq89HHNsPPBYLxYWrjUvZLaX0wCXQG25ZNisu0aZfgXUy7VgfbWp_AFB-Dnlvnr8HF2bYmBqMLBH0rnDQHTs8ZTu_n4-b7509XZ1-7i29fzs8-XHSGM1Q6IfhoHeuRo9KIUTDHgA4MMzkKjCSMw8h7TeiAdC9HLK2lVOBa1cveigHocXO-cm3UW7VPfqfTLxW1V3cbMU1Kp-LNDAowx6x3mIK0jDPQZMBEaOCAhSVCVNb7lbVfxh1YA6EkPT-BPlWC36gpXivBKKVoqIA394AUa0NzUTufDcyzDhCXrAgfsCREclatr_-xbuOSaj9XF2KIU15dZHWZFHNO4B4ug5G6DYJag6BqENRdENRNLXr1-BkPJX--vRroashVChOkv2f_B_sbhle_FA</recordid><startdate>20210910</startdate><enddate>20210910</enddate><creator>Kanwar, Nisha</creator><creator>Blanco, Celia</creator><creator>Chen, Irene A.</creator><creator>Seelig, Burckhard</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><general>Nature 
Portfolio</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20210910</creationdate><title>PacBio sequencing output increased through uniform and directional fivefold concatenation</title><author>Kanwar, Nisha ; Blanco, Celia ; Chen, Irene A. 
; Seelig, Burckhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>631/1647/48</topic><topic>631/1647/514/1948</topic><topic>631/1647/514/1949</topic><topic>631/1647/514/2254</topic><topic>631/181/2475</topic><topic>631/181/735</topic><topic>631/45</topic><topic>631/61</topic><topic>631/61/514</topic><topic>Accuracy</topic><topic>Acids</topic><topic>Animals</topic><topic>Base Sequence</topic><topic>Computational Biology - methods</topic><topic>Computational Biology - standards</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA sequencing</topic><topic>Engineering</topic><topic>Gene Library</topic><topic>Genes</topic><topic>High-Throughput Nucleotide Sequencing - methods</topic><topic>Humanities and Social Sciences</topic><topic>Methods</topic><topic>Mice</topic><topic>Molecular Sequence Annotation</topic><topic>multidisciplinary</topic><topic>Mutation</topic><topic>Nucleotide sequence</topic><topic>Protein engineering</topic><topic>Proteins</topic><topic>Remakes &amp; sequels</topic><topic>Sample preparation</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Sequence Analysis, DNA - standards</topic><topic>Sequence Analysis, Protein</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kanwar, Nisha</creatorcontrib><creatorcontrib>Blanco, Celia</creatorcontrib><creatorcontrib>Chen, Irene A.</creatorcontrib><creatorcontrib>Seelig, Burckhard</creatorcontrib><collection>SpringerOpen</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE 
(Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Proquest Health &amp; Medical Complete</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Journals (ProQuest Database)</collection><collection>Biological Science Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One 
Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kanwar, Nisha</au><au>Blanco, Celia</au><au>Chen, Irene A.</au><au>Seelig, Burckhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PacBio sequencing output increased through uniform and directional fivefold concatenation</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2021-09-10</date><risdate>2021</risdate><volume>11</volume><issue>1</issue><spage>18065</spage><epage>18065</epage><pages>18065-18065</pages><artnum>18065</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. 
We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>34508117</pmid><doi>10.1038/s41598-021-96829-z</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2045-2322
ispartof Scientific reports, 2021-09, Vol.11 (1), p.18065-18065, Article 18065
issn 2045-2322
2045-2322
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_e15146f13e9d454ea27128ae5e18d288
source Open Access: PubMed Central; Publicly Available Content (ProQuest); Free Full-Text Journals in Chemistry; Springer Nature - nature.com Journals - Fully Open Access
subjects 631/1647/48
631/1647/514/1948
631/1647/514/1949
631/1647/514/2254
631/181/2475
631/181/735
631/45
631/61
631/61/514
Accuracy
Acids
Animals
Base Sequence
Computational Biology - methods
Computational Biology - standards
Deoxyribonucleic acid
DNA
DNA sequencing
Engineering
Gene Library
Genes
High-Throughput Nucleotide Sequencing - methods
Humanities and Social Sciences
Methods
Mice
Molecular Sequence Annotation
multidisciplinary
Mutation
Nucleotide sequence
Protein engineering
Proteins
Remakes & sequels
Sample preparation
Science
Science (multidisciplinary)
Sequence Analysis, DNA - methods
Sequence Analysis, DNA - standards
Sequence Analysis, Protein
title PacBio sequencing output increased through uniform and directional fivefold concatenation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T08%3A07%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PacBio%20sequencing%20output%20increased%20through%20uniform%20and%20directional%20fivefold%20concatenation&rft.jtitle=Scientific%20reports&rft.au=Kanwar,%20Nisha&rft.date=2021-09-10&rft.volume=11&rft.issue=1&rft.spage=18065&rft.epage=18065&rft.pages=18065-18065&rft.artnum=18065&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-021-96829-z&rft_dat=%3Cproquest_doaj_%3E2571922954%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2571040535&rft_id=info:pmid/34508117&rfr_iscdi=true
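
Note on the description field: the "one-fifth of the cost" claim follows from simple arithmetic on the figures quoted in the abstract. The sketch below is illustrative only and is not part of the record or of DeCatCounter. It assumes, as an idealised upper bound, that every read yields all five gene copies, and every name in it is hypothetical.

```python
# Illustrative arithmetic only: figures are taken from the abstract,
# all names are hypothetical, and perfect recovery of every gene copy
# from every read is an assumption (an idealised upper bound).

CONCAT_FACTOR = 5                   # genes joined per concatemer
GENE_LENGTH_BP = 870                # approximate gene length
READS_PER_RUN = (100_000, 300_000)  # quoted Sequel I read range


def genes_per_run(reads: int, factor: int = CONCAT_FACTOR) -> int:
    """Gene sequences recovered if each read carries `factor` genes."""
    return reads * factor


def relative_cost_per_gene(factor: int = CONCAT_FACTOR) -> float:
    """Cost per gene relative to sequencing one gene per read."""
    return 1 / factor


low, high = (genes_per_run(r) for r in READS_PER_RUN)
gene_bp_per_concatemer = CONCAT_FACTOR * GENE_LENGTH_BP

print(f"Genes per run: {low:,} to {high:,}")
print(f"Gene bases per concatemer: {gene_bp_per_concatemer:,} bp")
print(f"Relative cost per gene: {relative_cost_per_gene():.2f}")
```

Five copies of ~870 bp account for about 4,350 bp of the ~5,000 bp concatemer quoted in the abstract; this record does not say what makes up the remainder.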