Loading…
PacBio sequencing output increased through uniform and directional fivefold concatenation
Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less effic...
Saved in:
Published in: | Scientific reports 2021-09, Vol.11 (1), p.18065-18065, Article 18065 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3 |
---|---|
cites | cdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3 |
container_end_page | 18065 |
container_issue | 1 |
container_start_page | 18065 |
container_title | Scientific reports |
container_volume | 11 |
creator | Kanwar, Nisha Blanco, Celia Chen, Irene A. Seelig, Burckhard |
description | Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost. |
doi_str_mv | 10.1038/s41598-021-96829-z |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_e15146f13e9d454ea27128ae5e18d288</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_e15146f13e9d454ea27128ae5e18d288</doaj_id><sourcerecordid>2571922954</sourcerecordid><originalsourceid>FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</originalsourceid><addsrcrecordid>eNp9kk1vFSEUhifGxjZt_4ALM4kbN6N8zsDGRBs_mjSxi7pwRRg4zOVmLlyBaWJ_vbRTa-tCNhDe9zzA4W2alxi9xYiKd5lhLkWHCO5kL4jsbp41RwQx3hFKyPNH68PmNOctqoMTybB80RxSxpHAeDhqflxq89HHNsPPBYLxYWrjUvZLaX0wCXQG25ZNisu0aZfgXUy7VgfbWp_AFB-Dnlvnr8HF2bYmBqMLBH0rnDQHTs8ZTu_n4-b7509XZ1-7i29fzs8-XHSGM1Q6IfhoHeuRo9KIUTDHgA4MMzkKjCSMw8h7TeiAdC9HLK2lVOBa1cveigHocXO-cm3UW7VPfqfTLxW1V3cbMU1Kp-LNDAowx6x3mIK0jDPQZMBEaOCAhSVCVNb7lbVfxh1YA6EkPT-BPlWC36gpXivBKKVoqIA394AUa0NzUTufDcyzDhCXrAgfsCREclatr_-xbuOSaj9XF2KIU15dZHWZFHNO4B4ug5G6DYJag6BqENRdENRNLXr1-BkPJX--vRroashVChOkv2f_B_sbhle_FA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2571040535</pqid></control><display><type>article</type><title>PacBio sequencing output increased through uniform and directional fivefold concatenation</title><source>Open Access: PubMed Central</source><source>Publicly Available Content (ProQuest)</source><source>Free Full-Text Journals in Chemistry</source><source>Springer Nature - nature.com Journals - Fully Open Access</source><creator>Kanwar, Nisha ; Blanco, Celia ; Chen, Irene A. ; Seelig, Burckhard</creator><creatorcontrib>Kanwar, Nisha ; Blanco, Celia ; Chen, Irene A. ; Seelig, Burckhard</creatorcontrib><description>Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-021-96829-z</identifier><identifier>PMID: 34508117</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/1647/48 ; 631/1647/514/1948 ; 631/1647/514/1949 ; 631/1647/514/2254 ; 631/181/2475 ; 631/181/735 ; 631/45 ; 631/61 ; 631/61/514 ; Accuracy ; Acids ; Animals ; Base Sequence ; Computational Biology - methods ; Computational Biology - standards ; Deoxyribonucleic acid ; DNA ; DNA sequencing ; Engineering ; Gene Library ; Genes ; High-Throughput Nucleotide Sequencing - methods ; Humanities and Social Sciences ; Methods ; Mice ; Molecular Sequence Annotation ; multidisciplinary ; Mutation ; Nucleotide sequence ; Protein engineering ; Proteins ; Remakes & sequels ; Sample preparation ; Science ; Science (multidisciplinary) ; Sequence Analysis, DNA - methods ; Sequence Analysis, DNA - standards ; Sequence Analysis, Protein</subject><ispartof>Scientific reports, 2021-09, Vol.11 (1), p.18065-18065, Article 18065</ispartof><rights>The Author(s) 2021</rights><rights>2021. The Author(s).</rights><rights>The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</citedby><cites>FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2571040535/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2571040535?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25731,27901,27902,36989,36990,44566,53766,53768,74869</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34508117$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kanwar, Nisha</creatorcontrib><creatorcontrib>Blanco, Celia</creatorcontrib><creatorcontrib>Chen, Irene A.</creatorcontrib><creatorcontrib>Seelig, Burckhard</creatorcontrib><title>PacBio sequencing output increased through uniform and directional fivefold concatenation</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.</description><subject>631/1647/48</subject><subject>631/1647/514/1948</subject><subject>631/1647/514/1949</subject><subject>631/1647/514/2254</subject><subject>631/181/2475</subject><subject>631/181/735</subject><subject>631/45</subject><subject>631/61</subject><subject>631/61/514</subject><subject>Accuracy</subject><subject>Acids</subject><subject>Animals</subject><subject>Base Sequence</subject><subject>Computational Biology - methods</subject><subject>Computational Biology - standards</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA sequencing</subject><subject>Engineering</subject><subject>Gene Library</subject><subject>Genes</subject><subject>High-Throughput Nucleotide Sequencing - methods</subject><subject>Humanities and Social Sciences</subject><subject>Methods</subject><subject>Mice</subject><subject>Molecular Sequence Annotation</subject><subject>multidisciplinary</subject><subject>Mutation</subject><subject>Nucleotide sequence</subject><subject>Protein engineering</subject><subject>Proteins</subject><subject>Remakes & sequels</subject><subject>Sample preparation</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Sequence Analysis, DNA - standards</subject><subject>Sequence Analysis, Protein</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNp9kk1vFSEUhifGxjZt_4ALM4kbN6N8zsDGRBs_mjSxi7pwRRg4zOVmLlyBaWJ_vbRTa-tCNhDe9zzA4W2alxi9xYiKd5lhLkWHCO5kL4jsbp41RwQx3hFKyPNH68PmNOctqoMTybB80RxSxpHAeDhqflxq89HHNsPPBYLxYWrjUvZLaX0wCXQG25ZNisu0aZfgXUy7VgfbWp_AFB-Dnlvnr8HF2bYmBqMLBH0rnDQHTs8ZTu_n4-b7509XZ1-7i29fzs8-XHSGM1Q6IfhoHeuRo9KIUTDHgA4MMzkKjCSMw8h7TeiAdC9HLK2lVOBa1cveigHocXO-cm3UW7VPfqfTLxW1V3cbMU1Kp-LNDAowx6x3mIK0jDPQZMBEaOCAhSVCVNb7lbVfxh1YA6EkPT-BPlWC36gpXivBKKVoqIA394AUa0NzUTufDcyzDhCXrAgfsCREclatr_-xbuOSaj9XF2KIU15dZHWZFHNO4B4ug5G6DYJag6BqENRdENRNLXr1-BkPJX--vRroashVChOkv2f_B_sbhle_FA</recordid><startdate>20210910</startdate><enddate>20210910</enddate><creator>Kanwar, Nisha</creator><creator>Blanco, Celia</creator><creator>Chen, Irene A.</creator><creator>Seelig, Burckhard</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><general>Nature Portfolio</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20210910</creationdate><title>PacBio sequencing output increased through uniform and directional fivefold concatenation</title><author>Kanwar, Nisha ; Blanco, Celia ; Chen, Irene A. ; Seelig, Burckhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>631/1647/48</topic><topic>631/1647/514/1948</topic><topic>631/1647/514/1949</topic><topic>631/1647/514/2254</topic><topic>631/181/2475</topic><topic>631/181/735</topic><topic>631/45</topic><topic>631/61</topic><topic>631/61/514</topic><topic>Accuracy</topic><topic>Acids</topic><topic>Animals</topic><topic>Base Sequence</topic><topic>Computational Biology - methods</topic><topic>Computational Biology - standards</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA sequencing</topic><topic>Engineering</topic><topic>Gene Library</topic><topic>Genes</topic><topic>High-Throughput Nucleotide Sequencing - methods</topic><topic>Humanities and Social Sciences</topic><topic>Methods</topic><topic>Mice</topic><topic>Molecular Sequence Annotation</topic><topic>multidisciplinary</topic><topic>Mutation</topic><topic>Nucleotide sequence</topic><topic>Protein engineering</topic><topic>Proteins</topic><topic>Remakes & sequels</topic><topic>Sample preparation</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Sequence Analysis, DNA - standards</topic><topic>Sequence Analysis, Protein</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kanwar, Nisha</creatorcontrib><creatorcontrib>Blanco, Celia</creatorcontrib><creatorcontrib>Chen, Irene A.</creatorcontrib><creatorcontrib>Seelig, Burckhard</creatorcontrib><collection>SpringerOpen</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Proquest Health & Medical Complete</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Journals (ProQuest Database)</collection><collection>Biological Science Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kanwar, Nisha</au><au>Blanco, Celia</au><au>Chen, Irene A.</au><au>Seelig, Burckhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PacBio sequencing output increased through uniform and directional fivefold concatenation</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2021-09-10</date><risdate>2021</risdate><volume>11</volume><issue>1</issue><spage>18065</spage><epage>18065</epage><pages>18065-18065</pages><artnum>18065</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>Advances in sequencing technology have allowed researchers to sequence DNA with greater ease and at decreasing costs. Main developments have focused on either sequencing many short sequences or fewer large sequences. Methods for sequencing mid-sized sequences of 600–5,000 bp are currently less efficient. For example, the PacBio Sequel I system yields ~ 100,000–300,000 reads with an accuracy per base pair of 90–99%. We sought to sequence several DNA populations of ~ 870 bp in length with a sequencing accuracy of 99% and to the greatest depth possible. We optimised a simple, robust method to concatenate genes of ~ 870 bp five times and then sequenced the resulting DNA of ~ 5,000 bp by PacBioSMRT long-read sequencing. Our method improved upon previously published concatenation attempts, leading to a greater sequencing depth, high-quality reads and limited sample preparation at little expense. We applied this efficient concatenation protocol to sequence nine DNA populations from a protein engineering study. The improved method is accompanied by a simple and user-friendly analysis pipeline, DeCatCounter, to sequence medium-length sequences efficiently at one-fifth of the cost.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>34508117</pmid><doi>10.1038/s41598-021-96829-z</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2045-2322 |
ispartof | Scientific reports, 2021-09, Vol.11 (1), p.18065-18065, Article 18065 |
issn | 2045-2322 2045-2322 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_e15146f13e9d454ea27128ae5e18d288 |
source | Open Access: PubMed Central; Publicly Available Content (ProQuest); Free Full-Text Journals in Chemistry; Springer Nature - nature.com Journals - Fully Open Access |
subjects | 631/1647/48 631/1647/514/1948 631/1647/514/1949 631/1647/514/2254 631/181/2475 631/181/735 631/45 631/61 631/61/514 Accuracy Acids Animals Base Sequence Computational Biology - methods Computational Biology - standards Deoxyribonucleic acid DNA DNA sequencing Engineering Gene Library Genes High-Throughput Nucleotide Sequencing - methods Humanities and Social Sciences Methods Mice Molecular Sequence Annotation multidisciplinary Mutation Nucleotide sequence Protein engineering Proteins Remakes & sequels Sample preparation Science Science (multidisciplinary) Sequence Analysis, DNA - methods Sequence Analysis, DNA - standards Sequence Analysis, Protein |
title | PacBio sequencing output increased through uniform and directional fivefold concatenation |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T08%3A07%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PacBio%20sequencing%20output%20increased%20through%20uniform%20and%20directional%20fivefold%20concatenation&rft.jtitle=Scientific%20reports&rft.au=Kanwar,%20Nisha&rft.date=2021-09-10&rft.volume=11&rft.issue=1&rft.spage=18065&rft.epage=18065&rft.pages=18065-18065&rft.artnum=18065&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-021-96829-z&rft_dat=%3Cproquest_doaj_%3E2571922954%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c540t-885bdf460f39c8b84f4e374149b8109eb7b56a2370a69b19dd3381885696d87e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2571040535&rft_id=info:pmid/34508117&rfr_iscdi=true |