Loading…

Sequencing Strategy to Ensure Accurate Plasmid Assembly

Despite the wide use of plasmids in research and clinical production, the need to verify plasmid sequences is a bottleneck that is too often underestimated in the manufacturing process. Although sequencing platforms continue to improve, the method and assembly pipeline chosen still influence the fin...

Full description

Saved in:
Bibliographic Details
Published in:ACS synthetic biology 2024-12, Vol.13 (12), p.4099
Main Authors: Hernandez, Sarah I, Berezin, Casey-Tyler, Miller, Katie M, Peccoud, Samuel J, Peccoud, Jean
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c183t-6fd87c60eeaa90e6e15c82477ef4888b02d7923934c7ccfc47e26c3d665699ab3
container_end_page
container_issue 12
container_start_page 4099
container_title ACS synthetic biology
container_volume 13
creator Hernandez, Sarah I
Berezin, Casey-Tyler
Miller, Katie M
Peccoud, Samuel J
Peccoud, Jean
description Despite the wide use of plasmids in research and clinical production, the need to verify plasmid sequences is a bottleneck that is too often underestimated in the manufacturing process. Although sequencing platforms continue to improve, the method and assembly pipeline chosen still influence the final plasmid assembly sequence. Furthermore, few dedicated tools exist for plasmid assembly, especially for assembly. Here, we evaluated short-read, long-read, and hybrid (both short and long reads) assembly pipelines across three replicates of a 24-plasmid library. Consistent with previous characterizations of each sequencing technology, short-read assemblies had issues resolving GC-rich regions, and long-read assemblies commonly had small insertions and deletions, especially in repetitive regions. The hybrid approach facilitated the most accurate, consistent assembly generation and identified mutations relative to the reference sequence. Although Sanger sequencing can be used to verify specific regions, some GC-rich and repetitive regions were difficult to resolve using any method, suggesting that easily sequenced genetic parts should be prioritized in the design of new genetic constructs.
doi_str_mv 10.1021/acssynbio.4c00539
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3128319061</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3128319061</sourcerecordid><originalsourceid>FETCH-LOGICAL-c183t-6fd87c60eeaa90e6e15c82477ef4888b02d7923934c7ccfc47e26c3d665699ab3</originalsourceid><addsrcrecordid>eNpNkEtrwzAQhEVpaUKaH9BL8bEXp5JlvY4hpA8ItJD2LOT1Orj4kUr2wf--DnFD97LLMjMMHyH3jK4YTdiTgxCGJivbVQqUCm6uyDxhksWCSn79756RZQjfdBwhuOD6lsy4EVRrpudE7fGnxwbK5hDtO-86PAxR10bbJvQeozVAf3pGH5ULdZlH6xCwzqrhjtwUrgq4nPaCfD1vPzev8e795W2z3sXANO9iWeRagaSIzhmKEpkAnaRKYZFqrTOa5Mok3PAUFEABqcJEAs-lFNIYl_EFeTznHn07Fg2drcsAWFWuwbYPlrNEc2aoZKOUnaXg2xA8Fvboy9r5wTJqT8jsBZmdkI2ehym-z2rML44_QPwXxk9pEg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3128319061</pqid></control><display><type>article</type><title>Sequencing Strategy to Ensure Accurate Plasmid Assembly</title><source>American Chemical Society:Jisc Collections:American Chemical Society Read &amp; Publish Agreement 2022-2024 (Reading list)</source><creator>Hernandez, Sarah I ; Berezin, Casey-Tyler ; Miller, Katie M ; Peccoud, Samuel J ; Peccoud, Jean</creator><creatorcontrib>Hernandez, Sarah I ; Berezin, Casey-Tyler ; Miller, Katie M ; Peccoud, Samuel J ; Peccoud, Jean</creatorcontrib><description>Despite the wide use of plasmids in research and clinical production, the need to verify plasmid sequences is a bottleneck that is too often underestimated in the manufacturing process. Although sequencing platforms continue to improve, the method and assembly pipeline chosen still influence the final plasmid assembly sequence. Furthermore, few dedicated tools exist for plasmid assembly, especially for assembly. Here, we evaluated short-read, long-read, and hybrid (both short and long reads) assembly pipelines across three replicates of a 24-plasmid library. Consistent with previous characterizations of each sequencing technology, short-read assemblies had issues resolving GC-rich regions, and long-read assemblies commonly had small insertions and deletions, especially in repetitive regions. The hybrid approach facilitated the most accurate, consistent assembly generation and identified mutations relative to the reference sequence. Although Sanger sequencing can be used to verify specific regions, some GC-rich and repetitive regions were difficult to resolve using any method, suggesting that easily sequenced genetic parts should be prioritized in the design of new genetic constructs.</description><identifier>ISSN: 2161-5063</identifier><identifier>EISSN: 2161-5063</identifier><identifier>DOI: 10.1021/acssynbio.4c00539</identifier><identifier>PMID: 39508818</identifier><language>eng</language><publisher>United States</publisher><subject>Escherichia coli - genetics ; Gene Library ; High-Throughput Nucleotide Sequencing - methods ; Plasmids - genetics ; Sequence Analysis, DNA - methods</subject><ispartof>ACS synthetic biology, 2024-12, Vol.13 (12), p.4099</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c183t-6fd87c60eeaa90e6e15c82477ef4888b02d7923934c7ccfc47e26c3d665699ab3</cites><orcidid>0000-0001-7649-6127</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39508818$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hernandez, Sarah I</creatorcontrib><creatorcontrib>Berezin, Casey-Tyler</creatorcontrib><creatorcontrib>Miller, Katie M</creatorcontrib><creatorcontrib>Peccoud, Samuel J</creatorcontrib><creatorcontrib>Peccoud, Jean</creatorcontrib><title>Sequencing Strategy to Ensure Accurate Plasmid Assembly</title><title>ACS synthetic biology</title><addtitle>ACS Synth Biol</addtitle><description>Despite the wide use of plasmids in research and clinical production, the need to verify plasmid sequences is a bottleneck that is too often underestimated in the manufacturing process. Although sequencing platforms continue to improve, the method and assembly pipeline chosen still influence the final plasmid assembly sequence. Furthermore, few dedicated tools exist for plasmid assembly, especially for assembly. Here, we evaluated short-read, long-read, and hybrid (both short and long reads) assembly pipelines across three replicates of a 24-plasmid library. Consistent with previous characterizations of each sequencing technology, short-read assemblies had issues resolving GC-rich regions, and long-read assemblies commonly had small insertions and deletions, especially in repetitive regions. The hybrid approach facilitated the most accurate, consistent assembly generation and identified mutations relative to the reference sequence. Although Sanger sequencing can be used to verify specific regions, some GC-rich and repetitive regions were difficult to resolve using any method, suggesting that easily sequenced genetic parts should be prioritized in the design of new genetic constructs.</description><subject>Escherichia coli - genetics</subject><subject>Gene Library</subject><subject>High-Throughput Nucleotide Sequencing - methods</subject><subject>Plasmids - genetics</subject><subject>Sequence Analysis, DNA - methods</subject><issn>2161-5063</issn><issn>2161-5063</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkEtrwzAQhEVpaUKaH9BL8bEXp5JlvY4hpA8ItJD2LOT1Orj4kUr2wf--DnFD97LLMjMMHyH3jK4YTdiTgxCGJivbVQqUCm6uyDxhksWCSn79756RZQjfdBwhuOD6lsy4EVRrpudE7fGnxwbK5hDtO-86PAxR10bbJvQeozVAf3pGH5ULdZlH6xCwzqrhjtwUrgq4nPaCfD1vPzev8e795W2z3sXANO9iWeRagaSIzhmKEpkAnaRKYZFqrTOa5Mok3PAUFEABqcJEAs-lFNIYl_EFeTznHn07Fg2drcsAWFWuwbYPlrNEc2aoZKOUnaXg2xA8Fvboy9r5wTJqT8jsBZmdkI2ehym-z2rML44_QPwXxk9pEg</recordid><startdate>20241220</startdate><enddate>20241220</enddate><creator>Hernandez, Sarah I</creator><creator>Berezin, Casey-Tyler</creator><creator>Miller, Katie M</creator><creator>Peccoud, Samuel J</creator><creator>Peccoud, Jean</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-7649-6127</orcidid></search><sort><creationdate>20241220</creationdate><title>Sequencing Strategy to Ensure Accurate Plasmid Assembly</title><author>Hernandez, Sarah I ; Berezin, Casey-Tyler ; Miller, Katie M ; Peccoud, Samuel J ; Peccoud, Jean</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c183t-6fd87c60eeaa90e6e15c82477ef4888b02d7923934c7ccfc47e26c3d665699ab3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Escherichia coli - genetics</topic><topic>Gene Library</topic><topic>High-Throughput Nucleotide Sequencing - methods</topic><topic>Plasmids - genetics</topic><topic>Sequence Analysis, DNA - methods</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hernandez, Sarah I</creatorcontrib><creatorcontrib>Berezin, Casey-Tyler</creatorcontrib><creatorcontrib>Miller, Katie M</creatorcontrib><creatorcontrib>Peccoud, Samuel J</creatorcontrib><creatorcontrib>Peccoud, Jean</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>ACS synthetic biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hernandez, Sarah I</au><au>Berezin, Casey-Tyler</au><au>Miller, Katie M</au><au>Peccoud, Samuel J</au><au>Peccoud, Jean</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sequencing Strategy to Ensure Accurate Plasmid Assembly</atitle><jtitle>ACS synthetic biology</jtitle><addtitle>ACS Synth Biol</addtitle><date>2024-12-20</date><risdate>2024</risdate><volume>13</volume><issue>12</issue><spage>4099</spage><pages>4099-</pages><issn>2161-5063</issn><eissn>2161-5063</eissn><abstract>Despite the wide use of plasmids in research and clinical production, the need to verify plasmid sequences is a bottleneck that is too often underestimated in the manufacturing process. Although sequencing platforms continue to improve, the method and assembly pipeline chosen still influence the final plasmid assembly sequence. Furthermore, few dedicated tools exist for plasmid assembly, especially for assembly. Here, we evaluated short-read, long-read, and hybrid (both short and long reads) assembly pipelines across three replicates of a 24-plasmid library. Consistent with previous characterizations of each sequencing technology, short-read assemblies had issues resolving GC-rich regions, and long-read assemblies commonly had small insertions and deletions, especially in repetitive regions. The hybrid approach facilitated the most accurate, consistent assembly generation and identified mutations relative to the reference sequence. Although Sanger sequencing can be used to verify specific regions, some GC-rich and repetitive regions were difficult to resolve using any method, suggesting that easily sequenced genetic parts should be prioritized in the design of new genetic constructs.</abstract><cop>United States</cop><pmid>39508818</pmid><doi>10.1021/acssynbio.4c00539</doi><orcidid>https://orcid.org/0000-0001-7649-6127</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2161-5063
ispartof ACS synthetic biology, 2024-12, Vol.13 (12), p.4099
issn 2161-5063
2161-5063
language eng
recordid cdi_proquest_miscellaneous_3128319061
source American Chemical Society:Jisc Collections:American Chemical Society Read & Publish Agreement 2022-2024 (Reading list)
subjects Escherichia coli - genetics
Gene Library
High-Throughput Nucleotide Sequencing - methods
Plasmids - genetics
Sequence Analysis, DNA - methods
title Sequencing Strategy to Ensure Accurate Plasmid Assembly
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T22%3A12%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sequencing%20Strategy%20to%20Ensure%20Accurate%20Plasmid%20Assembly&rft.jtitle=ACS%20synthetic%20biology&rft.au=Hernandez,%20Sarah%20I&rft.date=2024-12-20&rft.volume=13&rft.issue=12&rft.spage=4099&rft.pages=4099-&rft.issn=2161-5063&rft.eissn=2161-5063&rft_id=info:doi/10.1021/acssynbio.4c00539&rft_dat=%3Cproquest_cross%3E3128319061%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c183t-6fd87c60eeaa90e6e15c82477ef4888b02d7923934c7ccfc47e26c3d665699ab3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3128319061&rft_id=info:pmid/39508818&rfr_iscdi=true