Loading…

Extension of Partial Gene Transcripts by Iterative Mapping of RNA-Seq Raw Reads

Many non-model organisms lack reference genomes and the sequencing and de novo assembly of an organisms transcriptome is an affordable means by which to characterize the coding component of its genome. Despite the advances that have made this possible, assembling a transcriptome without a known refe...

Full description

Saved in:
Bibliographic Details
Published in:IEEE/ACM transactions on computational biology and bioinformatics 2019-05, Vol.16 (3), p.1036-1041
Main Authors: Singh, Kumar Saurabh, Troczka, Bartlomiej J., Beadle, Katherine, Field, Linda M., Davies, T. G. Emyr, Williamson, Martin S., Nauen, Ralf, Bass, Chris
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Many non-model organisms lack reference genomes and the sequencing and de novo assembly of an organisms transcriptome is an affordable means by which to characterize the coding component of its genome. Despite the advances that have made this possible, assembling a transcriptome without a known reference usually results in a collection of full-length and partial gene transcripts. The downstream analysis of genes represented as partial transcripts then often requires further experimental work in the laboratory in order to obtain full- length sequences. We have explored whether partial transcripts, encoding genes of interest present in de novo assembled transcriptomes of a model and non-model insect species, could be further extended by iterative mapping against the raw transcriptome sequencing reads. Partial sequences encoding cytochrome P450s and carboxyl/cholinesterase were used in this analysis, because they are large multigene families and exhibit significant variation in expression. We present an effective method to improve the contiguity of partial transcripts in silico that, in the absence of a reference genome, may be a quick and cost-effective alternative to their extension by laboratory experimentation. Our approach resulted in the successful extension of incompletely assembled transcripts, often to full length. We experimentally validated these results in silico and using real-time PCR and sequencing.
ISSN:1545-5963
1557-9964
DOI:10.1109/TCBB.2018.2865309