Loading…
Improving the gene structure annotation of the apicomplexan parasite Neospora caninum fulfils a vital requirement towards an in silico-derived vaccine
[Display omitted] •In silico vaccine candidate discovery is dependent on precise protein sequences.•Current Neospora caninum sequences are derived from gene predictions and RNA-Seq.•We evaluate current Neospora protein coding sequences for potential inaccuracies.•Almost 28% of sequences found have q...
Saved in:
Published in: | International journal for parasitology 2015-04, Vol.45 (5), p.305-318 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | [Display omitted]
•In silico vaccine candidate discovery is dependent on precise protein sequences.•Current Neospora caninum sequences are derived from gene predictions and RNA-Seq.•We evaluate current Neospora protein coding sequences for potential inaccuracies.•Almost 28% of sequences found have questionable start codons and/or exon borders.•A strategy based on ATG context is used to more accurately determine start codons.
Neospora caninum is an apicomplexan parasite which can cause abortion in cattle, instigating major economic burden. Vaccination has been proposed as the most cost-effective control measure to alleviate this burden. Consequently the overriding aspiration for N. caninum research is the identification and subsequent evaluation of vaccine candidates in animal models. To save time, cost and effort, it is now feasible to use an in silico approach for vaccine candidate prediction. Precise protein sequences, derived from the correct open reading frame, are paramount and arguably the most important factor determining the success or failure of this approach. The challenge is that publicly available N. caninum sequences are mostly derived from gene predictions. Annotated inaccuracies can lead to erroneously predicted vaccine candidates by bioinformatics programs. This study evaluates the current N. caninum annotation for potential inaccuracies. Comparisons with annotation from a closely related pathogen, Toxoplasma gondii, are also made to distinguish patterns of inconsistency. More importantly, a mRNA sequencing (RNA-Seq) experiment is used to validate the annotation. Potential discrepancies originating from a questionable start codon context and exon boundaries were identified in 1943 protein coding sequences. We conclude, where experimental data were available, that the majority of N. caninum gene sequences were reliably predicted. Nevertheless, almost 28% of genes were identified as questionable. Given the limitations of RNA-Seq, the intention of this study was not to replace the existing annotation but to support or oppose particular aspects of it. Ideally, many studies aimed at improving the annotation are required to build a consensus. We believe this study, in providing a new resource on gene structure and annotation, is a worthy contributor to this endeavour. |
---|---|
ISSN: | 0020-7519 1879-0135 |
DOI: | 10.1016/j.ijpara.2015.01.006 |