Loading…
IVaCS: an Integrated Variant Calling System
The reduction in sequencing costs associated with next generation sequencing technologies (NGS) has led to a rapid upsurge in the amount of genome re-sequencing data, paving the way for the advent of personalized genomics and precision medicine. Accurate genotyping is crucial for effective analyses...
Saved in:
Published in: | PeerJ preprints 2016-07 |
---|---|
Main Authors: | , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The reduction in sequencing costs associated with next generation sequencing technologies (NGS) has led to a rapid upsurge in the amount of genome re-sequencing data, paving the way for the advent of personalized genomics and precision medicine. Accurate genotyping is crucial for effective analyses of these data, and in particular for the correct identification of candidate causal mutations in diagnostic screenings. The body of genome resequencing data will likely see exponential growth in the next few years, underlining the need for publicly available, accurate and time-effective bioinformatics systems for data analysis. Ideally, such systems should be easy to use and constantly updated as new genomes and software tools are released. Here we present IVaCS, a fully automated, highly accurate system with a web based graphical interface for genotyping and variant annotation. IVaCS offers state of the art tools for variant calling and annotation along with expert made pipelines for the analysis of whole genome sequencing (WGS), whole-exome sequencing (WES) and targeted resequencing (TGS) data, performing all steps from quality trimming to variant annotation. The system is specifically designed to assist users with little or no bioinformatics skills and all the pipelines are available through a user friendly web interface. The final output is provided in the form of a dynamic web page where variants can be selected on the base of user defined hard filters. A comprehensive report containing detailed information and statistics concerning the execution of each step of the pipelines is also generated. Extensive tests on publicly available genome resequencing data (Illumina platinum genome NA12878), show that our system recovers a slightly better sensitivity and a higher specificity than the commercial Illumina VCAT 2.0 software. IVaCS is implemented with a modular architecture and each module (quality trimming, reads mapping, variant calling, variant annotation) can be used independently. IVaCS may manage all the major commercial kits for exome sequencing, such as Illumina, Agilent or Nimblegen, along with a comprehensive collection of reference genomes (all the Illumina genomes, including human, mouse and cow, among the others) with corresponding genomic annotations. Finally, the software leverages an ensemble of publicly available resources (e.g., dbSNP, OMIM, COSMIC and ClinVar among others) for the functional annotation of human variants. Advanced users needin |
---|---|
ISSN: | 2167-9843 |
DOI: | 10.7287/peerj.preprints.2213v1 |