Loading…
Recent advances in inferring viral diversity from high-throughput sequencing data
•Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols...
Saved in:
Published in: | Virus research 2017-07, Vol.239, p.17-32 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols developed to overcome limitations associated with short and error prone reads.
Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments. |
---|---|
ISSN: | 0168-1702 1872-7492 |
DOI: | 10.1016/j.virusres.2016.09.016 |