Loading…

Recent advances in inferring viral diversity from high-throughput sequencing data

•Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols...

Full description

Saved in:
Bibliographic Details
Published in:Virus research 2017-07, Vol.239, p.17-32
Main Authors: Posada-Cespedes, Susana, Seifert, David, Beerenwinkel, Niko
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3
cites cdi_FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3
container_end_page 32
container_issue
container_start_page 17
container_title Virus research
container_volume 239
creator Posada-Cespedes, Susana
Seifert, David
Beerenwinkel, Niko
description •Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols developed to overcome limitations associated with short and error prone reads. Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments.
doi_str_mv 10.1016/j.virusres.2016.09.016
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1835393705</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0168170216304130</els_id><sourcerecordid>1835393705</sourcerecordid><originalsourceid>FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3</originalsourceid><addsrcrecordid>eNqFkFtLwzAYhoMobk7_wuilN605tElzpwxPMBBFr0OafNkytnYm7WD_3oxNb4XAS8Lz5U0ehKYEFwQTfrcqdj4MMUAsaNoXWBYpztCY1ILmopT0HI3TSZ0TgekIXcW4whhzJvglGlHBJaMSj9H7Bxho-0zbnW4NxMy3aTkIwbeLLHXodWb9DkL0_T5zodtkS79Y5v0ydMNiuR36LML3AK058Fb3-hpdOL2OcHPKCfp6evycveTzt-fX2cM8NxWr-pxbrhshMHNcM02BaLCm5k5iKUUtCWWlbZytLNGYlQJK6VxVmgYTW1KBgU3Q7fHebejSA2KvNj4aWK91C90QFalZxSQTuEooP6ImdDE5c2ob_EaHvSJYHXSqlfrVqQ46FZYqRRqcnjqGZgP2b-zXXwLujwCkn-48BBWNTzLA-gCmV7bz_3X8AD05i6c</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1835393705</pqid></control><display><type>article</type><title>Recent advances in inferring viral diversity from high-throughput sequencing data</title><source>ScienceDirect Freedom Collection</source><creator>Posada-Cespedes, Susana ; Seifert, David ; Beerenwinkel, Niko</creator><creatorcontrib>Posada-Cespedes, Susana ; Seifert, David ; Beerenwinkel, Niko</creatorcontrib><description>•Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols developed to overcome limitations associated with short and error prone reads. Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments.</description><identifier>ISSN: 0168-1702</identifier><identifier>EISSN: 1872-7492</identifier><identifier>DOI: 10.1016/j.virusres.2016.09.016</identifier><identifier>PMID: 27693290</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Genetic diversity ; Haplotype reconstruction ; Next-generation sequencing ; Viral quasispecies</subject><ispartof>Virus research, 2017-07, Vol.239, p.17-32</ispartof><rights>2016 The Authors</rights><rights>Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3</citedby><cites>FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27693290$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Posada-Cespedes, Susana</creatorcontrib><creatorcontrib>Seifert, David</creatorcontrib><creatorcontrib>Beerenwinkel, Niko</creatorcontrib><title>Recent advances in inferring viral diversity from high-throughput sequencing data</title><title>Virus research</title><addtitle>Virus Res</addtitle><description>•Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols developed to overcome limitations associated with short and error prone reads. Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments.</description><subject>Genetic diversity</subject><subject>Haplotype reconstruction</subject><subject>Next-generation sequencing</subject><subject>Viral quasispecies</subject><issn>0168-1702</issn><issn>1872-7492</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNqFkFtLwzAYhoMobk7_wuilN605tElzpwxPMBBFr0OafNkytnYm7WD_3oxNb4XAS8Lz5U0ehKYEFwQTfrcqdj4MMUAsaNoXWBYpztCY1ILmopT0HI3TSZ0TgekIXcW4whhzJvglGlHBJaMSj9H7Bxho-0zbnW4NxMy3aTkIwbeLLHXodWb9DkL0_T5zodtkS79Y5v0ydMNiuR36LML3AK058Fb3-hpdOL2OcHPKCfp6evycveTzt-fX2cM8NxWr-pxbrhshMHNcM02BaLCm5k5iKUUtCWWlbZytLNGYlQJK6VxVmgYTW1KBgU3Q7fHebejSA2KvNj4aWK91C90QFalZxSQTuEooP6ImdDE5c2ob_EaHvSJYHXSqlfrVqQ46FZYqRRqcnjqGZgP2b-zXXwLujwCkn-48BBWNTzLA-gCmV7bz_3X8AD05i6c</recordid><startdate>20170715</startdate><enddate>20170715</enddate><creator>Posada-Cespedes, Susana</creator><creator>Seifert, David</creator><creator>Beerenwinkel, Niko</creator><general>Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20170715</creationdate><title>Recent advances in inferring viral diversity from high-throughput sequencing data</title><author>Posada-Cespedes, Susana ; Seifert, David ; Beerenwinkel, Niko</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Genetic diversity</topic><topic>Haplotype reconstruction</topic><topic>Next-generation sequencing</topic><topic>Viral quasispecies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Posada-Cespedes, Susana</creatorcontrib><creatorcontrib>Seifert, David</creatorcontrib><creatorcontrib>Beerenwinkel, Niko</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Virus research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Posada-Cespedes, Susana</au><au>Seifert, David</au><au>Beerenwinkel, Niko</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recent advances in inferring viral diversity from high-throughput sequencing data</atitle><jtitle>Virus research</jtitle><addtitle>Virus Res</addtitle><date>2017-07-15</date><risdate>2017</risdate><volume>239</volume><spage>17</spage><epage>32</epage><pages>17-32</pages><issn>0168-1702</issn><eissn>1872-7492</eissn><abstract>•Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols developed to overcome limitations associated with short and error prone reads. Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>27693290</pmid><doi>10.1016/j.virusres.2016.09.016</doi><tpages>16</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0168-1702
ispartof Virus research, 2017-07, Vol.239, p.17-32
issn 0168-1702
1872-7492
language eng
recordid cdi_proquest_miscellaneous_1835393705
source ScienceDirect Freedom Collection
subjects Genetic diversity
Haplotype reconstruction
Next-generation sequencing
Viral quasispecies
title Recent advances in inferring viral diversity from high-throughput sequencing data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T04%3A25%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recent%20advances%20in%20inferring%20viral%20diversity%20from%20high-throughput%20sequencing%20data&rft.jtitle=Virus%20research&rft.au=Posada-Cespedes,%20Susana&rft.date=2017-07-15&rft.volume=239&rft.spage=17&rft.epage=32&rft.pages=17-32&rft.issn=0168-1702&rft.eissn=1872-7492&rft_id=info:doi/10.1016/j.virusres.2016.09.016&rft_dat=%3Cproquest_cross%3E1835393705%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c535t-6d6ab7703f6a3a2e1aedc86f90997891234dbfd5d1a0347e49ff54cb01d4270e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1835393705&rft_id=info:pmid/27693290&rfr_iscdi=true