Loading…

Accurate assembly of minority viral haplotypes from next-generation sequencing through efficient noise reduction

Abstract Rapidly evolving RNA viruses continuously produce minority haplotypes that can become dominant if they are drug-resistant or can better evade the immune system. Therefore, early detection and identification of minority viral haplotypes may help to promptly adjust the patient’s treatment pla...

Full description

Saved in:
Bibliographic Details
Published in:Nucleic acids research 2021-09, Vol.49 (17), p.e102-e102
Main Authors: Knyazev, Sergey, Tsyvina, Viachaslau, Shankar, Anupama, Melnyk, Andrew, Artyomenko, Alexander, Malygina, Tatiana, Porozov, Yuri B, Campbell, Ellsworth M, Switzer, William M, Skums, Pavel, Mangul, Serghei, Zelikovsky, Alex
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Rapidly evolving RNA viruses continuously produce minority haplotypes that can become dominant if they are drug-resistant or can better evade the immune system. Therefore, early detection and identification of minority viral haplotypes may help to promptly adjust the patient’s treatment plan preventing potential disease complications. Minority haplotypes can be identified using next-generation sequencing, but sequencing noise hinders accurate identification. The elimination of sequencing noise is a non-trivial task that still remains open. Here we propose CliqueSNV based on extracting pairs of statistically linked mutations from noisy reads. This effectively reduces sequencing noise and enables identifying minority haplotypes with the frequency below the sequencing error rate. We comparatively assess the performance of CliqueSNV using an in vitro mixture of nine haplotypes that were derived from the mutation profile of an existing HIV patient. We show that CliqueSNV can accurately assemble viral haplotypes with frequencies as low as 0.1% and maintains consistent performance across short and long bases sequencing platforms.
ISSN:0305-1048
1362-4962
DOI:10.1093/nar/gkab576