Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads

Next-generation sequencing (NGS) has transformed molecular biology and contributed to many seminal insights into genomic regulation and function. Apart from whole-genome sequencing, an NGS workflow involves alignment of the sequencing reads to the genome of study, after which the resulting alignment...

Bibliographic Details
Published in: PLoS computational biology 2021-04, Vol.17 (4), p.e1008926-e1008926
Main Authors: Shah, Rohan N, Ruthenburg, Alexander J
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303
cites cdi_FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303
container_end_page e1008926
container_issue 4
container_start_page e1008926
container_title PLoS computational biology
container_volume 17
creator Shah, Rohan N
Ruthenburg, Alexander J
description Next-generation sequencing (NGS) has transformed molecular biology and contributed to many seminal insights into genomic regulation and function. Apart from whole-genome sequencing, an NGS workflow involves alignment of the sequencing reads to the genome of study, after which the resulting alignments can be used for downstream analyses. However, alignment is complicated by the repetitive sequences; many reads align to more than one genomic locus, with 15-30% of the genome not being uniquely mappable by short-read NGS. This problem is typically addressed by discarding reads that do not uniquely map to the genome, but this practice can lead to systematic distortion of the data. Previous studies that developed methods for handling ambiguously mapped reads were often of limited applicability or were computationally intensive, hindering their broader usage. In this work, we present SmartMap: an algorithm that augments industry-standard aligners to enable usage of ambiguously mapped reads by assigning weights to each alignment with Bayesian analysis of the read distribution and alignment quality. SmartMap is computationally efficient, utilizing far fewer weighting iterations than previously thought necessary to process alignments and, as such, analyzing more than a billion alignments of NGS reads in approximately one hour on a desktop PC. By applying SmartMap to peak-type NGS data, including MNase-seq, ChIP-seq, and ATAC-seq in three organisms, we can increase read depth by up to 53% and increase the mapped proportion of the genome by up to 18% compared to analyses utilizing only uniquely mapped reads. We further show that SmartMap enables the analysis of more than 140,000 repetitive elements that could not be analyzed by traditional ChIP-seq workflows, and we utilize this method to gain insight into the epigenetic regulation of different classes of repetitive elements. These data emphasize both the dangers of discarding ambiguously mapped reads and their power for driving biological discovery.
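The abstract describes weighting each candidate alignment of an ambiguously mapped read using both alignment quality and the surrounding read distribution, refined over a small number of iterations. The sketch below illustrates that general idea only; it is not the SmartMap implementation, and the function name `reweight_alignments`, the `(locus, quality)` input layout, the `pseudocount`, and the specific update rule are all assumptions made for illustration.

```python
from collections import defaultdict


def reweight_alignments(reads, n_iterations=3, pseudocount=1.0):
    """Iteratively reweight candidate alignments (illustrative sketch only).

    reads: dict mapping read_id -> list of (locus, quality) tuples, where
           quality is a positive score for that alignment (hypothetical input).
    Returns a dict mapping read_id -> list of weights that sum to 1.
    """
    # Prior: weights proportional to alignment quality alone.
    weights = {}
    for read_id, alignments in reads.items():
        total = sum(quality for _, quality in alignments)
        weights[read_id] = [quality / total for _, quality in alignments]

    for _ in range(n_iterations):
        # Weighted read depth at each locus under the current weights.
        depth = defaultdict(float)
        for read_id, alignments in reads.items():
            for (locus, _), w in zip(alignments, weights[read_id]):
                depth[locus] += w

        # Posterior-style update: quality times the depth contributed by
        # all *other* alignments at the locus (plus a pseudocount so that
        # no candidate is ever assigned exactly zero weight).
        updated = {}
        for read_id, alignments in reads.items():
            scores = []
            for (locus, quality), w in zip(alignments, weights[read_id]):
                depth_from_others = depth[locus] - w
                scores.append(quality * (depth_from_others + pseudocount))
            total = sum(scores)
            updated[read_id] = [s / total for s in scores]
        weights = updated

    return weights


# Two uniquely mapped reads at locus "A" pull an ambiguous read toward "A".
example_reads = {
    "unique_1": [("A", 1.0)],
    "unique_2": [("A", 1.0)],
    "ambiguous": [("A", 1.0), ("B", 1.0)],
}
example_weights = reweight_alignments(example_reads)
```

In this toy input the ambiguous read ends up weighted more heavily toward locus "A", where the uniquely mapped reads already lie, rather than being discarded. A genome-scale tool would work on binned or per-base coverage instead of discrete locus labels.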
doi_str_mv 10.1371/journal.pcbi.1008926
format article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2528201584</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A660619597</galeid><doaj_id>oai_doaj_org_article_18b352bcedc94f33b62bcc9cc323f2b2</doaj_id><sourcerecordid>A660619597</sourcerecordid><originalsourceid>FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303</originalsourceid><addsrcrecordid>eNqVkt-L1DAQx4so3nn6H4gWfNGHXZNO0219EM7DHwuHgqfgW5imk16WtqlJq-5_75y7d9yKL1JIk8lnvjP5MknyWIqlhJV8ufFzGLBbjqZ2SylEWWXFneRYKgWLFajy7q39UfIgxo0QvK2K-8kRQLnKQMrj5NsFfZ9pMJQ2RCOF9KebLv08pXEXd0Ob9j7Qq_QNbik6HNJA0Xfz5PyQeptiX7t29nPstmmP40gNA9jEh8k9i12kR_v_SfL13dsvZx8W55_er89OzxemAJgWIKxVuGoAMatkDaYCyk2DghdT1qbC3BBw2CgrrbLYZJVVOXH3BdYg4CR5utMdOx_13pSoM5WVmZCqzJlY74jG40aPwfUYttqj038CPrQaw-RMR1qWNaisNtSYKrcAdcEHrm0gA5vVGWu93leb654pGqaA3YHo4c3gLnXrf-hScCdQssDzvUDwbHCcdO-ioa7DgdhE7luqolRcj9Fnf6H_ft1yR7XID3CD9VzX8NdQ74wfyDqOnxaFKGSlqhUnvDhIYGaiX1OLc4x6ffH5P9iPh2y-Y03wMQayN65Ioa8m9rp9fTWxej-xnPbktqM3SdcjCr8B9YTqHA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2528201584</pqid></control><display><type>article</type><title>Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads</title><source>Open Access: PubMed Central</source><source>Publicly Available Content (ProQuest)</source><creator>Shah, Rohan N ; Ruthenburg, Alexander J</creator><contributor>Ioshikhes, Ilya</contributor><creatorcontrib>Shah, Rohan N ; Ruthenburg, Alexander J ; Ioshikhes, Ilya</creatorcontrib><description>Next-generation sequencing (NGS) has transformed molecular biology and contributed to many seminal insights into genomic regulation and function. Apart from whole-genome sequencing, an NGS workflow involves alignment of the sequencing reads to the genome of study, after which the resulting alignments can be used for downstream analyses. 
However, alignment is complicated by the repetitive sequences; many reads align to more than one genomic locus, with 15-30% of the genome not being uniquely mappable by short-read NGS. This problem is typically addressed by discarding reads that do not uniquely map to the genome, but this practice can lead to systematic distortion of the data. Previous studies that developed methods for handling ambiguously mapped reads were often of limited applicability or were computationally intensive, hindering their broader usage. In this work, we present SmartMap: an algorithm that augments industry-standard aligners to enable usage of ambiguously mapped reads by assigning weights to each alignment with Bayesian analysis of the read distribution and alignment quality. SmartMap is computationally efficient, utilizing far fewer weighting iterations than previously thought necessary to process alignments and, as such, analyzing more than a billion alignments of NGS reads in approximately one hour on a desktop PC. By applying SmartMap to peak-type NGS data, including MNase-seq, ChIP-seq, and ATAC-seq in three organisms, we can increase read depth by up to 53% and increase the mapped proportion of the genome by up to 18% compared to analyses utilizing only uniquely mapped reads. We further show that SmartMap enables the analysis of more than 140,000 repetitive elements that could not be analyzed by traditional ChIP-seq workflows, and we utilize this method to gain insight into the epigenetic regulation of different classes of repetitive elements. 
These data emphasize both the dangers of discarding ambiguously mapped reads and their power for driving biological discovery.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1008926</identifier><identifier>PMID: 33872311</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Alignment ; Ambiguity ; Bayesian analysis ; Biology and Life Sciences ; Computer and Information Sciences ; Datasets ; DNA sequencing ; Engineering and Technology ; Gene expression ; Gene loci ; Genetic regulation ; Genetic research ; Genomes ; High-throughput screening (Biochemical assaying) ; Iterative methods ; Methods ; Molecular biology ; Nucleotide sequencing ; Physical Sciences ; Research and Analysis Methods ; Software ; Weight</subject><ispartof>PLoS computational biology, 2021-04, Vol.17 (4), p.e1008926-e1008926</ispartof><rights>COPYRIGHT 2021 Public Library of Science</rights><rights>2021 Shah, Ruthenburg. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2021 Shah, Ruthenburg 2021 Shah, Ruthenburg</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303</citedby><cites>FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303</cites><orcidid>0000-0003-2709-4564 ; 0000-0002-2646-7042</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2528201584/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2528201584?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793,75126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33872311$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Ioshikhes, Ilya</contributor><creatorcontrib>Shah, Rohan N</creatorcontrib><creatorcontrib>Ruthenburg, Alexander J</creatorcontrib><title>Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads</title><title>PLoS computational biology</title><addtitle>PLoS Comput Biol</addtitle><description>Next-generation sequencing (NGS) has transformed molecular biology and contributed to many seminal insights into genomic regulation and function. Apart from whole-genome sequencing, an NGS workflow involves alignment of the sequencing reads to the genome of study, after which the resulting alignments can be used for downstream analyses. 
However, alignment is complicated by the repetitive sequences; many reads align to more than one genomic locus, with 15-30% of the genome not being uniquely mappable by short-read NGS. This problem is typically addressed by discarding reads that do not uniquely map to the genome, but this practice can lead to systematic distortion of the data. Previous studies that developed methods for handling ambiguously mapped reads were often of limited applicability or were computationally intensive, hindering their broader usage. In this work, we present SmartMap: an algorithm that augments industry-standard aligners to enable usage of ambiguously mapped reads by assigning weights to each alignment with Bayesian analysis of the read distribution and alignment quality. SmartMap is computationally efficient, utilizing far fewer weighting iterations than previously thought necessary to process alignments and, as such, analyzing more than a billion alignments of NGS reads in approximately one hour on a desktop PC. By applying SmartMap to peak-type NGS data, including MNase-seq, ChIP-seq, and ATAC-seq in three organisms, we can increase read depth by up to 53% and increase the mapped proportion of the genome by up to 18% compared to analyses utilizing only uniquely mapped reads. We further show that SmartMap enables the analysis of more than 140,000 repetitive elements that could not be analyzed by traditional ChIP-seq workflows, and we utilize this method to gain insight into the epigenetic regulation of different classes of repetitive elements. 
These data emphasize both the dangers of discarding ambiguously mapped reads and their power for driving biological discovery.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Ambiguity</subject><subject>Bayesian analysis</subject><subject>Biology and Life Sciences</subject><subject>Computer and Information Sciences</subject><subject>Datasets</subject><subject>DNA sequencing</subject><subject>Engineering and Technology</subject><subject>Gene expression</subject><subject>Gene loci</subject><subject>Genetic regulation</subject><subject>Genetic research</subject><subject>Genomes</subject><subject>High-throughput screening (Biochemical assaying)</subject><subject>Iterative methods</subject><subject>Methods</subject><subject>Molecular biology</subject><subject>Nucleotide sequencing</subject><subject>Physical Sciences</subject><subject>Research and Analysis Methods</subject><subject>Software</subject><subject>Weight</subject><issn>1553-7358</issn><issn>1553-734X</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqVkt-L1DAQx4so3nn6H4gWfNGHXZNO0219EM7DHwuHgqfgW5imk16WtqlJq-5_75y7d9yKL1JIk8lnvjP5MknyWIqlhJV8ufFzGLBbjqZ2SylEWWXFneRYKgWLFajy7q39UfIgxo0QvK2K-8kRQLnKQMrj5NsFfZ9pMJQ2RCOF9KebLv08pXEXd0Ob9j7Qq_QNbik6HNJA0Xfz5PyQeptiX7t29nPstmmP40gNA9jEh8k9i12kR_v_SfL13dsvZx8W55_er89OzxemAJgWIKxVuGoAMatkDaYCyk2DghdT1qbC3BBw2CgrrbLYZJVVOXH3BdYg4CR5utMdOx_13pSoM5WVmZCqzJlY74jG40aPwfUYttqj038CPrQaw-RMR1qWNaisNtSYKrcAdcEHrm0gA5vVGWu93leb654pGqaA3YHo4c3gLnXrf-hScCdQssDzvUDwbHCcdO-ioa7DgdhE7luqolRcj9Fnf6H_ft1yR7XID3CD9VzX8NdQ74wfyDqOnxaFKGSlqhUnvDhIYGaiX1OLc4x6ffH5P9iPh2y-Y03wMQayN65Ioa8m9rp9fTWxej-xnPbktqM3SdcjCr8B9YTqHA</recordid><startdate>20210401</startdate><enddate>20210401</enddate><creator>Shah, Rohan N</creator><creator>Ruthenburg, Alexander J</creator><general>Public Library of 
Science</general><general>Public Library of Science (PLoS)</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7QP</scope><scope>7TK</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2709-4564</orcidid><orcidid>https://orcid.org/0000-0002-2646-7042</orcidid></search><sort><creationdate>20210401</creationdate><title>Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads</title><author>Shah, Rohan N ; Ruthenburg, Alexander J</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Ambiguity</topic><topic>Bayesian analysis</topic><topic>Biology and Life Sciences</topic><topic>Computer and Information Sciences</topic><topic>Datasets</topic><topic>DNA sequencing</topic><topic>Engineering and 
Technology</topic><topic>Gene expression</topic><topic>Gene loci</topic><topic>Genetic regulation</topic><topic>Genetic research</topic><topic>Genomes</topic><topic>High-throughput screening (Biochemical assaying)</topic><topic>Iterative methods</topic><topic>Methods</topic><topic>Molecular biology</topic><topic>Nucleotide sequencing</topic><topic>Physical Sciences</topic><topic>Research and Analysis Methods</topic><topic>Software</topic><topic>Weight</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shah, Rohan N</creatorcontrib><creatorcontrib>Ruthenburg, Alexander J</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Science (Gale in Context)</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science 
Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer science database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>ProQuest Biological Science Journals</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search 
Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shah, Rohan N</au><au>Ruthenburg, Alexander J</au><au>Ioshikhes, Ilya</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads</atitle><jtitle>PLoS computational biology</jtitle><addtitle>PLoS Comput Biol</addtitle><date>2021-04-01</date><risdate>2021</risdate><volume>17</volume><issue>4</issue><spage>e1008926</spage><epage>e1008926</epage><pages>e1008926-e1008926</pages><issn>1553-7358</issn><issn>1553-734X</issn><eissn>1553-7358</eissn><abstract>Next-generation sequencing (NGS) has transformed molecular biology and contributed to many seminal insights into genomic regulation and function. Apart from whole-genome sequencing, an NGS workflow involves alignment of the sequencing reads to the genome of study, after which the resulting alignments can be used for downstream analyses. However, alignment is complicated by the repetitive sequences; many reads align to more than one genomic locus, with 15-30% of the genome not being uniquely mappable by short-read NGS. This problem is typically addressed by discarding reads that do not uniquely map to the genome, but this practice can lead to systematic distortion of the data. Previous studies that developed methods for handling ambiguously mapped reads were often of limited applicability or were computationally intensive, hindering their broader usage. In this work, we present SmartMap: an algorithm that augments industry-standard aligners to enable usage of ambiguously mapped reads by assigning weights to each alignment with Bayesian analysis of the read distribution and alignment quality. SmartMap is computationally efficient, utilizing far fewer weighting iterations than previously thought necessary to process alignments and, as such, analyzing more than a billion alignments of NGS reads in approximately one hour on a desktop PC. 
By applying SmartMap to peak-type NGS data, including MNase-seq, ChIP-seq, and ATAC-seq in three organisms, we can increase read depth by up to 53% and increase the mapped proportion of the genome by up to 18% compared to analyses utilizing only uniquely mapped reads. We further show that SmartMap enables the analysis of more than 140,000 repetitive elements that could not be analyzed by traditional ChIP-seq workflows, and we utilize this method to gain insight into the epigenetic regulation of different classes of repetitive elements. These data emphasize both the dangers of discarding ambiguously mapped reads and their power for driving biological discovery.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>33872311</pmid><doi>10.1371/journal.pcbi.1008926</doi><orcidid>https://orcid.org/0000-0003-2709-4564</orcidid><orcidid>https://orcid.org/0000-0002-2646-7042</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1553-7358
ispartof PLoS computational biology, 2021-04, Vol.17 (4), p.e1008926-e1008926
issn 1553-7358
1553-734X
1553-7358
language eng
recordid cdi_plos_journals_2528201584
source Open Access: PubMed Central; Publicly Available Content (ProQuest)
subjects Algorithms
Alignment
Ambiguity
Bayesian analysis
Biology and Life Sciences
Computer and Information Sciences
Datasets
DNA sequencing
Engineering and Technology
Gene expression
Gene loci
Genetic regulation
Genetic research
Genomes
High-throughput screening (Biochemical assaying)
Iterative methods
Methods
Molecular biology
Nucleotide sequencing
Physical Sciences
Research and Analysis Methods
Software
Weight
title Sequence deeper without sequencing more: Bayesian resolution of ambiguously mapped reads
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T03%3A41%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sequence%20deeper%20without%20sequencing%20more:%20Bayesian%20resolution%20of%20ambiguously%20mapped%20reads&rft.jtitle=PLoS%20computational%20biology&rft.au=Shah,%20Rohan%20N&rft.date=2021-04-01&rft.volume=17&rft.issue=4&rft.spage=e1008926&rft.epage=e1008926&rft.pages=e1008926-e1008926&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1008926&rft_dat=%3Cgale_plos_%3EA660619597%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c633t-30ff5a7d3aa291b3c93e4cda04cdc8bc9a4ce33c9c5f1f5fad29f54e2316ab303%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2528201584&rft_id=info:pmid/33872311&rft_galeid=A660619597&rfr_iscdi=true