Loading…

Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes

There is an increasing demand for evolutionary models to incorporate relatively realistic dynamics, ranging from selection at many genomic sites to complex demography, population structure, and ecological interactions. Such models can generally be implemented as individual‐based forward simulations,...

Full description

Saved in:
Bibliographic Details
Published in:Molecular ecology resources 2019-03, Vol.19 (2), p.552-566
Main Authors: Haller, Benjamin C., Galloway, Jared, Kelleher, Jerome, Messer, Philipp W., Ralph, Peter L.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503
cites cdi_FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503
container_end_page 566
container_issue 2
container_start_page 552
container_title Molecular ecology resources
container_volume 19
creator Haller, Benjamin C.
Galloway, Jared
Kelleher, Jerome
Messer, Philipp W.
Ralph, Peter L.
description There is an increasing demand for evolutionary models to incorporate relatively realistic dynamics, ranging from selection at many genomic sites to complex demography, population structure, and ecological interactions. Such models can generally be implemented as individual‐based forward simulations, but the large computational overhead of these models often makes simulation of whole chromosome sequences in large populations infeasible. This situation presents an important obstacle to the field that requires conceptual advances to overcome. The recently developed tree‐sequence recording method (Kelleher, Thornton, Ashander, & Ralph, 2018), which stores the genealogical history of all genomes in the simulated population, could provide such an advance. This method has several benefits: (1) it allows neutral mutations to be omitted entirely from forward‐time simulations and added later, thereby dramatically improving computational efficiency; (2) it allows neutral burn‐in to be constructed extremely efficiently after the fact, using “recapitation”; (3) it allows direct examination and analysis of the genealogical trees along the genome; and (4) it provides a compact representation of a population's genealogy that can be analysed in Python using the msprime package. We have implemented the tree‐sequence recording method in SLiM 3 (a free, open‐source evolutionary simulation software package) and extended it to allow the recording of non‐neutral mutations, greatly broadening the utility of this method. To demonstrate the versatility and performance of this approach, we showcase several practical applications that would have been beyond the reach of previously existing methods, opening up new horizons for the modelling and exploration of evolutionary processes.
doi_str_mv 10.1111/1755-0998.12968
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6393187</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2186242143</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503</originalsourceid><addsrcrecordid>eNqFkU9PFDEYxhujEUTO3kgTL1wW2k7_XkgMATVZ9CAm3JqZzju7JTPt0u6wwZMfwc_oJ7Hj4ga92KTpv9_ztG8fhN5QckJLO6VKiBkxRp9QZqR-hvZ3O893c32zh17lfEuIJEbxl2ivIkIKrdk-ctcJ4Of3HxnuRggOcAIXU-vDAvuAv8z9FY4rCBkH2OBlTP5bLIsupqlv6tQW7doPgLMfxr5e-xhw7PBmGXvACwhxgPwavejqPsPh43iAvl5eXJ9_mM0_v_94_m4-c1wqPQNZSdYwQhm0laGEM14JoYk0DRiiVasbypVQjDtBoRGKdEoUUctY7TpBqgN0tvVdjc0ArYOwTnVvV8kPdXqwsfb275Pgl3YR762sTEW1KgbHjwYplu_Iazv47KDv6wBxzJZRoYVQhk93vf0HvY1jCqW8QmnJOKO8KtTplnIp5pyg2z2GEjsFaKeI7BSX_R1gURw9rWHH_0msAGILbHwPD__zs1cXn7bGvwCXSKeF</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2186242143</pqid></control><display><type>article</type><title>Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes</title><source>Wiley-Blackwell Read &amp; Publish Collection</source><creator>Haller, Benjamin C. ; Galloway, Jared ; Kelleher, Jerome ; Messer, Philipp W. ; Ralph, Peter L.</creator><creatorcontrib>Haller, Benjamin C. ; Galloway, Jared ; Kelleher, Jerome ; Messer, Philipp W. ; Ralph, Peter L.</creatorcontrib><description>There is an increasing demand for evolutionary models to incorporate relatively realistic dynamics, ranging from selection at many genomic sites to complex demography, population structure, and ecological interactions. Such models can generally be implemented as individual‐based forward simulations, but the large computational overhead of these models often makes simulation of whole chromosome sequences in large populations infeasible. This situation presents an important obstacle to the field that requires conceptual advances to overcome. The recently developed tree‐sequence recording method (Kelleher, Thornton, Ashander, &amp; Ralph, 2018), which stores the genealogical history of all genomes in the simulated population, could provide such an advance. This method has several benefits: (1) it allows neutral mutations to be omitted entirely from forward‐time simulations and added later, thereby dramatically improving computational efficiency; (2) it allows neutral burn‐in to be constructed extremely efficiently after the fact, using “recapitation”; (3) it allows direct examination and analysis of the genealogical trees along the genome; and (4) it provides a compact representation of a population's genealogy that can be analysed in Python using the msprime package. We have implemented the tree‐sequence recording method in SLiM 3 (a free, open‐source evolutionary simulation software package) and extended it to allow the recording of non‐neutral mutations, greatly broadening the utility of this method. To demonstrate the versatility and performance of this approach, we showcase several practical applications that would have been beyond the reach of previously existing methods, opening up new horizons for the modelling and exploration of evolutionary processes.</description><identifier>ISSN: 1755-098X</identifier><identifier>EISSN: 1755-0998</identifier><identifier>DOI: 10.1111/1755-0998.12968</identifier><identifier>PMID: 30565882</identifier><language>eng</language><publisher>England: Wiley Subscription Services, Inc</publisher><subject>background selection ; Biological Evolution ; coalescent ; Computational Biology ; Computer applications ; Computer Simulation ; Computing time ; Demography ; Evolution ; genealogical history ; Genealogy ; Genetics, Population - methods ; Genomes ; Mathematical models ; Mutation ; pedigree recording ; Population structure ; Recording ; selective sweeps ; Simulation ; Software ; tree sequences</subject><ispartof>Molecular ecology resources, 2019-03, Vol.19 (2), p.552-566</ispartof><rights>2018 John Wiley &amp; Sons Ltd</rights><rights>2018 John Wiley &amp; Sons Ltd.</rights><rights>Copyright © 2019 John Wiley &amp; Sons Ltd</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503</citedby><cites>FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503</cites><orcidid>0000-0002-7894-5253 ; 0000-0003-1874-8327 ; 0000-0002-9459-6866 ; 0000-0001-8453-9377</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30565882$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Haller, Benjamin C.</creatorcontrib><creatorcontrib>Galloway, Jared</creatorcontrib><creatorcontrib>Kelleher, Jerome</creatorcontrib><creatorcontrib>Messer, Philipp W.</creatorcontrib><creatorcontrib>Ralph, Peter L.</creatorcontrib><title>Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes</title><title>Molecular ecology resources</title><addtitle>Mol Ecol Resour</addtitle><description>There is an increasing demand for evolutionary models to incorporate relatively realistic dynamics, ranging from selection at many genomic sites to complex demography, population structure, and ecological interactions. Such models can generally be implemented as individual‐based forward simulations, but the large computational overhead of these models often makes simulation of whole chromosome sequences in large populations infeasible. This situation presents an important obstacle to the field that requires conceptual advances to overcome. The recently developed tree‐sequence recording method (Kelleher, Thornton, Ashander, &amp; Ralph, 2018), which stores the genealogical history of all genomes in the simulated population, could provide such an advance. This method has several benefits: (1) it allows neutral mutations to be omitted entirely from forward‐time simulations and added later, thereby dramatically improving computational efficiency; (2) it allows neutral burn‐in to be constructed extremely efficiently after the fact, using “recapitation”; (3) it allows direct examination and analysis of the genealogical trees along the genome; and (4) it provides a compact representation of a population's genealogy that can be analysed in Python using the msprime package. We have implemented the tree‐sequence recording method in SLiM 3 (a free, open‐source evolutionary simulation software package) and extended it to allow the recording of non‐neutral mutations, greatly broadening the utility of this method. To demonstrate the versatility and performance of this approach, we showcase several practical applications that would have been beyond the reach of previously existing methods, opening up new horizons for the modelling and exploration of evolutionary processes.</description><subject>background selection</subject><subject>Biological Evolution</subject><subject>coalescent</subject><subject>Computational Biology</subject><subject>Computer applications</subject><subject>Computer Simulation</subject><subject>Computing time</subject><subject>Demography</subject><subject>Evolution</subject><subject>genealogical history</subject><subject>Genealogy</subject><subject>Genetics, Population - methods</subject><subject>Genomes</subject><subject>Mathematical models</subject><subject>Mutation</subject><subject>pedigree recording</subject><subject>Population structure</subject><subject>Recording</subject><subject>selective sweeps</subject><subject>Simulation</subject><subject>Software</subject><subject>tree sequences</subject><issn>1755-098X</issn><issn>1755-0998</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNqFkU9PFDEYxhujEUTO3kgTL1wW2k7_XkgMATVZ9CAm3JqZzju7JTPt0u6wwZMfwc_oJ7Hj4ga92KTpv9_ztG8fhN5QckJLO6VKiBkxRp9QZqR-hvZ3O893c32zh17lfEuIJEbxl2ivIkIKrdk-ctcJ4Of3HxnuRggOcAIXU-vDAvuAv8z9FY4rCBkH2OBlTP5bLIsupqlv6tQW7doPgLMfxr5e-xhw7PBmGXvACwhxgPwavejqPsPh43iAvl5eXJ9_mM0_v_94_m4-c1wqPQNZSdYwQhm0laGEM14JoYk0DRiiVasbypVQjDtBoRGKdEoUUctY7TpBqgN0tvVdjc0ArYOwTnVvV8kPdXqwsfb275Pgl3YR762sTEW1KgbHjwYplu_Iazv47KDv6wBxzJZRoYVQhk93vf0HvY1jCqW8QmnJOKO8KtTplnIp5pyg2z2GEjsFaKeI7BSX_R1gURw9rWHH_0msAGILbHwPD__zs1cXn7bGvwCXSKeF</recordid><startdate>201903</startdate><enddate>201903</enddate><creator>Haller, Benjamin C.</creator><creator>Galloway, Jared</creator><creator>Kelleher, Jerome</creator><creator>Messer, Philipp W.</creator><creator>Ralph, Peter L.</creator><general>Wiley Subscription Services, Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SN</scope><scope>7SS</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-7894-5253</orcidid><orcidid>https://orcid.org/0000-0003-1874-8327</orcidid><orcidid>https://orcid.org/0000-0002-9459-6866</orcidid><orcidid>https://orcid.org/0000-0001-8453-9377</orcidid></search><sort><creationdate>201903</creationdate><title>Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes</title><author>Haller, Benjamin C. ; Galloway, Jared ; Kelleher, Jerome ; Messer, Philipp W. ; Ralph, Peter L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>background selection</topic><topic>Biological Evolution</topic><topic>coalescent</topic><topic>Computational Biology</topic><topic>Computer applications</topic><topic>Computer Simulation</topic><topic>Computing time</topic><topic>Demography</topic><topic>Evolution</topic><topic>genealogical history</topic><topic>Genealogy</topic><topic>Genetics, Population - methods</topic><topic>Genomes</topic><topic>Mathematical models</topic><topic>Mutation</topic><topic>pedigree recording</topic><topic>Population structure</topic><topic>Recording</topic><topic>selective sweeps</topic><topic>Simulation</topic><topic>Software</topic><topic>tree sequences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Haller, Benjamin C.</creatorcontrib><creatorcontrib>Galloway, Jared</creatorcontrib><creatorcontrib>Kelleher, Jerome</creatorcontrib><creatorcontrib>Messer, Philipp W.</creatorcontrib><creatorcontrib>Ralph, Peter L.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Molecular ecology resources</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Haller, Benjamin C.</au><au>Galloway, Jared</au><au>Kelleher, Jerome</au><au>Messer, Philipp W.</au><au>Ralph, Peter L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes</atitle><jtitle>Molecular ecology resources</jtitle><addtitle>Mol Ecol Resour</addtitle><date>2019-03</date><risdate>2019</risdate><volume>19</volume><issue>2</issue><spage>552</spage><epage>566</epage><pages>552-566</pages><issn>1755-098X</issn><eissn>1755-0998</eissn><abstract>There is an increasing demand for evolutionary models to incorporate relatively realistic dynamics, ranging from selection at many genomic sites to complex demography, population structure, and ecological interactions. Such models can generally be implemented as individual‐based forward simulations, but the large computational overhead of these models often makes simulation of whole chromosome sequences in large populations infeasible. This situation presents an important obstacle to the field that requires conceptual advances to overcome. The recently developed tree‐sequence recording method (Kelleher, Thornton, Ashander, &amp; Ralph, 2018), which stores the genealogical history of all genomes in the simulated population, could provide such an advance. This method has several benefits: (1) it allows neutral mutations to be omitted entirely from forward‐time simulations and added later, thereby dramatically improving computational efficiency; (2) it allows neutral burn‐in to be constructed extremely efficiently after the fact, using “recapitation”; (3) it allows direct examination and analysis of the genealogical trees along the genome; and (4) it provides a compact representation of a population's genealogy that can be analysed in Python using the msprime package. We have implemented the tree‐sequence recording method in SLiM 3 (a free, open‐source evolutionary simulation software package) and extended it to allow the recording of non‐neutral mutations, greatly broadening the utility of this method. To demonstrate the versatility and performance of this approach, we showcase several practical applications that would have been beyond the reach of previously existing methods, opening up new horizons for the modelling and exploration of evolutionary processes.</abstract><cop>England</cop><pub>Wiley Subscription Services, Inc</pub><pmid>30565882</pmid><doi>10.1111/1755-0998.12968</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-7894-5253</orcidid><orcidid>https://orcid.org/0000-0003-1874-8327</orcidid><orcidid>https://orcid.org/0000-0002-9459-6866</orcidid><orcidid>https://orcid.org/0000-0001-8453-9377</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1755-098X
ispartof Molecular ecology resources, 2019-03, Vol.19 (2), p.552-566
issn 1755-098X
1755-0998
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6393187
source Wiley-Blackwell Read & Publish Collection
subjects background selection
Biological Evolution
coalescent
Computational Biology
Computer applications
Computer Simulation
Computing time
Demography
Evolution
genealogical history
Genealogy
Genetics, Population - methods
Genomes
Mathematical models
Mutation
pedigree recording
Population structure
Recording
selective sweeps
Simulation
Software
tree sequences
title Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T08%3A39%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Tree%E2%80%90sequence%20recording%20in%20SLiM%20opens%20new%20horizons%20for%20forward%E2%80%90time%20simulation%20of%20whole%20genomes&rft.jtitle=Molecular%20ecology%20resources&rft.au=Haller,%20Benjamin%20C.&rft.date=2019-03&rft.volume=19&rft.issue=2&rft.spage=552&rft.epage=566&rft.pages=552-566&rft.issn=1755-098X&rft.eissn=1755-0998&rft_id=info:doi/10.1111/1755-0998.12968&rft_dat=%3Cproquest_pubme%3E2186242143%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c4678-e6362b2012ed39104243558069be9087d8b1475724c51eb570f75e63d22acf503%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2186242143&rft_id=info:pmid/30565882&rfr_iscdi=true