
SW1PerS: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data

Identifying periodically expressed genes across different processes (e.g. the cell and metabolic cycles, circadian rhythms, etc) is a central problem in computational biology. Biological time series may contain (multiple) unknown signal shapes of systemic relevance, imperfections like noise, damping...

Bibliographic Details
Published in: BMC bioinformatics 2015-08, Vol.16 (1), p.257, Article 257
Main Authors: Perea, Jose A; Deckard, Anastasia; Haase, Steve B; Harer, John
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203
cites cdi_FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203
container_end_page
container_issue 1
container_start_page 257
container_title BMC bioinformatics
container_volume 16
creator Perea, Jose A
Deckard, Anastasia
Haase, Steve B
Harer, John
description Identifying periodically expressed genes across different processes (e.g. the cell and metabolic cycles, circadian rhythms, etc.) is a central problem in computational biology. Biological time series may contain (multiple) unknown signal shapes of systemic relevance, imperfections like noise, damping, and trending, or limited sampling density. While there exist methods for detecting periodicity, their design biases (e.g. toward a specific signal shape) can limit their applicability in one or more of these situations. We present in this paper a novel method, SW1PerS, for quantifying periodicity in time series in a shape-agnostic manner and with resistance to damping. The measurement is performed directly, without presupposing a particular pattern, by evaluating the circularity of a high-dimensional representation of the signal. SW1PerS is compared to other algorithms using synthetic data, and performance is quantified under varying noise models, noise levels, sampling densities, and signal shapes. Results on biological data are also analyzed and compared. On the task of periodic/not-periodic classification, using synthetic data, SW1PerS outperforms all other algorithms in the low-noise regime. SW1PerS is shown to be the most shape-agnostic of the evaluated methods, and the only one to consistently classify damped signals as highly periodic. On biological data, and for several experiments, the lists of top 10% genes ranked with SW1PerS recover up to 67% of those generated with other popular algorithms. Moreover, the list of genes from data on the yeast metabolic cycle that are highly ranked only by SW1PerS contains evidently non-cosine patterns (e.g. ECM33, CDC9, SAM1,2 and MSH6) with highly periodic expression profiles. In data from the yeast cell cycle, SW1PerS identifies genes not preferred by other algorithms, hence not previously reported as periodic, but found in other experiments such as the universal growth rate response of Slavov. These genes are BOP3, CDC10, YIL108W, YER034W, MLP1, PAC2 and RTT101. In biological systems with low noise, i.e. where periodic signals with interesting shapes are more likely to occur, SW1PerS can be used as a powerful tool in exploratory analyses. Indeed, by having an initial set of periodic genes with a rich variety of signal types, pattern/shape information can be included in the study of systems and the generation of hypotheses regarding the structure of gene regulatory networks.
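The abstract describes scoring periodicity through "the circularity of a high-dimensional representation of the signal". The Python sketch below illustrates only the sliding-window half of that idea, under stated assumptions: each window of consecutive samples becomes one point in a higher-dimensional space, and a periodic signal then traces a closed loop. The function names, the window size `dim`, the step `tau`, and the cosine test signal are hypothetical choices made for illustration and are not taken from the paper or from the authors' software. The 1-persistence scoring step itself is not implemented here.

```python
import math


def sliding_window_embedding(signal, dim, tau=1):
    """Return the point cloud of sliding windows (delay vectors).

    Window t is (signal[t], signal[t + tau], ..., signal[t + dim * tau]),
    so every point has dim + 1 coordinates. Illustrative helper only.
    """
    span = dim * tau
    if span >= len(signal):
        raise ValueError("window is longer than the signal")
    return [
        [signal[t + k * tau] for k in range(dim + 1)]
        for t in range(len(signal) - span)
    ]


def euclidean(p, q):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# Hypothetical test signal: a cosine with a period of 12 samples.
period = 12
signal = [math.cos(2 * math.pi * t / period) for t in range(36)]
cloud = sliding_window_embedding(signal, dim=6, tau=1)

# Windows one full period apart coincide, so the point cloud closes up
# into a loop; windows half a period apart lie on opposite sides of it.
loop_gap = euclidean(cloud[0], cloud[period])
half_gap = euclidean(cloud[0], cloud[period // 2])
print(f"gap after one period:  {loop_gap:.3e}")
print(f"gap after half period: {half_gap:.3f}")
```

For this noiseless signal the gap after one full period is numerically zero while the gap after half a period is large, which is the loop structure a circularity score is meant to detect. A full pipeline would go on to quantify that loop with 1-dimensional persistent homology, which this sketch leaves out.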
doi_str_mv 10.1186/s12859-015-0645-6
format article
fullrecord <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4537550</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A541357915</galeid><sourcerecordid>A541357915</sourcerecordid><originalsourceid>FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203</originalsourceid><addsrcrecordid>eNptkt1r1jAUxosobk7_AG8k4I1edMtJk6ZVEMbQORgovoqXIUtOa0abvEvaffz3prz7ekVykZPk9zzhHJ6ieA10H6CpDxKwRrQlBVHSmouyflLsApdQMqDi6aN6p3iR0jmlIBsqnhc7rGZScsZ3i7D6Dd8xrj6Q1eCs8z25ct6Gq0S0twTKNcbk0oTeIEkmxEx8JNbl8hKXA8mAC9YZN90Q50mPHgleryOm5IInkxuzMDOYiNWTflk86_SQ8NXtvlf8-vL559HX8vTb8cnR4WlpRMunsgXWcYa0s9JqIbXBumq4ZaKqNZVaGt7UtqNUnzUcuOGd5K2uDOiurVAyWu0Vnza-6_lsRGvQT1EPah3dqOONCtqp7Rfv_qg-XCouKinEYvDu1iCGixnTpMbcNQ6D9hjmpEBSQWkeP2T07T_oeZijz-2pZeDAGJPVA9XrAZXzXcj_msVUHQoOlZAtiEzt_4fKy-LoTPDYuXy_JXi_JcjMhNdTr-eU1MnqxzYLG9bEkFLE7n4eQNWSKLVJlMqJUkuiVJ01bx4P8l5xF6HqLwZmxY8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1780122273</pqid></control><display><type>article</type><title>SW1PerS: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data</title><source>Publicly Available Content Database</source><source>PubMed</source><creator>Perea, Jose A ; Deckard, Anastasia ; Haase, Steve B ; Harer, John</creator><creatorcontrib>Perea, Jose A ; Deckard, Anastasia ; Haase, Steve B ; Harer, John</creatorcontrib><description>Identifying periodically expressed genes across different processes (e.g. the cell and metabolic cycles, circadian rhythms, etc) is a central problem in computational biology. Biological time series may contain (multiple) unknown signal shapes of systemic relevance, imperfections like noise, damping, and trending, or limited sampling density. While there exist methods for detecting periodicity, their design biases (e.g. 
toward a specific signal shape) can limit their applicability in one or more of these situations. We present in this paper a novel method, SW1PerS, for quantifying periodicity in time series in a shape-agnostic manner and with resistance to damping. The measurement is performed directly, without presupposing a particular pattern, by evaluating the circularity of a high-dimensional representation of the signal. SW1PerS is compared to other algorithms using synthetic data and performance is quantified under varying noise models, noise levels, sampling densities, and signal shapes. Results on biological data are also analyzed and compared. On the task of periodic/not-periodic classification, using synthetic data, SW1PerS outperforms all other algorithms in the low-noise regime. SW1PerS is shown to be the most shape-agnostic of the evaluated methods, and the only one to consistently classify damped signals as highly periodic. On biological data, and for several experiments, the lists of top 10% genes ranked with SW1PerS recover up to 67% of those generated with other popular algorithms. Moreover, the list of genes from data on the Yeast metabolic cycle which are highly-ranked only by SW1PerS, contains evidently non-cosine patterns (e.g. ECM33, CDC9, SAM1,2 and MSH6) with highly periodic expression profiles. In data from the Yeast cell cycle SW1PerS identifies genes not preferred by other algorithms, hence not previously reported as periodic, but found in other experiments such as the universal growth rate response of Slavov. These genes are BOP3, CDC10, YIL108W, YER034W, MLP1, PAC2 and RTT101. In biological systems with low noise, i.e. where periodic signals with interesting shapes are more likely to occur, SW1PerS can be used as a powerful tool in exploratory analyses. 
Indeed, by having an initial set of periodic genes with a rich variety of signal types, pattern/shape information can be included in the study of systems and the generation of hypotheses regarding the structure of gene regulatory networks.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-015-0645-6</identifier><identifier>PMID: 26277424</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Algorithms ; Area Under Curve ; Cell Division ; Circadian Rhythm ; Gene Expression Profiling - methods ; Methodology ; Oligonucleotide Array Sequence Analysis ; ROC Curve ; Saccharomyces cerevisiae - genetics ; Saccharomyces cerevisiae - metabolism</subject><ispartof>BMC bioinformatics, 2015-08, Vol.16 (1), p.257, Article 257</ispartof><rights>COPYRIGHT 2015 BioMed Central Ltd.</rights><rights>Copyright BioMed Central 2015</rights><rights>Perea et al. 2015</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203</citedby><cites>FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4537550/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1780122273?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26277424$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Perea, Jose 
A</creatorcontrib><creatorcontrib>Deckard, Anastasia</creatorcontrib><creatorcontrib>Haase, Steve B</creatorcontrib><creatorcontrib>Harer, John</creatorcontrib><title>SW1PerS: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>Identifying periodically expressed genes across different processes (e.g. the cell and metabolic cycles, circadian rhythms, etc) is a central problem in computational biology. Biological time series may contain (multiple) unknown signal shapes of systemic relevance, imperfections like noise, damping, and trending, or limited sampling density. While there exist methods for detecting periodicity, their design biases (e.g. toward a specific signal shape) can limit their applicability in one or more of these situations. We present in this paper a novel method, SW1PerS, for quantifying periodicity in time series in a shape-agnostic manner and with resistance to damping. The measurement is performed directly, without presupposing a particular pattern, by evaluating the circularity of a high-dimensional representation of the signal. SW1PerS is compared to other algorithms using synthetic data and performance is quantified under varying noise models, noise levels, sampling densities, and signal shapes. Results on biological data are also analyzed and compared. On the task of periodic/not-periodic classification, using synthetic data, SW1PerS outperforms all other algorithms in the low-noise regime. SW1PerS is shown to be the most shape-agnostic of the evaluated methods, and the only one to consistently classify damped signals as highly periodic. On biological data, and for several experiments, the lists of top 10% genes ranked with SW1PerS recover up to 67% of those generated with other popular algorithms. 
Moreover, the list of genes from data on the Yeast metabolic cycle which are highly-ranked only by SW1PerS, contains evidently non-cosine patterns (e.g. ECM33, CDC9, SAM1,2 and MSH6) with highly periodic expression profiles. In data from the Yeast cell cycle SW1PerS identifies genes not preferred by other algorithms, hence not previously reported as periodic, but found in other experiments such as the universal growth rate response of Slavov. These genes are BOP3, CDC10, YIL108W, YER034W, MLP1, PAC2 and RTT101. In biological systems with low noise, i.e. where periodic signals with interesting shapes are more likely to occur, SW1PerS can be used as a powerful tool in exploratory analyses. Indeed, by having an initial set of periodic genes with a rich variety of signal types, pattern/shape information can be included in the study of systems and the generation of hypotheses regarding the structure of gene regulatory networks.</description><subject>Algorithms</subject><subject>Area Under Curve</subject><subject>Cell Division</subject><subject>Circadian Rhythm</subject><subject>Gene Expression Profiling - methods</subject><subject>Methodology</subject><subject>Oligonucleotide Array Sequence Analysis</subject><subject>ROC Curve</subject><subject>Saccharomyces cerevisiae - genetics</subject><subject>Saccharomyces cerevisiae - 
metabolism</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNptkt1r1jAUxosobk7_AG8k4I1edMtJk6ZVEMbQORgovoqXIUtOa0abvEvaffz3prz7ekVykZPk9zzhHJ6ieA10H6CpDxKwRrQlBVHSmouyflLsApdQMqDi6aN6p3iR0jmlIBsqnhc7rGZScsZ3i7D6Dd8xrj6Q1eCs8z25ct6Gq0S0twTKNcbk0oTeIEkmxEx8JNbl8hKXA8mAC9YZN90Q50mPHgleryOm5IInkxuzMDOYiNWTflk86_SQ8NXtvlf8-vL559HX8vTb8cnR4WlpRMunsgXWcYa0s9JqIbXBumq4ZaKqNZVaGt7UtqNUnzUcuOGd5K2uDOiurVAyWu0Vnza-6_lsRGvQT1EPah3dqOONCtqp7Rfv_qg-XCouKinEYvDu1iCGixnTpMbcNQ6D9hjmpEBSQWkeP2T07T_oeZijz-2pZeDAGJPVA9XrAZXzXcj_msVUHQoOlZAtiEzt_4fKy-LoTPDYuXy_JXi_JcjMhNdTr-eU1MnqxzYLG9bEkFLE7n4eQNWSKLVJlMqJUkuiVJ01bx4P8l5xF6HqLwZmxY8</recordid><startdate>20150816</startdate><enddate>20150816</enddate><creator>Perea, Jose A</creator><creator>Deckard, Anastasia</creator><creator>Haase, Steve B</creator><creator>Harer, John</creator><general>BioMed Central Ltd</general><general>BioMed 
Central</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20150816</creationdate><title>SW1PerS: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data</title><author>Perea, Jose A ; Deckard, Anastasia ; Haase, Steve B ; Harer, John</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Algorithms</topic><topic>Area Under Curve</topic><topic>Cell Division</topic><topic>Circadian Rhythm</topic><topic>Gene Expression Profiling - methods</topic><topic>Methodology</topic><topic>Oligonucleotide Array Sequence Analysis</topic><topic>ROC Curve</topic><topic>Saccharomyces cerevisiae - 
genetics</topic><topic>Saccharomyces cerevisiae - metabolism</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Perea, Jose A</creatorcontrib><creatorcontrib>Deckard, Anastasia</creatorcontrib><creatorcontrib>Haase, Steve B</creatorcontrib><creatorcontrib>Harer, John</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central 
Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest Biological Science Collection</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>Biological Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Perea, Jose A</au><au>Deckard, Anastasia</au><au>Haase, Steve B</au><au>Harer, John</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SW1PerS: Sliding 
windows and 1-persistence scoring; discovering periodicity in gene expression time series data</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2015-08-16</date><risdate>2015</risdate><volume>16</volume><issue>1</issue><spage>257</spage><pages>257-</pages><artnum>257</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Identifying periodically expressed genes across different processes (e.g. the cell and metabolic cycles, circadian rhythms, etc) is a central problem in computational biology. Biological time series may contain (multiple) unknown signal shapes of systemic relevance, imperfections like noise, damping, and trending, or limited sampling density. While there exist methods for detecting periodicity, their design biases (e.g. toward a specific signal shape) can limit their applicability in one or more of these situations. We present in this paper a novel method, SW1PerS, for quantifying periodicity in time series in a shape-agnostic manner and with resistance to damping. The measurement is performed directly, without presupposing a particular pattern, by evaluating the circularity of a high-dimensional representation of the signal. SW1PerS is compared to other algorithms using synthetic data and performance is quantified under varying noise models, noise levels, sampling densities, and signal shapes. Results on biological data are also analyzed and compared. On the task of periodic/not-periodic classification, using synthetic data, SW1PerS outperforms all other algorithms in the low-noise regime. SW1PerS is shown to be the most shape-agnostic of the evaluated methods, and the only one to consistently classify damped signals as highly periodic. On biological data, and for several experiments, the lists of top 10% genes ranked with SW1PerS recover up to 67% of those generated with other popular algorithms. 
Moreover, the list of genes from data on the Yeast metabolic cycle which are highly-ranked only by SW1PerS, contains evidently non-cosine patterns (e.g. ECM33, CDC9, SAM1,2 and MSH6) with highly periodic expression profiles. In data from the Yeast cell cycle SW1PerS identifies genes not preferred by other algorithms, hence not previously reported as periodic, but found in other experiments such as the universal growth rate response of Slavov. These genes are BOP3, CDC10, YIL108W, YER034W, MLP1, PAC2 and RTT101. In biological systems with low noise, i.e. where periodic signals with interesting shapes are more likely to occur, SW1PerS can be used as a powerful tool in exploratory analyses. Indeed, by having an initial set of periodic genes with a rich variety of signal types, pattern/shape information can be included in the study of systems and the generation of hypotheses regarding the structure of gene regulatory networks.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>26277424</pmid><doi>10.1186/s12859-015-0645-6</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC bioinformatics, 2015-08, Vol.16 (1), p.257, Article 257
issn 1471-2105
1471-2105
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4537550
source Publicly Available Content Database; PubMed
subjects Algorithms
Area Under Curve
Cell Division
Circadian Rhythm
Gene Expression Profiling - methods
Methodology
Oligonucleotide Array Sequence Analysis
ROC Curve
Saccharomyces cerevisiae - genetics
Saccharomyces cerevisiae - metabolism
title SW1PerS: Sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T16%3A29%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SW1PerS:%20Sliding%20windows%20and%201-persistence%20scoring;%20discovering%20periodicity%20in%20gene%20expression%20time%20series%20data&rft.jtitle=BMC%20bioinformatics&rft.au=Perea,%20Jose%20A&rft.date=2015-08-16&rft.volume=16&rft.issue=1&rft.spage=257&rft.pages=257-&rft.artnum=257&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-015-0645-6&rft_dat=%3Cgale_pubme%3EA541357915%3C/gale_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c594t-912f42e0fd7da57ace6384d2536a07a7c486df00ab8414c4f749a3c1af93e7203%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1780122273&rft_id=info:pmid/26277424&rft_galeid=A541357915&rfr_iscdi=true