Loading…

circGPA: circRNA functional annotation based on probability-generating functions

Recent research has already shown that circular RNAs (circRNAs) are functional in gene expression regulation and potentially related to diseases. Due to their stability, circRNAs can also be used as biomarkers for diagnosis. However, the function of most circRNAs remains unknown, and it is expensive...

Full description

Saved in:
Bibliographic Details
Published in:BMC bioinformatics 2022-09, Vol.23 (1), p.1-392, Article 392
Main Authors: RyÅ¡avý, Petr, Kléma, JiÅí, Merkerová, Michaela Dostálová
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83
cites cdi_FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83
container_end_page 392
container_issue 1
container_start_page 1
container_title BMC bioinformatics
container_volume 23
creator Ryšavý, Petr
Kléma, JiÅí
Merkerová, Michaela Dostálová
description Recent research has already shown that circular RNAs (circRNAs) are functional in gene expression regulation and potentially related to diseases. Due to their stability, circRNAs can also be used as biomarkers for diagnosis. However, the function of most circRNAs remains unknown, and it is expensive and time-consuming to discover it through biological experiments. In this paper, we predict circRNA annotations from the knowledge of their interaction with miRNAs and subsequent miRNA-mRNA interactions. First, we construct an interaction network for a target circRNA and secondly spread the information from the network nodes with the known function to the root circRNA node. This idea itself is not new; our main contribution lies in proposing an efficient and exact deterministic procedure based on the principle of probability-generating functions to calculate the p-value of association test between a circRNA and an annotation term. We show that our publicly available algorithm is both more effective and efficient than the commonly used Monte-Carlo sampling approach that may suffer from difficult quantification of sampling convergence and subsequent sampling inefficiency. We experimentally demonstrate that the new approach is two orders of magnitude faster than the Monte-Carlo sampling, which makes summary annotation of large circRNA files feasible; this includes their reannotation after periodical interaction network updates, for example. We provide a summary annotation of a current circRNA database as one of our outputs. The proposed algorithm could be generalized towards other types of RNA in way that is straightforward.
doi_str_mv 10.1186/s12859-022-04957-8
format article
fullrecord <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_6f075229ca2e46ca9f2e757d3995531c</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A719965770</galeid><doaj_id>oai_doaj_org_article_6f075229ca2e46ca9f2e757d3995531c</doaj_id><sourcerecordid>A719965770</sourcerecordid><originalsourceid>FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83</originalsourceid><addsrcrecordid>eNptkk1v1DAQhiMEEqXwBzhF4gKHFH_Esc0BaVVBWamCqsDZmvgjeJW1Fzup6L_H6VaFIOSDxzPPvNaM3qp6idEZxqJ7mzERTDaIkAa1kvFGPKpOcMtxQzBij_-Kn1bPct4hhLlA7KS60j7pi6vNu3oJrj9vajcHPfkYYKwhhDjB8qh7yNbUJTik2EPvRz_dNoMNNpV6GB668vPqiYMx2xf392n1_eOHb-efmssvF9vzzWWjGWqnhrYdbx21FpteGElbwIJbJJxAPe0Elb2mziDH-64M5AB6IpCBQhtA3Ap6Wm2PuibCTh2S30O6VRG8ukvENChIk9ejVZ1DnBEiNRDbdhqkI5YzbqiUjFGsi9b7o9Zh7vfWaBumBONKdF0J_oca4o2SDFMhWBF4fS-Q4s_Z5kntfdZ2HCHYOGdFOBayQ0Tggr76B93FOZVtLxRhEnEp5B9qgDKADy6Wf_UiqjYcS9kxzlGhzv5DlWPs3usYrPMlv2p4s2oozGR_TQPMOavt1-s1S46sTjHnZN3DPjBSi-nU0XSqmE7dmU4J-hu4k8iO</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2725907989</pqid></control><display><type>article</type><title>circGPA: circRNA functional annotation based on probability-generating functions</title><source>Publicly Available Content Database</source><source>PubMed Central</source><creator>RyÅ¡avý, Petr ; Kléma, JiÅí ; Merkerová, Michaela Dostálová</creator><creatorcontrib>RyÅ¡avý, Petr ; Kléma, JiÅí ; Merkerová, Michaela Dostálová</creatorcontrib><description>Recent research has already shown that circular RNAs (circRNAs) are functional in gene expression regulation and potentially related to diseases. Due to their stability, circRNAs can also be used as biomarkers for diagnosis. However, the function of most circRNAs remains unknown, and it is expensive and time-consuming to discover it through biological experiments. In this paper, we predict circRNA annotations from the knowledge of their interaction with miRNAs and subsequent miRNA-mRNA interactions. First, we construct an interaction network for a target circRNA and secondly spread the information from the network nodes with the known function to the root circRNA node. This idea itself is not new; our main contribution lies in proposing an efficient and exact deterministic procedure based on the principle of probability-generating functions to calculate the p-value of association test between a circRNA and an annotation term. We show that our publicly available algorithm is both more effective and efficient than the commonly used Monte-Carlo sampling approach that may suffer from difficult quantification of sampling convergence and subsequent sampling inefficiency. We experimentally demonstrate that the new approach is two orders of magnitude faster than the Monte-Carlo sampling, which makes summary annotation of large circRNA files feasible; this includes their reannotation after periodical interaction network updates, for example. We provide a summary annotation of a current circRNA database as one of our outputs. The proposed algorithm could be generalized towards other types of RNA in way that is straightforward.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-022-04957-8</identifier><language>eng</language><publisher>London: BioMed Central Ltd</publisher><subject>Algorithms ; Analysis ; Annotation term ; Annotations ; Biomarkers ; Circular RNA ; Combinatorial probabilities ; Functionals ; Gene expression ; Gene regulation ; Geometric probabilities ; Health aspects ; Identification and classification ; Interaction network ; Interactomes ; MicroRNAs ; miRNA ; Monte Carlo simulation ; Neighborhoods ; Ontology ; Probabilities ; Random variables ; RNA ; Sampling</subject><ispartof>BMC bioinformatics, 2022-09, Vol.23 (1), p.1-392, Article 392</ispartof><rights>COPYRIGHT 2022 BioMed Central Ltd.</rights><rights>2022. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>The Author(s) 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83</citedby><cites>FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9513885/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2725907989?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793</link.rule.ids></links><search><creatorcontrib>RyÅ¡avý, Petr</creatorcontrib><creatorcontrib>Kléma, JiÅí</creatorcontrib><creatorcontrib>Merkerová, Michaela Dostálová</creatorcontrib><title>circGPA: circRNA functional annotation based on probability-generating functions</title><title>BMC bioinformatics</title><description>Recent research has already shown that circular RNAs (circRNAs) are functional in gene expression regulation and potentially related to diseases. Due to their stability, circRNAs can also be used as biomarkers for diagnosis. However, the function of most circRNAs remains unknown, and it is expensive and time-consuming to discover it through biological experiments. In this paper, we predict circRNA annotations from the knowledge of their interaction with miRNAs and subsequent miRNA-mRNA interactions. First, we construct an interaction network for a target circRNA and secondly spread the information from the network nodes with the known function to the root circRNA node. This idea itself is not new; our main contribution lies in proposing an efficient and exact deterministic procedure based on the principle of probability-generating functions to calculate the p-value of association test between a circRNA and an annotation term. We show that our publicly available algorithm is both more effective and efficient than the commonly used Monte-Carlo sampling approach that may suffer from difficult quantification of sampling convergence and subsequent sampling inefficiency. We experimentally demonstrate that the new approach is two orders of magnitude faster than the Monte-Carlo sampling, which makes summary annotation of large circRNA files feasible; this includes their reannotation after periodical interaction network updates, for example. We provide a summary annotation of a current circRNA database as one of our outputs. The proposed algorithm could be generalized towards other types of RNA in way that is straightforward.</description><subject>Algorithms</subject><subject>Analysis</subject><subject>Annotation term</subject><subject>Annotations</subject><subject>Biomarkers</subject><subject>Circular RNA</subject><subject>Combinatorial probabilities</subject><subject>Functionals</subject><subject>Gene expression</subject><subject>Gene regulation</subject><subject>Geometric probabilities</subject><subject>Health aspects</subject><subject>Identification and classification</subject><subject>Interaction network</subject><subject>Interactomes</subject><subject>MicroRNAs</subject><subject>miRNA</subject><subject>Monte Carlo simulation</subject><subject>Neighborhoods</subject><subject>Ontology</subject><subject>Probabilities</subject><subject>Random variables</subject><subject>RNA</subject><subject>Sampling</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNptkk1v1DAQhiMEEqXwBzhF4gKHFH_Esc0BaVVBWamCqsDZmvgjeJW1Fzup6L_H6VaFIOSDxzPPvNaM3qp6idEZxqJ7mzERTDaIkAa1kvFGPKpOcMtxQzBij_-Kn1bPct4hhLlA7KS60j7pi6vNu3oJrj9vajcHPfkYYKwhhDjB8qh7yNbUJTik2EPvRz_dNoMNNpV6GB668vPqiYMx2xf392n1_eOHb-efmssvF9vzzWWjGWqnhrYdbx21FpteGElbwIJbJJxAPe0Elb2mziDH-64M5AB6IpCBQhtA3Ap6Wm2PuibCTh2S30O6VRG8ukvENChIk9ejVZ1DnBEiNRDbdhqkI5YzbqiUjFGsi9b7o9Zh7vfWaBumBONKdF0J_oca4o2SDFMhWBF4fS-Q4s_Z5kntfdZ2HCHYOGdFOBayQ0Tggr76B93FOZVtLxRhEnEp5B9qgDKADy6Wf_UiqjYcS9kxzlGhzv5DlWPs3usYrPMlv2p4s2oozGR_TQPMOavt1-s1S46sTjHnZN3DPjBSi-nU0XSqmE7dmU4J-hu4k8iO</recordid><startdate>20220927</startdate><enddate>20220927</enddate><creator>RyÅ¡avý, Petr</creator><creator>Kléma, JiÅí</creator><creator>Merkerová, Michaela Dostálová</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><general>BMC</general><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20220927</creationdate><title>circGPA: circRNA functional annotation based on probability-generating functions</title><author>RyÅ¡avý, Petr ; Kléma, JiÅí ; Merkerová, Michaela Dostálová</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Analysis</topic><topic>Annotation term</topic><topic>Annotations</topic><topic>Biomarkers</topic><topic>Circular RNA</topic><topic>Combinatorial probabilities</topic><topic>Functionals</topic><topic>Gene expression</topic><topic>Gene regulation</topic><topic>Geometric probabilities</topic><topic>Health aspects</topic><topic>Identification and classification</topic><topic>Interaction network</topic><topic>Interactomes</topic><topic>MicroRNAs</topic><topic>miRNA</topic><topic>Monte Carlo simulation</topic><topic>Neighborhoods</topic><topic>Ontology</topic><topic>Probabilities</topic><topic>Random variables</topic><topic>RNA</topic><topic>Sampling</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>RyÅ¡avý, Petr</creatorcontrib><creatorcontrib>Kléma, JiÅí</creatorcontrib><creatorcontrib>Merkerová, Michaela Dostálová</creatorcontrib><collection>CrossRef</collection><collection>Science (Gale in Context)</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Proquest Health and Medical Complete</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Biological Sciences</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>RyÅ¡avý, Petr</au><au>Kléma, JiÅí</au><au>Merkerová, Michaela Dostálová</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>circGPA: circRNA functional annotation based on probability-generating functions</atitle><jtitle>BMC bioinformatics</jtitle><date>2022-09-27</date><risdate>2022</risdate><volume>23</volume><issue>1</issue><spage>1</spage><epage>392</epage><pages>1-392</pages><artnum>392</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Recent research has already shown that circular RNAs (circRNAs) are functional in gene expression regulation and potentially related to diseases. Due to their stability, circRNAs can also be used as biomarkers for diagnosis. However, the function of most circRNAs remains unknown, and it is expensive and time-consuming to discover it through biological experiments. In this paper, we predict circRNA annotations from the knowledge of their interaction with miRNAs and subsequent miRNA-mRNA interactions. First, we construct an interaction network for a target circRNA and secondly spread the information from the network nodes with the known function to the root circRNA node. This idea itself is not new; our main contribution lies in proposing an efficient and exact deterministic procedure based on the principle of probability-generating functions to calculate the p-value of association test between a circRNA and an annotation term. We show that our publicly available algorithm is both more effective and efficient than the commonly used Monte-Carlo sampling approach that may suffer from difficult quantification of sampling convergence and subsequent sampling inefficiency. We experimentally demonstrate that the new approach is two orders of magnitude faster than the Monte-Carlo sampling, which makes summary annotation of large circRNA files feasible; this includes their reannotation after periodical interaction network updates, for example. We provide a summary annotation of a current circRNA database as one of our outputs. The proposed algorithm could be generalized towards other types of RNA in way that is straightforward.</abstract><cop>London</cop><pub>BioMed Central Ltd</pub><doi>10.1186/s12859-022-04957-8</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC bioinformatics, 2022-09, Vol.23 (1), p.1-392, Article 392
issn 1471-2105
1471-2105
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_6f075229ca2e46ca9f2e757d3995531c
source Publicly Available Content Database; PubMed Central
subjects Algorithms
Analysis
Annotation term
Annotations
Biomarkers
Circular RNA
Combinatorial probabilities
Functionals
Gene expression
Gene regulation
Geometric probabilities
Health aspects
Identification and classification
Interaction network
Interactomes
MicroRNAs
miRNA
Monte Carlo simulation
Neighborhoods
Ontology
Probabilities
Random variables
RNA
Sampling
title circGPA: circRNA functional annotation based on probability-generating functions
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T19%3A00%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=circGPA:%20circRNA%20functional%20annotation%20based%20on%20probability-generating%20functions&rft.jtitle=BMC%20bioinformatics&rft.au=Ry%C3%85%C2%A1av%C3%83%C2%BD,%20Petr&rft.date=2022-09-27&rft.volume=23&rft.issue=1&rft.spage=1&rft.epage=392&rft.pages=1-392&rft.artnum=392&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-022-04957-8&rft_dat=%3Cgale_doaj_%3EA719965770%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c504t-34674f3ee1db8d934a187e08f80b36839bc3fd0f7b6049faab280da1dbda07e83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2725907989&rft_id=info:pmid/&rft_galeid=A719965770&rfr_iscdi=true