Loading…

Vernal: a tool for mining fuzzy network motifs in RNA

Abstract Motivation RNA 3D motifs are recurrent substructures, modeled as networks of base pair interactions, which are crucial for understanding structure–function relationships. The task of automatically identifying such motifs is computationally hard, and remains a key challenge in the field of R...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics 2022-01, Vol.38 (4), p.970-976
Main Authors: Oliver, Carlos, Mallet, Vincent, Philippopoulos, Pericles, Hamilton, William L, Waldispühl, Jérôme
Format: Article
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Motivation RNA 3D motifs are recurrent substructures, modeled as networks of base pair interactions, which are crucial for understanding structure–function relationships. The task of automatically identifying such motifs is computationally hard, and remains a key challenge in the field of RNA structural biology and network analysis. State-of-the-art methods solve special cases of the motif problem by constraining the structural variability in occurrences of a motif, and narrowing the substructure search space. Results Here, we relax these constraints by posing the motif finding problem as a graph representation learning and clustering task. This framing takes advantage of the continuous nature of graph representations to model the flexibility and variability of RNA motifs in an efficient manner. We propose a set of node similarity functions, clustering methods and motif construction algorithms to recover flexible RNA motifs. Our tool, Vernal can be easily customized by users to desired levels of motif flexibility, abundance and size. We show that Vernal is able to retrieve and expand known classes of motifs, as well as to propose novel motifs. Availability and implementation The source code, data and a webserver are available at vernal.cs.mcgill.ca. We also provide a flexible interface and a user-friendly webserver to browse and download our results. Supplementary information Supplementary data are available at Bioinformatics online.
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/btab768