Loading…
miPIE: NGS-based Prediction of miRNA Using Integrated Evidence
Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-ba...
Saved in:
Published in: | Scientific reports 2019-02, Vol.9 (1), p.1548-1548, Article 1548 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143 |
---|---|
cites | cdi_FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143 |
container_end_page | 1548 |
container_issue | 1 |
container_start_page | 1548 |
container_title | Scientific reports |
container_volume | 9 |
creator | Peace, R. J. Sheikh Hassani, M. Green, J. R. |
description | Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone. |
doi_str_mv | 10.1038/s41598-018-38107-z |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6367335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2176710321</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143</originalsourceid><addsrcrecordid>eNp9kUtLxDAUhYMojoz-ARdScOOmmlfbxIUgMo4DMoqPdUjT2zEyTTXpCM6vN9rxuTCbBO53z70nB6Fdgg8JZuIocJJJkWIiUiYILtLlGtqimGcpZZSu_3gP0E4IjziejEpO5CYaMFwwxvNiC5009noyOk6m49u01AGq5NpDZU1nW5e0ddLYm-lpch-smyUT18HM6y5CoxdbgTOwjTZqPQ-ws7qH6P58dHd2kV5ejSdnp5ep4QXvUgESuDacFQQM1UJiwbGsjCG1FExkRJR5kRc1Y3WFWckhy8oi1zrnZTRIOBuik173aVE2UBlwnddz9eRto_2rarVVvyvOPqhZ-6JylkerWRQ4WAn49nkBoVONDQbmc-2gXQRFKZVYsh7d_4M-tgvvoj1FSdwyfj8lkaI9ZXwbgof6axmC1XtCqk9IRQPqIyG1jE17P218tXzmEQHWAyGW3Az89-x_ZN8AsKmaHA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2176710321</pqid></control><display><type>article</type><title>miPIE: NGS-based Prediction of miRNA Using Integrated Evidence</title><source>PMC (PubMed Central)</source><source>Publicly Available Content (ProQuest)</source><source>Free Full-Text Journals in Chemistry</source><source>Springer Nature - nature.com Journals - Fully Open Access</source><creator>Peace, R. J. ; Sheikh Hassani, M. ; Green, J. R.</creator><creatorcontrib>Peace, R. J. ; Sheikh Hassani, M. ; Green, J. R.</creatorcontrib><description>Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-018-38107-z</identifier><identifier>PMID: 30733467</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>45 ; 45/91 ; 631/114/1305 ; 631/114/794 ; Biosynthesis ; Gene expression ; Humanities and Social Sciences ; MicroRNAs ; miRNA ; multidisciplinary ; Next-generation sequencing ; Protein structure ; Science ; Science (multidisciplinary) ; Secondary structure ; Species diversity ; Transcription</subject><ispartof>Scientific reports, 2019-02, Vol.9 (1), p.1548-1548, Article 1548</ispartof><rights>The Author(s) 2019</rights><rights>This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143</citedby><cites>FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143</cites><orcidid>0000-0002-6039-2355</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2176710321/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2176710321?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25731,27901,27902,36989,36990,44566,53766,53768,74869</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30733467$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Peace, R. J.</creatorcontrib><creatorcontrib>Sheikh Hassani, M.</creatorcontrib><creatorcontrib>Green, J. R.</creatorcontrib><title>miPIE: NGS-based Prediction of miRNA Using Integrated Evidence</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone.</description><subject>45</subject><subject>45/91</subject><subject>631/114/1305</subject><subject>631/114/794</subject><subject>Biosynthesis</subject><subject>Gene expression</subject><subject>Humanities and Social Sciences</subject><subject>MicroRNAs</subject><subject>miRNA</subject><subject>multidisciplinary</subject><subject>Next-generation sequencing</subject><subject>Protein structure</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Secondary structure</subject><subject>Species diversity</subject><subject>Transcription</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNp9kUtLxDAUhYMojoz-ARdScOOmmlfbxIUgMo4DMoqPdUjT2zEyTTXpCM6vN9rxuTCbBO53z70nB6Fdgg8JZuIocJJJkWIiUiYILtLlGtqimGcpZZSu_3gP0E4IjziejEpO5CYaMFwwxvNiC5009noyOk6m49u01AGq5NpDZU1nW5e0ddLYm-lpch-smyUT18HM6y5CoxdbgTOwjTZqPQ-ws7qH6P58dHd2kV5ejSdnp5ep4QXvUgESuDacFQQM1UJiwbGsjCG1FExkRJR5kRc1Y3WFWckhy8oi1zrnZTRIOBuik173aVE2UBlwnddz9eRto_2rarVVvyvOPqhZ-6JylkerWRQ4WAn49nkBoVONDQbmc-2gXQRFKZVYsh7d_4M-tgvvoj1FSdwyfj8lkaI9ZXwbgof6axmC1XtCqk9IRQPqIyG1jE17P218tXzmEQHWAyGW3Az89-x_ZN8AsKmaHA</recordid><startdate>20190207</startdate><enddate>20190207</enddate><creator>Peace, R. J.</creator><creator>Sheikh Hassani, M.</creator><creator>Green, J. R.</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-6039-2355</orcidid></search><sort><creationdate>20190207</creationdate><title>miPIE: NGS-based Prediction of miRNA Using Integrated Evidence</title><author>Peace, R. J. ; Sheikh Hassani, M. ; Green, J. R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>45</topic><topic>45/91</topic><topic>631/114/1305</topic><topic>631/114/794</topic><topic>Biosynthesis</topic><topic>Gene expression</topic><topic>Humanities and Social Sciences</topic><topic>MicroRNAs</topic><topic>miRNA</topic><topic>multidisciplinary</topic><topic>Next-generation sequencing</topic><topic>Protein structure</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Secondary structure</topic><topic>Species diversity</topic><topic>Transcription</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Peace, R. J.</creatorcontrib><creatorcontrib>Sheikh Hassani, M.</creatorcontrib><creatorcontrib>Green, J. R.</creatorcontrib><collection>Springer Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Complete (ProQuest)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>Science Database (ProQuest)</collection><collection>Biological Science Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Peace, R. J.</au><au>Sheikh Hassani, M.</au><au>Green, J. R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>miPIE: NGS-based Prediction of miRNA Using Integrated Evidence</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2019-02-07</date><risdate>2019</risdate><volume>9</volume><issue>1</issue><spage>1548</spage><epage>1548</epage><pages>1548-1548</pages><artnum>1548</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>Methods for the de novo identification of microRNA (miRNA) have been developed using a range of sequence-based features. With the increasing availability of next generation sequencing (NGS) transcriptome data, there is a need for miRNA identification that integrates both NGS transcript expression-based patterns as well as advanced genomic sequence-based methods. While miRDeep2 does examine the predicted secondary structure of putative miRNA sequences, it does not leverage many of the sequence-based features used in state-of-the-art de novo methods. Meanwhile, other NGS-based methods, such as miRanalyzer, place an emphasis on sequence-based features without leveraging advanced expression-based features reflecting miRNA biosynthesis. This represents an opportunity to combine the strengths of NGS-based analysis with recent advances in de novo sequence-based miRNA prediction. We here develop a method, microRNA Prediction using Integrated Evidence (miPIE), which integrates both expression-based and sequence-based features to achieve significantly improved miRNA prediction performance. Feature selection identifies the 20 most discriminative features, 3 of which reflect strictly expression-based information. Evaluation using precision-recall curves, for six NGS data sets representing six diverse species, demonstrates substantial improvements in prediction performance compared to three methods: miRDeep2, miRanalyzer, and mirnovo. The individual contributions of expression-based and sequence-based features are also examined and we demonstrate that their combination is more effective than either alone.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>30733467</pmid><doi>10.1038/s41598-018-38107-z</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-6039-2355</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2045-2322 |
ispartof | Scientific reports, 2019-02, Vol.9 (1), p.1548-1548, Article 1548 |
issn | 2045-2322 2045-2322 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6367335 |
source | PMC (PubMed Central); Publicly Available Content (ProQuest); Free Full-Text Journals in Chemistry; Springer Nature - nature.com Journals - Fully Open Access |
subjects | 45 45/91 631/114/1305 631/114/794 Biosynthesis Gene expression Humanities and Social Sciences MicroRNAs miRNA multidisciplinary Next-generation sequencing Protein structure Science Science (multidisciplinary) Secondary structure Species diversity Transcription |
title | miPIE: NGS-based Prediction of miRNA Using Integrated Evidence |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T08%3A04%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=miPIE:%20NGS-based%20Prediction%20of%20miRNA%20Using%20Integrated%20Evidence&rft.jtitle=Scientific%20reports&rft.au=Peace,%20R.%20J.&rft.date=2019-02-07&rft.volume=9&rft.issue=1&rft.spage=1548&rft.epage=1548&rft.pages=1548-1548&rft.artnum=1548&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-018-38107-z&rft_dat=%3Cproquest_pubme%3E2176710321%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c474t-8e9e4ac4371ec2a8908409dcc1f9838518b6767f33fd03b4e55b76aa64b018143%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2176710321&rft_id=info:pmid/30733467&rfr_iscdi=true |