Loading…

Segmentation of DNA using simple recurrent neural network

We report the discovery of strong correlations between protein coding regions and the prediction errors when using the simple recurrent network to segment genome sequences. We are going to use SARS genome to demonstrate how we conduct training and derive corresponding results. The distribution of pr...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2012-02, Vol.26, p.271-280
Main Authors: Cheng, Wei-Chen, Huang, Jau-Chi, Liou, Cheng-Yuan
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We report the discovery of strong correlations between protein coding regions and the prediction errors when using the simple recurrent network to segment genome sequences. We are going to use SARS genome to demonstrate how we conduct training and derive corresponding results. The distribution of prediction error indicates how the underlying hidden regularity of the genome sequences and the results are consistent with the finding of biologists: predicated protein coding features of SARS genome. This implies that the simple recurrent network is capable of providing new features for further biological studies when applied on genome studies. The HA gene of influenza A subtype H1N1 is also analyzed in a similar way.
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2011.09.001