Prediction of eukaryotic gene structures based on multilevel optimization

Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This...

Bibliographic Details
Published in: Chinese science bulletin 2004-02, Vol.49 (4), p.321-328
Main Authors: Zhou, Yanhong; Yang, Lei; Wang, Hui; Lu, Feng; Wan, Honghui
Format: Article
Language: English
container_end_page 328
container_issue 4
container_start_page 321
container_title Chinese science bulletin
container_volume 49
creator Zhou, Yanhong
Yang, Lei
Wang, Hui
Lu, Feng
Wan, Honghui
description Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays an important role in many kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in a eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels of decreasing complexity: the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models for sub-problems at lower complexity levels are first optimized individually and then optimally combined to form models for the sub-problems at higher complexity levels. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures in DNA sequences, and a new program, GeneKey (1.0), for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than those of the well-known program GENSCAN. A web server for GeneKey (1.0) is available at http://infosci.hust.edu.cn
doi_str_mv 10.1007/BF02900313
format article
fullrecord <record><control><sourceid>wanfang_jour_proqu</sourceid><recordid>TN_cdi_wanfang_journals_kxtb_e200404004</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>9318971</cqvip_id><wanfj_id>kxtb_e200404004</wanfj_id><sourcerecordid>kxtb_e200404004</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</originalsourceid><addsrcrecordid>eNpFkEtLAzEQgBdRsFYv_oIVD4KwmkmyeRy1WC0U9KDnsNlNavrYbZOsr19vSgtlDjMD3zz4suwS0B0gxO8fxwhLhAiQo2wAgkEhSwrHqUYIClYScZqdhTBPHQGOB9nkzZvG1dF1bd7Z3PSLyv920dX5zLQmD9H3dey9CbmugmnyhK36ZXRL82WWebeObuX-qu34eXZiq2UwF_s8zD7GT--jl2L6-jwZPUyLGhimRVmChjJ91nAhiWTYMiqYYVxaCtQgSjUVpBEN41g3FJcgiebGasRKTUCTYXaz2_tdtbZqZ2re9b5NF9XiJ2plMEI0BaKJvN6Ra99tehPiAcVcMEkBS5mo2x1V-y4Eb6xae7dKFhQgtZWqDlITfLWHP7t2tnHpvK7qhU0-lCQgJAfyDw18cl4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2786941299</pqid></control><display><type>article</type><title>Prediction of eukaryotic gene structures based on multilevel optimization</title><source>SpringerLink Journals</source><creator>Zhou, Yanhong ; Yang, Lei ; Wang, Hui ; Lu, Feng ; Wan, Honghui</creator><creatorcontrib>Zhou, Yanhong ; Yang, Lei ; Wang, Hui ; Lu, Feng ; Wan, Honghui</creatorcontrib><description>Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. 
The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. 
A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn</description><identifier>ISSN: 1001-6538</identifier><identifier>ISSN: 2095-9273</identifier><identifier>EISSN: 1861-9541</identifier><identifier>EISSN: 2095-9281</identifier><identifier>DOI: 10.1007/BF02900313</identifier><language>eng</language><publisher>Beijing: Springer Nature B.V</publisher><subject>Accuracy ; Algorithms ; Complexity ; Computer applications ; Decomposition ; Deoxyribonucleic acid ; DNA ; DNA structure ; Dynamic programming ; Gene sequencing ; Genes ; Genomes ; Multilevel ; Nucleotide sequence ; Nucleotides ; Optimization ; Predictions ; 基因工程 ; 真核基因 ; 结构预测 ; 脱氧核糖核酸 ; 蛋白质编码区</subject><ispartof>Chinese science bulletin, 2004-02, Vol.49 (4), p.321-328</ispartof><rights>Science in China Press 2004.</rights><rights>Copyright © Wanfang Data Co. Ltd. All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</citedby><cites>FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/86894X/86894X.jpg</thumbnail><link.rule.ids>314,776,780,1638,27903,27904</link.rule.ids></links><search><creatorcontrib>Zhou, Yanhong</creatorcontrib><creatorcontrib>Yang, Lei</creatorcontrib><creatorcontrib>Wang, Hui</creatorcontrib><creatorcontrib>Lu, Feng</creatorcontrib><creatorcontrib>Wan, Honghui</creatorcontrib><title>Prediction of eukaryotic gene structures based on multilevel optimization</title><title>Chinese science bulletin</title><addtitle>Chinese Science Bulletin</addtitle><description>Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in 
various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. 
A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Complexity</subject><subject>Computer applications</subject><subject>Decomposition</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA structure</subject><subject>Dynamic programming</subject><subject>Gene sequencing</subject><subject>Genes</subject><subject>Genomes</subject><subject>Multilevel</subject><subject>Nucleotide sequence</subject><subject>Nucleotides</subject><subject>Optimization</subject><subject>Predictions</subject><subject>基因工程</subject><subject>真核基因</subject><subject>结构预测</subject><subject>脱氧核糖核酸</subject><subject>蛋白质编码区</subject><issn>1001-6538</issn><issn>2095-9273</issn><issn>1861-9541</issn><issn>2095-9281</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNpFkEtLAzEQgBdRsFYv_oIVD4KwmkmyeRy1WC0U9KDnsNlNavrYbZOsr19vSgtlDjMD3zz4suwS0B0gxO8fxwhLhAiQo2wAgkEhSwrHqUYIClYScZqdhTBPHQGOB9nkzZvG1dF1bd7Z3PSLyv920dX5zLQmD9H3dey9CbmugmnyhK36ZXRL82WWebeObuX-qu34eXZiq2UwF_s8zD7GT--jl2L6-jwZPUyLGhimRVmChjJ91nAhiWTYMiqYYVxaCtQgSjUVpBEN41g3FJcgiebGasRKTUCTYXaz2_tdtbZqZ2re9b5NF9XiJ2plMEI0BaKJvN6Ra99tehPiAcVcMEkBS5mo2x1V-y4Eb6xae7dKFhQgtZWqDlITfLWHP7t2tnHpvK7qhU0-lCQgJAfyDw18cl4</recordid><startdate>200402</startdate><enddate>200402</enddate><creator>Zhou, Yanhong</creator><creator>Yang, Lei</creator><creator>Wang, Hui</creator><creator>Lu, Feng</creator><creator>Wan, Honghui</creator><general>Springer Nature B.V</general><general>School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China%School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China%School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China%Laboratory of Bioinformatics, Maryland Institute of 
Dynamic Genomics, Silver Spring, MD 20906, USA</general><general>School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W94</scope><scope>WU4</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>2B.</scope><scope>4A8</scope><scope>92I</scope><scope>93N</scope><scope>PSX</scope><scope>TCJ</scope></search><sort><creationdate>200402</creationdate><title>Prediction of eukaryotic gene structures based on multilevel optimization</title><author>Zhou, Yanhong ; Yang, Lei ; Wang, Hui ; Lu, Feng ; Wan, Honghui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Complexity</topic><topic>Computer applications</topic><topic>Decomposition</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA structure</topic><topic>Dynamic programming</topic><topic>Gene sequencing</topic><topic>Genes</topic><topic>Genomes</topic><topic>Multilevel</topic><topic>Nucleotide sequence</topic><topic>Nucleotides</topic><topic>Optimization</topic><topic>Predictions</topic><topic>基因工程</topic><topic>真核基因</topic><topic>结构预测</topic><topic>脱氧核糖核酸</topic><topic>蛋白质编码区</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Yanhong</creatorcontrib><creatorcontrib>Yang, Lei</creatorcontrib><creatorcontrib>Wang, Hui</creatorcontrib><creatorcontrib>Lu, Feng</creatorcontrib><creatorcontrib>Wan, Honghui</creatorcontrib><collection>维普_期刊</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>维普中文期刊数据库</collection><collection>中文科技期刊数据库-自然科学</collection><collection>中文科技期刊数据库-自然科学-生物科学</collection><collection>中文科技期刊数据库- 
镜像站点</collection><collection>CrossRef</collection><collection>Wanfang Data Journals - Hong Kong</collection><collection>WANFANG Data Centre</collection><collection>Wanfang Data Journals</collection><collection>万方数据期刊 - 香港版</collection><collection>China Online Journals (COJ)</collection><collection>China Online Journals (COJ)</collection><jtitle>Chinese science bulletin</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhou, Yanhong</au><au>Yang, Lei</au><au>Wang, Hui</au><au>Lu, Feng</au><au>Wan, Honghui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of eukaryotic gene structures based on multilevel optimization</atitle><jtitle>Chinese science bulletin</jtitle><addtitle>Chinese Science Bulletin</addtitle><date>2004-02</date><risdate>2004</risdate><volume>49</volume><issue>4</issue><spage>321</spage><epage>328</epage><pages>321-328</pages><issn>1001-6538</issn><issn>2095-9273</issn><eissn>1861-9541</eissn><eissn>2095-9281</eissn><abstract>Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). 
On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn</abstract><cop>Beijing</cop><pub>Springer Nature B.V</pub><doi>10.1007/BF02900313</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1001-6538
ispartof Chinese science bulletin, 2004-02, Vol.49 (4), p.321-328
issn 1001-6538
2095-9273
1861-9541
2095-9281
language eng
recordid cdi_wanfang_journals_kxtb_e200404004
source SpringerLink Journals
subjects Accuracy
Algorithms
Complexity
Computer applications
Decomposition
Deoxyribonucleic acid
DNA
DNA structure
Dynamic programming
Gene sequencing
Genes
Genomes
Multilevel
Nucleotide sequence
Nucleotides
Optimization
Predictions
基因工程
真核基因
结构预测
脱氧核糖核酸
蛋白质编码区
title Prediction of eukaryotic gene structures based on multilevel optimization
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T17%3A09%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wanfang_jour_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20eukaryotic%20gene%20structures%20based%20on%20multilevel%20optimization&rft.jtitle=Chinese%20science%20bulletin&rft.au=Zhou,%20Yanhong&rft.date=2004-02&rft.volume=49&rft.issue=4&rft.spage=321&rft.epage=328&rft.pages=321-328&rft.issn=1001-6538&rft.eissn=1861-9541&rft_id=info:doi/10.1007/BF02900313&rft_dat=%3Cwanfang_jour_proqu%3Ekxtb_e200404004%3C/wanfang_jour_proqu%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2786941299&rft_id=info:pmid/&rft_cqvip_id=9318971&rft_wanfj_id=kxtb_e200404004&rfr_iscdi=true