Loading…
Prediction of eukaryotic gene structures based on multilevel optimization
Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This...
Saved in:
Published in: | Chinese science bulletin 2004-02, Vol.49 (4), p.321-328 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3 |
---|---|
cites | cdi_FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3 |
container_end_page | 328 |
container_issue | 4 |
container_start_page | 321 |
container_title | Chinese science bulletin |
container_volume | 49 |
creator | Zhou, Yanhong Yang, Lei Wang, Hui Lu, Feng Wan, Honghui |
description | Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn |
doi_str_mv | 10.1007/BF02900313 |
format | article |
fullrecord | <record><control><sourceid>wanfang_jour_proqu</sourceid><recordid>TN_cdi_wanfang_journals_kxtb_e200404004</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>9318971</cqvip_id><wanfj_id>kxtb_e200404004</wanfj_id><sourcerecordid>kxtb_e200404004</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</originalsourceid><addsrcrecordid>eNpFkEtLAzEQgBdRsFYv_oIVD4KwmkmyeRy1WC0U9KDnsNlNavrYbZOsr19vSgtlDjMD3zz4suwS0B0gxO8fxwhLhAiQo2wAgkEhSwrHqUYIClYScZqdhTBPHQGOB9nkzZvG1dF1bd7Z3PSLyv920dX5zLQmD9H3dey9CbmugmnyhK36ZXRL82WWebeObuX-qu34eXZiq2UwF_s8zD7GT--jl2L6-jwZPUyLGhimRVmChjJ91nAhiWTYMiqYYVxaCtQgSjUVpBEN41g3FJcgiebGasRKTUCTYXaz2_tdtbZqZ2re9b5NF9XiJ2plMEI0BaKJvN6Ra99tehPiAcVcMEkBS5mo2x1V-y4Eb6xae7dKFhQgtZWqDlITfLWHP7t2tnHpvK7qhU0-lCQgJAfyDw18cl4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2786941299</pqid></control><display><type>article</type><title>Prediction of eukaryotic gene structures based on multilevel optimization</title><source>SpringerLink Journals</source><creator>Zhou, Yanhong ; Yang, Lei ; Wang, Hui ; Lu, Feng ; Wan, Honghui</creator><creatorcontrib>Zhou, Yanhong ; Yang, Lei ; Wang, Hui ; Lu, Feng ; Wan, Honghui</creatorcontrib><description>Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn</description><identifier>ISSN: 1001-6538</identifier><identifier>ISSN: 2095-9273</identifier><identifier>EISSN: 1861-9541</identifier><identifier>EISSN: 2095-9281</identifier><identifier>DOI: 10.1007/BF02900313</identifier><language>eng</language><publisher>Beijing: Springer Nature B.V</publisher><subject>Accuracy ; Algorithms ; Complexity ; Computer applications ; Decomposition ; Deoxyribonucleic acid ; DNA ; DNA structure ; Dynamic programming ; Gene sequencing ; Genes ; Genomes ; Multilevel ; Nucleotide sequence ; Nucleotides ; Optimization ; Predictions ; 基因工程 ; 真核基因 ; 结构预测 ; 脱氧核糖核酸 ; 蛋白质编码区</subject><ispartof>Chinese science bulletin, 2004-02, Vol.49 (4), p.321-328</ispartof><rights>Science in China Press 2004.</rights><rights>Copyright © Wanfang Data Co. Ltd. All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</citedby><cites>FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/86894X/86894X.jpg</thumbnail><link.rule.ids>314,776,780,1638,27903,27904</link.rule.ids></links><search><creatorcontrib>Zhou, Yanhong</creatorcontrib><creatorcontrib>Yang, Lei</creatorcontrib><creatorcontrib>Wang, Hui</creatorcontrib><creatorcontrib>Lu, Feng</creatorcontrib><creatorcontrib>Wan, Honghui</creatorcontrib><title>Prediction of eukaryotic gene structures based on multilevel optimization</title><title>Chinese science bulletin</title><addtitle>Chinese Science Bulletin</addtitle><description>Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Complexity</subject><subject>Computer applications</subject><subject>Decomposition</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA structure</subject><subject>Dynamic programming</subject><subject>Gene sequencing</subject><subject>Genes</subject><subject>Genomes</subject><subject>Multilevel</subject><subject>Nucleotide sequence</subject><subject>Nucleotides</subject><subject>Optimization</subject><subject>Predictions</subject><subject>基因工程</subject><subject>真核基因</subject><subject>结构预测</subject><subject>脱氧核糖核酸</subject><subject>蛋白质编码区</subject><issn>1001-6538</issn><issn>2095-9273</issn><issn>1861-9541</issn><issn>2095-9281</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNpFkEtLAzEQgBdRsFYv_oIVD4KwmkmyeRy1WC0U9KDnsNlNavrYbZOsr19vSgtlDjMD3zz4suwS0B0gxO8fxwhLhAiQo2wAgkEhSwrHqUYIClYScZqdhTBPHQGOB9nkzZvG1dF1bd7Z3PSLyv920dX5zLQmD9H3dey9CbmugmnyhK36ZXRL82WWebeObuX-qu34eXZiq2UwF_s8zD7GT--jl2L6-jwZPUyLGhimRVmChjJ91nAhiWTYMiqYYVxaCtQgSjUVpBEN41g3FJcgiebGasRKTUCTYXaz2_tdtbZqZ2re9b5NF9XiJ2plMEI0BaKJvN6Ra99tehPiAcVcMEkBS5mo2x1V-y4Eb6xae7dKFhQgtZWqDlITfLWHP7t2tnHpvK7qhU0-lCQgJAfyDw18cl4</recordid><startdate>200402</startdate><enddate>200402</enddate><creator>Zhou, Yanhong</creator><creator>Yang, Lei</creator><creator>Wang, Hui</creator><creator>Lu, Feng</creator><creator>Wan, Honghui</creator><general>Springer Nature B.V</general><general>School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China%School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China%School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China%Laboratory of Bioinformatics, Maryland Institute of Dynamic Genomics, Silver Spring, MD 20906, USA</general><general>School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W94</scope><scope>WU4</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>2B.</scope><scope>4A8</scope><scope>92I</scope><scope>93N</scope><scope>PSX</scope><scope>TCJ</scope></search><sort><creationdate>200402</creationdate><title>Prediction of eukaryotic gene structures based on multilevel optimization</title><author>Zhou, Yanhong ; Yang, Lei ; Wang, Hui ; Lu, Feng ; Wan, Honghui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Complexity</topic><topic>Computer applications</topic><topic>Decomposition</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA structure</topic><topic>Dynamic programming</topic><topic>Gene sequencing</topic><topic>Genes</topic><topic>Genomes</topic><topic>Multilevel</topic><topic>Nucleotide sequence</topic><topic>Nucleotides</topic><topic>Optimization</topic><topic>Predictions</topic><topic>基因工程</topic><topic>真核基因</topic><topic>结构预测</topic><topic>脱氧核糖核酸</topic><topic>蛋白质编码区</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Yanhong</creatorcontrib><creatorcontrib>Yang, Lei</creatorcontrib><creatorcontrib>Wang, Hui</creatorcontrib><creatorcontrib>Lu, Feng</creatorcontrib><creatorcontrib>Wan, Honghui</creatorcontrib><collection>维普_期刊</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>维普中文期刊数据库</collection><collection>中文科技期刊数据库-自然科学</collection><collection>中文科技期刊数据库-自然科学-生物科学</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>Wanfang Data Journals - Hong Kong</collection><collection>WANFANG Data Centre</collection><collection>Wanfang Data Journals</collection><collection>万方数据期刊 - 香港版</collection><collection>China Online Journals (COJ)</collection><collection>China Online Journals (COJ)</collection><jtitle>Chinese science bulletin</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhou, Yanhong</au><au>Yang, Lei</au><au>Wang, Hui</au><au>Lu, Feng</au><au>Wan, Honghui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of eukaryotic gene structures based on multilevel optimization</atitle><jtitle>Chinese science bulletin</jtitle><addtitle>Chinese Science Bulletin</addtitle><date>2004-02</date><risdate>2004</risdate><volume>49</volume><issue>4</issue><spage>321</spage><epage>328</epage><pages>321-328</pages><issn>1001-6538</issn><issn>2095-9273</issn><eissn>1861-9541</eissn><eissn>2095-9281</eissn><abstract>Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays a very important role in various kinds of genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method of predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structure in eukaryotic DNA sequence containing multiple genes can be decomposed into a series of sub-problems at several levels with decreasing complexity, including the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for the prediction of complex gene structures is created by a multilevel optimization process, in which the models dealing with sub-problems at low complexity level are first optimized respectively, and then optimally combined together to form models for those sub-problems at higher complexity level. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures from DNA sequences, and a new program GeneKey (1.0) for the prediction of eukaryotic gene structures is developed. Testing results with widely used datasets demonstrate that the prediction accuracies of GeneKey (1.0) at the nucleotide level, exon level and gene level are all higher than that of the well known program GENSCAN. A web server of GeneKey(1.0) is available at http://infosci.hust.edu.cn</abstract><cop>Beijing</cop><pub>Springer Nature B.V</pub><doi>10.1007/BF02900313</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1001-6538 |
ispartof | Chinese science bulletin, 2004-02, Vol.49 (4), p.321-328 |
issn | 1001-6538 2095-9273 1861-9541 2095-9281 |
language | eng |
recordid | cdi_wanfang_journals_kxtb_e200404004 |
source | SpringerLink Journals |
subjects | Accuracy Algorithms Complexity Computer applications Decomposition Deoxyribonucleic acid DNA DNA structure Dynamic programming Gene sequencing Genes Genomes Multilevel Nucleotide sequence Nucleotides Optimization Predictions 基因工程 真核基因 结构预测 脱氧核糖核酸 蛋白质编码区 |
title | Prediction of eukaryotic gene structures based on multilevel optimization |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T17%3A09%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wanfang_jour_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20eukaryotic%20gene%20structures%20based%20on%20multilevel%20optimization&rft.jtitle=Chinese%20science%20bulletin&rft.au=Zhou,%20Yanhong&rft.date=2004-02&rft.volume=49&rft.issue=4&rft.spage=321&rft.epage=328&rft.pages=321-328&rft.issn=1001-6538&rft.eissn=1861-9541&rft_id=info:doi/10.1007/BF02900313&rft_dat=%3Cwanfang_jour_proqu%3Ekxtb_e200404004%3C/wanfang_jour_proqu%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1624-551b15861d7893962f6486e679f414e044b483d8d672bd425193b7efb065b31b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2786941299&rft_id=info:pmid/&rft_cqvip_id=9318971&rft_wanfj_id=kxtb_e200404004&rfr_iscdi=true |