Loading…
Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks,...
Saved in:
Published in: | BMC bioinformatics 2010-06, Vol.11 (1), p.325-325, Article 325 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253 |
---|---|
cites | cdi_FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253 |
container_end_page | 325 |
container_issue | 1 |
container_start_page | 325 |
container_title | BMC bioinformatics |
container_volume | 11 |
creator | Li, Zhanchao Zhou, Xuan Dai, Zong Zou, Xiaoyong |
description | Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.
In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.
The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors. |
doi_str_mv | 10.1186/1471-2105-11-325 |
format | article |
fullrecord | <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_9f4b388ee10f448e8e8cabeccfd1ac1b</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A231755060</galeid><doaj_id>oai_doaj_org_article_9f4b388ee10f448e8e8cabeccfd1ac1b</doaj_id><sourcerecordid>A231755060</sourcerecordid><originalsourceid>FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253</originalsourceid><addsrcrecordid>eNqFk0tv1DAQxyMEoqVw54QicQAOKXYcJ86lUrWCslIlJB5ny49J1lViBztZ2g_Bd8bpLkuDCsgHxzO_-XsecZI8x-gUY1a-xUWFsxwjmmGckZw-SI4Ppod3vo-SJyFcIYQrhujj5ChHlKIK0-Pkx6oTIZjGKDEaZ1PXpBfZ4N0IxqbKTUMHOvWgYBidD6kUIZ4jF6ZhcH5Mt6CiI-2F2hgL6XczbuLh2vRTH8M62AqrIO2N3Vv0ZHU03aTC6rQFC6NRqeha52Nk_zR51IguwLP9fpJ8ff_uy-pDdvnxYr06v8xkTHrMqkaCKqWSNSswYzmVusaiKguNJUJCIaiFVpLpmpGyoDllJVVMS6AsVp1TcpKsd7raiSs-eNMLf8OdMPzW4HzLhY-ZdcDrppCEMQCMmqJgEJcS8XrVaCwUllHrbKc1TLIHrcCOXnQL0aXHmg1v3ZbnNaKkLKPAaicgjfuLwNKjXM_n0fJ5tBxjTm5LerVPw7tvE4SR9yYo6DphwU2BV7SglBBM_k8SUjNGMYrk63-SmNCyzusczaIv_0Cv3ORtHGKkClblrKyK31QrYmuNbVwsSM2i_DwnuIp_ZTlfe3oPFZeG3ihnoTHRvgh4swiIzAjXYyumEPj686cli3as8i4ED82h0Rjx-VHd19oXdyd8CPj1ishPB0IdyA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1348728674</pqid></control><display><type>article</type><title>Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm</title><source>Publicly Available Content Database</source><source>PubMed Central</source><creator>Li, Zhanchao ; Zhou, Xuan ; Dai, Zong ; Zou, Xiaoyong</creator><creatorcontrib>Li, Zhanchao ; Zhou, Xuan ; Dai, Zong ; Zou, Xiaoyong</creatorcontrib><description>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.
In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.
The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/1471-2105-11-325</identifier><identifier>PMID: 20550715</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Accuracy ; Algorithms ; Amino acids ; Bioinformatics ; Cell receptors ; Classification ; Computational biology ; DNA sequencing ; Genetic algorithms ; Methods ; Nucleotide sequencing ; Peptides ; Proteins ; Receptors, G-Protein-Coupled - chemistry ; Receptors, G-Protein-Coupled - classification ; Sequence Analysis, Protein ; Studies</subject><ispartof>BMC bioinformatics, 2010-06, Vol.11 (1), p.325-325, Article 325</ispartof><rights>COPYRIGHT 2010 BioMed Central Ltd.</rights><rights>2010 Li et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</rights><rights>Copyright ©2010 Li et al; licensee BioMed Central Ltd. 2010 Li et al; licensee BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253</citedby><cites>FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905366/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1348728674?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,25753,27924,27925,37012,37013,44590,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20550715$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Li, Zhanchao</creatorcontrib><creatorcontrib>Zhou, Xuan</creatorcontrib><creatorcontrib>Dai, Zong</creatorcontrib><creatorcontrib>Zou, Xiaoyong</creatorcontrib><title>Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.
In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.
The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Amino acids</subject><subject>Bioinformatics</subject><subject>Cell receptors</subject><subject>Classification</subject><subject>Computational biology</subject><subject>DNA sequencing</subject><subject>Genetic algorithms</subject><subject>Methods</subject><subject>Nucleotide sequencing</subject><subject>Peptides</subject><subject>Proteins</subject><subject>Receptors, G-Protein-Coupled - chemistry</subject><subject>Receptors, G-Protein-Coupled - classification</subject><subject>Sequence Analysis, Protein</subject><subject>Studies</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqFk0tv1DAQxyMEoqVw54QicQAOKXYcJ86lUrWCslIlJB5ny49J1lViBztZ2g_Bd8bpLkuDCsgHxzO_-XsecZI8x-gUY1a-xUWFsxwjmmGckZw-SI4Ppod3vo-SJyFcIYQrhujj5ChHlKIK0-Pkx6oTIZjGKDEaZ1PXpBfZ4N0IxqbKTUMHOvWgYBidD6kUIZ4jF6ZhcH5Mt6CiI-2F2hgL6XczbuLh2vRTH8M62AqrIO2N3Vv0ZHU03aTC6rQFC6NRqeha52Nk_zR51IguwLP9fpJ8ff_uy-pDdvnxYr06v8xkTHrMqkaCKqWSNSswYzmVusaiKguNJUJCIaiFVpLpmpGyoDllJVVMS6AsVp1TcpKsd7raiSs-eNMLf8OdMPzW4HzLhY-ZdcDrppCEMQCMmqJgEJcS8XrVaCwUllHrbKc1TLIHrcCOXnQL0aXHmg1v3ZbnNaKkLKPAaicgjfuLwNKjXM_n0fJ5tBxjTm5LerVPw7tvE4SR9yYo6DphwU2BV7SglBBM_k8SUjNGMYrk63-SmNCyzusczaIv_0Cv3ORtHGKkClblrKyK31QrYmuNbVwsSM2i_DwnuIp_ZTlfe3oPFZeG3ihnoTHRvgh4swiIzAjXYyumEPj686cli3as8i4ED82h0Rjx-VHd19oXdyd8CPj1ishPB0IdyA</recordid><startdate>20100616</startdate><enddate>20100616</enddate><creator>Li, Zhanchao</creator><creator>Zhou, Xuan</creator><creator>Dai, Zong</creator><creator>Zou, Xiaoyong</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><general>BMC</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20100616</creationdate><title>Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm</title><author>Li, Zhanchao ; Zhou, Xuan ; Dai, Zong ; Zou, Xiaoyong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Amino acids</topic><topic>Bioinformatics</topic><topic>Cell receptors</topic><topic>Classification</topic><topic>Computational biology</topic><topic>DNA sequencing</topic><topic>Genetic algorithms</topic><topic>Methods</topic><topic>Nucleotide sequencing</topic><topic>Peptides</topic><topic>Proteins</topic><topic>Receptors, G-Protein-Coupled - chemistry</topic><topic>Receptors, G-Protein-Coupled - classification</topic><topic>Sequence Analysis, Protein</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Zhanchao</creatorcontrib><creatorcontrib>Zhou, Xuan</creatorcontrib><creatorcontrib>Dai, Zong</creatorcontrib><creatorcontrib>Zou, Xiaoyong</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest Biological Science Collection</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Zhanchao</au><au>Zhou, Xuan</au><au>Dai, Zong</au><au>Zou, Xiaoyong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2010-06-16</date><risdate>2010</risdate><volume>11</volume><issue>1</issue><spage>325</spage><epage>325</epage><pages>325-325</pages><artnum>325</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.
In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.
The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>20550715</pmid><doi>10.1186/1471-2105-11-325</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1471-2105 |
ispartof | BMC bioinformatics, 2010-06, Vol.11 (1), p.325-325, Article 325 |
issn | 1471-2105 1471-2105 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_9f4b388ee10f448e8e8cabeccfd1ac1b |
source | Publicly Available Content Database; PubMed Central |
subjects | Accuracy Algorithms Amino acids Bioinformatics Cell receptors Classification Computational biology DNA sequencing Genetic algorithms Methods Nucleotide sequencing Peptides Proteins Receptors, G-Protein-Coupled - chemistry Receptors, G-Protein-Coupled - classification Sequence Analysis, Protein Studies |
title | Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T18%3A49%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Classification%20of%20G-protein%20coupled%20receptors%20based%20on%20support%20vector%20machine%20with%20maximum%20relevance%20minimum%20redundancy%20and%20genetic%20algorithm&rft.jtitle=BMC%20bioinformatics&rft.au=Li,%20Zhanchao&rft.date=2010-06-16&rft.volume=11&rft.issue=1&rft.spage=325&rft.epage=325&rft.pages=325-325&rft.artnum=325&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/1471-2105-11-325&rft_dat=%3Cgale_doaj_%3EA231755060%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-b715t-7fbec6bcb98418825bd91a764d1b00ac0e9adcb8d98364525865c8dbe58071253%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1348728674&rft_id=info:pmid/20550715&rft_galeid=A231755060&rfr_iscdi=true |