Loading…

AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information

In extremely cold environments, living organisms like plants, animals, fishes, and microbes can die due to the intracellular ice formation in their bodies. To sustain life in such cold environments, some cold-blooded species produced Antifreeze proteins (AFPs), also called ice-binding proteins. AFPs...

Full description

Saved in:
Bibliographic Details
Published in:Computers in biology and medicine 2021-12, Vol.139, p.105006-105006, Article 105006
Main Authors: Ali, Farman, Akbar, Shahid, Ghulam, Ali, Maher, Zulfikar Ahmed, Unar, Ahsanullah, Talpur, Dhani Bux
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13
cites cdi_FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13
container_end_page 105006
container_issue
container_start_page 105006
container_title Computers in biology and medicine
container_volume 139
creator Ali, Farman
Akbar, Shahid
Ghulam, Ali
Maher, Zulfikar Ahmed
Unar, Ahsanullah
Talpur, Dhani Bux
description In extremely cold environments, living organisms like plants, animals, fishes, and microbes can die due to the intracellular ice formation in their bodies. To sustain life in such cold environments, some cold-blooded species produced Antifreeze proteins (AFPs), also called ice-binding proteins. AFPs are not only limited to the medical field but also have diverse significance in the area of biotechnology, agriculture, and the food industry. Different AFPs exhibit high heterogeneity in their structures and sequences. Keeping the significance of AFPs, several machine-learning-based models have been developed by scientists for the prediction of AFPs. However, due to the complex and diverse nature of AFPs, the prediction performance of the existing methods is limited. Therefore, it is highly indispensable for researchers to develop a reliable computational model that can accurately predict AFPs. In this connection, this study presents a novel predictor for AFPs, named AFP-CMBPred. The sequences of AFPs are formulated via four different feature representation methods, such as Amphiphilic pseudo amino acid composition (Amp-PseAAC), Dipeptide Deviation from Expected Mean (DDE), Multi-Blocks Position Specific Scoring Matrix (MB-PSSM), and Consensus Sequence-based on Multi-Blocks Position Specific Scoring Matrix (CS-MB-PSSM) to collect local and global descriptors. In the next step, the extracted feature vectors are evaluated via Support Vector Machine (SVM) and Random Forest (RF) based classification learners. The prediction performance of both classifiers is further assessed using three validation methods i.e., jackknife test, 10-fold cross-validation test, and independent test. After examining the prediction rates of all validation tests, it was found that our proposed model achieved the higher prediction accuracies of ∼2.65%, ∼2.84%, and ∼3.37% using jackknife, K-fold, and independent test, respectively. The experimental outcomes validate that our proposed “AFP-CMBPred” predictor secured the highest prediction results than the existing models for the identification of AFPs. It is further anticipated that our proposed AFP-CMBPred model will be considered a valuable tool in the research academia and drug development. •Designed a novel predictor named AFP-CMBPred for prediction of Antifreeze proteins.•The local and global discriminative features are explored by Amp-PseAAC, DDE, MB-PSSM, and CS-MB-PSSM.•SVM and RF are used as classification algorithms.•AFP-CMBPred p
doi_str_mv 10.1016/j.compbiomed.2021.105006
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2595559375</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0010482521008003</els_id><sourcerecordid>2595559375</sourcerecordid><originalsourceid>FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13</originalsourceid><addsrcrecordid>eNqFkcFu1DAQhi0EokvhFZAlLlyyjBPbibm1KwpIRfTQu5XYY-QliRfbqSjPwEPj7LZC4sLJsueb-T3_TwhlsGXA5Lv91oTpMPgwod3WULPyLADkE7JhXasqEA1_SjYADCre1eKMvEhpDwAcGnhOzhrecgVKbsjvi6ubavfl8iaifU93ZeqS--zD3I_UW5yzd94cH2hwtF_vEfEX0kMMGf2c6HBP8WfG2fr5GzVhTjinJdGEPxacDSbq5xzotIzZV8MYzPdE8S6My1Ek3peyC3E6Srwkz1w_Jnz1cJ6T26sPt7tP1fXXj593F9eV4VDnire27zhIrJkZrG1QSeyYG0THnbJWiLY4ojpuuZOgurrlvZGDZK0yrnWsOSdvT2PLDuWTKevJJ4Pj2M8YlqRroYQQqmlFQd_8g-7DEos5hZLFTZBKqkJ1J8rEkFJEpw_RT2U5zUCvgem9_huYXgPTp8BK6-sHgWVYa4-NjwkV4PIEYDHkzmPUyfjVWOsjmqxt8P9X-QMeuq7O</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2604006969</pqid></control><display><type>article</type><title>AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information</title><source>Elsevier</source><creator>Ali, Farman ; Akbar, Shahid ; Ghulam, Ali ; Maher, Zulfikar Ahmed ; Unar, Ahsanullah ; Talpur, Dhani Bux</creator><creatorcontrib>Ali, Farman ; Akbar, Shahid ; Ghulam, Ali ; Maher, Zulfikar Ahmed ; Unar, Ahsanullah ; Talpur, Dhani Bux</creatorcontrib><description>In extremely cold environments, living organisms like plants, animals, fishes, and microbes can die due to the intracellular ice formation in their bodies. To sustain life in such cold environments, some cold-blooded species produced Antifreeze proteins (AFPs), also called ice-binding proteins. AFPs are not only limited to the medical field but also have diverse significance in the area of biotechnology, agriculture, and the food industry. Different AFPs exhibit high heterogeneity in their structures and sequences. Keeping the significance of AFPs, several machine-learning-based models have been developed by scientists for the prediction of AFPs. However, due to the complex and diverse nature of AFPs, the prediction performance of the existing methods is limited. Therefore, it is highly indispensable for researchers to develop a reliable computational model that can accurately predict AFPs. In this connection, this study presents a novel predictor for AFPs, named AFP-CMBPred. The sequences of AFPs are formulated via four different feature representation methods, such as Amphiphilic pseudo amino acid composition (Amp-PseAAC), Dipeptide Deviation from Expected Mean (DDE), Multi-Blocks Position Specific Scoring Matrix (MB-PSSM), and Consensus Sequence-based on Multi-Blocks Position Specific Scoring Matrix (CS-MB-PSSM) to collect local and global descriptors. In the next step, the extracted feature vectors are evaluated via Support Vector Machine (SVM) and Random Forest (RF) based classification learners. The prediction performance of both classifiers is further assessed using three validation methods i.e., jackknife test, 10-fold cross-validation test, and independent test. After examining the prediction rates of all validation tests, it was found that our proposed model achieved the higher prediction accuracies of ∼2.65%, ∼2.84%, and ∼3.37% using jackknife, K-fold, and independent test, respectively. The experimental outcomes validate that our proposed “AFP-CMBPred” predictor secured the highest prediction results than the existing models for the identification of AFPs. It is further anticipated that our proposed AFP-CMBPred model will be considered a valuable tool in the research academia and drug development. •Designed a novel predictor named AFP-CMBPred for prediction of Antifreeze proteins.•The local and global discriminative features are explored by Amp-PseAAC, DDE, MB-PSSM, and CS-MB-PSSM.•SVM and RF are used as classification algorithms.•AFP-CMBPred predictor secured the highest prediction results for AFPs identification.</description><identifier>ISSN: 0010-4825</identifier><identifier>EISSN: 1879-0534</identifier><identifier>DOI: 10.1016/j.compbiomed.2021.105006</identifier><identifier>PMID: 34749096</identifier><language>eng</language><publisher>United States: Elsevier Ltd</publisher><subject>Algorithms ; Amino acid composition ; Amino acids ; Amphiphilic Pseudo Amino Acid Composition ; Animals ; Antifreeze proteins ; Antifreeze Proteins - genetics ; Bacteria ; Biotechnology ; Cold ; Computational Biology ; Computer applications ; Consensus Sequence ; Consensus Sequences ; Conserved sequence ; Datasets ; DDE ; Drug development ; Feature extraction ; Food industry ; Heterogeneity ; Ice formation ; Learning algorithms ; Machine learning ; Methods ; Multi-Blocks position Specific Scoring Matrix ; Nitrous oxide ; Peptides ; Performance prediction ; Plants ; Predictions ; Proteins ; Support Vector Machine ; Support vector machines</subject><ispartof>Computers in biology and medicine, 2021-12, Vol.139, p.105006-105006, Article 105006</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright © 2021 Elsevier Ltd. All rights reserved.</rights><rights>2021. Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13</citedby><cites>FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13</cites><orcidid>0000-0002-0914-1577</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34749096$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ali, Farman</creatorcontrib><creatorcontrib>Akbar, Shahid</creatorcontrib><creatorcontrib>Ghulam, Ali</creatorcontrib><creatorcontrib>Maher, Zulfikar Ahmed</creatorcontrib><creatorcontrib>Unar, Ahsanullah</creatorcontrib><creatorcontrib>Talpur, Dhani Bux</creatorcontrib><title>AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information</title><title>Computers in biology and medicine</title><addtitle>Comput Biol Med</addtitle><description>In extremely cold environments, living organisms like plants, animals, fishes, and microbes can die due to the intracellular ice formation in their bodies. To sustain life in such cold environments, some cold-blooded species produced Antifreeze proteins (AFPs), also called ice-binding proteins. AFPs are not only limited to the medical field but also have diverse significance in the area of biotechnology, agriculture, and the food industry. Different AFPs exhibit high heterogeneity in their structures and sequences. Keeping the significance of AFPs, several machine-learning-based models have been developed by scientists for the prediction of AFPs. However, due to the complex and diverse nature of AFPs, the prediction performance of the existing methods is limited. Therefore, it is highly indispensable for researchers to develop a reliable computational model that can accurately predict AFPs. In this connection, this study presents a novel predictor for AFPs, named AFP-CMBPred. The sequences of AFPs are formulated via four different feature representation methods, such as Amphiphilic pseudo amino acid composition (Amp-PseAAC), Dipeptide Deviation from Expected Mean (DDE), Multi-Blocks Position Specific Scoring Matrix (MB-PSSM), and Consensus Sequence-based on Multi-Blocks Position Specific Scoring Matrix (CS-MB-PSSM) to collect local and global descriptors. In the next step, the extracted feature vectors are evaluated via Support Vector Machine (SVM) and Random Forest (RF) based classification learners. The prediction performance of both classifiers is further assessed using three validation methods i.e., jackknife test, 10-fold cross-validation test, and independent test. After examining the prediction rates of all validation tests, it was found that our proposed model achieved the higher prediction accuracies of ∼2.65%, ∼2.84%, and ∼3.37% using jackknife, K-fold, and independent test, respectively. The experimental outcomes validate that our proposed “AFP-CMBPred” predictor secured the highest prediction results than the existing models for the identification of AFPs. It is further anticipated that our proposed AFP-CMBPred model will be considered a valuable tool in the research academia and drug development. •Designed a novel predictor named AFP-CMBPred for prediction of Antifreeze proteins.•The local and global discriminative features are explored by Amp-PseAAC, DDE, MB-PSSM, and CS-MB-PSSM.•SVM and RF are used as classification algorithms.•AFP-CMBPred predictor secured the highest prediction results for AFPs identification.</description><subject>Algorithms</subject><subject>Amino acid composition</subject><subject>Amino acids</subject><subject>Amphiphilic Pseudo Amino Acid Composition</subject><subject>Animals</subject><subject>Antifreeze proteins</subject><subject>Antifreeze Proteins - genetics</subject><subject>Bacteria</subject><subject>Biotechnology</subject><subject>Cold</subject><subject>Computational Biology</subject><subject>Computer applications</subject><subject>Consensus Sequence</subject><subject>Consensus Sequences</subject><subject>Conserved sequence</subject><subject>Datasets</subject><subject>DDE</subject><subject>Drug development</subject><subject>Feature extraction</subject><subject>Food industry</subject><subject>Heterogeneity</subject><subject>Ice formation</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Methods</subject><subject>Multi-Blocks position Specific Scoring Matrix</subject><subject>Nitrous oxide</subject><subject>Peptides</subject><subject>Performance prediction</subject><subject>Plants</subject><subject>Predictions</subject><subject>Proteins</subject><subject>Support Vector Machine</subject><subject>Support vector machines</subject><issn>0010-4825</issn><issn>1879-0534</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqFkcFu1DAQhi0EokvhFZAlLlyyjBPbibm1KwpIRfTQu5XYY-QliRfbqSjPwEPj7LZC4sLJsueb-T3_TwhlsGXA5Lv91oTpMPgwod3WULPyLADkE7JhXasqEA1_SjYADCre1eKMvEhpDwAcGnhOzhrecgVKbsjvi6ubavfl8iaifU93ZeqS--zD3I_UW5yzd94cH2hwtF_vEfEX0kMMGf2c6HBP8WfG2fr5GzVhTjinJdGEPxacDSbq5xzotIzZV8MYzPdE8S6My1Ek3peyC3E6Srwkz1w_Jnz1cJ6T26sPt7tP1fXXj593F9eV4VDnire27zhIrJkZrG1QSeyYG0THnbJWiLY4ojpuuZOgurrlvZGDZK0yrnWsOSdvT2PLDuWTKevJJ4Pj2M8YlqRroYQQqmlFQd_8g-7DEos5hZLFTZBKqkJ1J8rEkFJEpw_RT2U5zUCvgem9_huYXgPTp8BK6-sHgWVYa4-NjwkV4PIEYDHkzmPUyfjVWOsjmqxt8P9X-QMeuq7O</recordid><startdate>202112</startdate><enddate>202112</enddate><creator>Ali, Farman</creator><creator>Akbar, Shahid</creator><creator>Ghulam, Ali</creator><creator>Maher, Zulfikar Ahmed</creator><creator>Unar, Ahsanullah</creator><creator>Talpur, Dhani Bux</creator><general>Elsevier Ltd</general><general>Elsevier Limited</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7RV</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>KB0</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M2O</scope><scope>M7P</scope><scope>M7Z</scope><scope>MBDVC</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-0914-1577</orcidid></search><sort><creationdate>202112</creationdate><title>AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information</title><author>Ali, Farman ; Akbar, Shahid ; Ghulam, Ali ; Maher, Zulfikar Ahmed ; Unar, Ahsanullah ; Talpur, Dhani Bux</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Amino acid composition</topic><topic>Amino acids</topic><topic>Amphiphilic Pseudo Amino Acid Composition</topic><topic>Animals</topic><topic>Antifreeze proteins</topic><topic>Antifreeze Proteins - genetics</topic><topic>Bacteria</topic><topic>Biotechnology</topic><topic>Cold</topic><topic>Computational Biology</topic><topic>Computer applications</topic><topic>Consensus Sequence</topic><topic>Consensus Sequences</topic><topic>Conserved sequence</topic><topic>Datasets</topic><topic>DDE</topic><topic>Drug development</topic><topic>Feature extraction</topic><topic>Food industry</topic><topic>Heterogeneity</topic><topic>Ice formation</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Methods</topic><topic>Multi-Blocks position Specific Scoring Matrix</topic><topic>Nitrous oxide</topic><topic>Peptides</topic><topic>Performance prediction</topic><topic>Plants</topic><topic>Predictions</topic><topic>Proteins</topic><topic>Support Vector Machine</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ali, Farman</creatorcontrib><creatorcontrib>Akbar, Shahid</creatorcontrib><creatorcontrib>Ghulam, Ali</creatorcontrib><creatorcontrib>Maher, Zulfikar Ahmed</creatorcontrib><creatorcontrib>Unar, Ahsanullah</creatorcontrib><creatorcontrib>Talpur, Dhani Bux</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Nursing and Allied Health Journals</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Research Library</collection><collection>Biological Science Database</collection><collection>Biochemistry Abstracts 1</collection><collection>Research Library (Corporate)</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><jtitle>Computers in biology and medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ali, Farman</au><au>Akbar, Shahid</au><au>Ghulam, Ali</au><au>Maher, Zulfikar Ahmed</au><au>Unar, Ahsanullah</au><au>Talpur, Dhani Bux</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information</atitle><jtitle>Computers in biology and medicine</jtitle><addtitle>Comput Biol Med</addtitle><date>2021-12</date><risdate>2021</risdate><volume>139</volume><spage>105006</spage><epage>105006</epage><pages>105006-105006</pages><artnum>105006</artnum><issn>0010-4825</issn><eissn>1879-0534</eissn><abstract>In extremely cold environments, living organisms like plants, animals, fishes, and microbes can die due to the intracellular ice formation in their bodies. To sustain life in such cold environments, some cold-blooded species produced Antifreeze proteins (AFPs), also called ice-binding proteins. AFPs are not only limited to the medical field but also have diverse significance in the area of biotechnology, agriculture, and the food industry. Different AFPs exhibit high heterogeneity in their structures and sequences. Keeping the significance of AFPs, several machine-learning-based models have been developed by scientists for the prediction of AFPs. However, due to the complex and diverse nature of AFPs, the prediction performance of the existing methods is limited. Therefore, it is highly indispensable for researchers to develop a reliable computational model that can accurately predict AFPs. In this connection, this study presents a novel predictor for AFPs, named AFP-CMBPred. The sequences of AFPs are formulated via four different feature representation methods, such as Amphiphilic pseudo amino acid composition (Amp-PseAAC), Dipeptide Deviation from Expected Mean (DDE), Multi-Blocks Position Specific Scoring Matrix (MB-PSSM), and Consensus Sequence-based on Multi-Blocks Position Specific Scoring Matrix (CS-MB-PSSM) to collect local and global descriptors. In the next step, the extracted feature vectors are evaluated via Support Vector Machine (SVM) and Random Forest (RF) based classification learners. The prediction performance of both classifiers is further assessed using three validation methods i.e., jackknife test, 10-fold cross-validation test, and independent test. After examining the prediction rates of all validation tests, it was found that our proposed model achieved the higher prediction accuracies of ∼2.65%, ∼2.84%, and ∼3.37% using jackknife, K-fold, and independent test, respectively. The experimental outcomes validate that our proposed “AFP-CMBPred” predictor secured the highest prediction results than the existing models for the identification of AFPs. It is further anticipated that our proposed AFP-CMBPred model will be considered a valuable tool in the research academia and drug development. •Designed a novel predictor named AFP-CMBPred for prediction of Antifreeze proteins.•The local and global discriminative features are explored by Amp-PseAAC, DDE, MB-PSSM, and CS-MB-PSSM.•SVM and RF are used as classification algorithms.•AFP-CMBPred predictor secured the highest prediction results for AFPs identification.</abstract><cop>United States</cop><pub>Elsevier Ltd</pub><pmid>34749096</pmid><doi>10.1016/j.compbiomed.2021.105006</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-0914-1577</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0010-4825
ispartof Computers in biology and medicine, 2021-12, Vol.139, p.105006-105006, Article 105006
issn 0010-4825
1879-0534
language eng
recordid cdi_proquest_miscellaneous_2595559375
source Elsevier
subjects Algorithms
Amino acid composition
Amino acids
Amphiphilic Pseudo Amino Acid Composition
Animals
Antifreeze proteins
Antifreeze Proteins - genetics
Bacteria
Biotechnology
Cold
Computational Biology
Computer applications
Consensus Sequence
Consensus Sequences
Conserved sequence
Datasets
DDE
Drug development
Feature extraction
Food industry
Heterogeneity
Ice formation
Learning algorithms
Machine learning
Methods
Multi-Blocks position Specific Scoring Matrix
Nitrous oxide
Peptides
Performance prediction
Plants
Predictions
Proteins
Support Vector Machine
Support vector machines
title AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T15%3A54%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=AFP-CMBPred:%20Computational%20identification%20of%20antifreeze%20proteins%20by%20extending%20consensus%20sequences%20into%20multi-blocks%20evolutionary%20information&rft.jtitle=Computers%20in%20biology%20and%20medicine&rft.au=Ali,%20Farman&rft.date=2021-12&rft.volume=139&rft.spage=105006&rft.epage=105006&rft.pages=105006-105006&rft.artnum=105006&rft.issn=0010-4825&rft.eissn=1879-0534&rft_id=info:doi/10.1016/j.compbiomed.2021.105006&rft_dat=%3Cproquest_cross%3E2595559375%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c402t-47da8406e21cbdd3e96e81fb584f9dd557021984d4f6098274ac6b6179cf7f13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2604006969&rft_id=info:pmid/34749096&rfr_iscdi=true