Loading…

Functional Prediction of Hypothetical Proteins from Shigella flexneri and Validation of the Predicted Models by Using ROC Curve Analysis

Shigella spp. constitutes some of the key pathogens responsible for the global burden of diarrhoeal disease. With over 164 million reported cases per annum, shigellosis accounts for 1.1 million deaths each year. Majority of these cases occur among the children of the developing nations and the emerg...

Full description

Saved in:
Bibliographic Details
Published in:Genomics & informatics 2018, 16(4), , pp.26-26
Main Authors: Gazi, Md Amran, Mahmud, Sultan, Fahim, Shah Mohammad, Kibria, Mohammad Golam, Palit, Parag, Islam, Md Rezaul, Rashid, Humaira, Das, Subhasish, Mahfuz, Mustafa, Ahmeed, Tahmeed
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3
cites cdi_FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3
container_end_page e26
container_issue 4
container_start_page e26
container_title Genomics & informatics
container_volume 16
creator Gazi, Md Amran
Mahmud, Sultan
Fahim, Shah Mohammad
Kibria, Mohammad Golam
Palit, Parag
Islam, Md Rezaul
Rashid, Humaira
Das, Subhasish
Mahfuz, Mustafa
Ahmeed, Tahmeed
description Shigella spp. constitutes some of the key pathogens responsible for the global burden of diarrhoeal disease. With over 164 million reported cases per annum, shigellosis accounts for 1.1 million deaths each year. Majority of these cases occur among the children of the developing nations and the emergence of multi-drug resistance Shigella strains in clinical isolates demands the development of better/new drugs against this pathogen. The genome of Shigella flexneri was extensively analyzed and found 4,362 proteins among which the functions of 674 proteins, termed as hypothetical proteins (HPs) had not been previously elucidated. Amino acid sequences of all these 674 HPs were studied and the functions of a total of 39 HPs have been assigned with high level of confidence. Here we have utilized a combination of the latest versions of databases to assign the precise function of HPs for which no experimental information is available. These HPs were found to belong to various classes of proteins such as enzymes, binding proteins, signal transducers, lipoprotein, transporters, virulence and other proteins. Evaluation of the performance of the various computational tools conducted using receiver operating characteristic curve analysis and a resoundingly high average accuracy of 93.6% were obtained. Our comprehensive analysis will help to gain greater understanding for the development of many novel potential therapeutic interventions to defeat Shigella infection.
doi_str_mv 10.5808/GI.2018.16.4.e26
format article
fullrecord <record><control><sourceid>proquest_nrf_k</sourceid><recordid>TN_cdi_nrf_kci_oai_kci_go_kr_ARTI_3949824</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2163012006</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3</originalsourceid><addsrcrecordid>eNpVkU9vEzEQxS0EoqFw54R8hMMuttdrOxekKKJppKKi0iJultd_EtONHezdinwDPjZO0lTgy8ia934zmgfAW4zqViDxcbGsCcKixqymtSXsGZgQ0tAKcUqegwlup6ISjP04A69y_okQow2nL8FZgxgiSPAJ-HMxBj34GFQPvyZr_OEDo4OXu20c1nbw-tCKg_UhQ5fiBn5b-5XtewVdb38HmzxUwcDvqvdGnezFegJaA79EY_sMux28yz6s4M31HM7H9GDhrEzeZZ9fgxdO9dm-eazn4O7i8-38srq6Xizns6tKE96yynDUKeKQaMtjziHUcUud5pYbTF1LMbVo2nBONDMCc46xYK0SqusI1lQ35-DDkRuSk_fay6j8oa6ivE9ydnO7lM2UTgWhRfvpqN2O3cYabcOQVC-3yW9U2h2c_3eCXxfOg2SUIsZIAbx_BKT4a7R5kBuf9f50wcYxS4JZgzApwRQpOkp1ijkn657GYCT3YcvFUu7DlphJKkvYxfLu3_WeDKd0m7_YmKcp</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2163012006</pqid></control><display><type>article</type><title>Functional Prediction of Hypothetical Proteins from Shigella flexneri and Validation of the Predicted Models by Using ROC Curve Analysis</title><source>Open Access: PubMed Central</source><creator>Gazi, Md Amran ; Mahmud, Sultan ; Fahim, Shah Mohammad ; Kibria, Mohammad Golam ; Palit, Parag ; Islam, Md Rezaul ; Rashid, Humaira ; Das, Subhasish ; Mahfuz, Mustafa ; Ahmeed, Tahmeed</creator><creatorcontrib>Gazi, Md Amran ; Mahmud, Sultan ; Fahim, Shah Mohammad ; Kibria, Mohammad Golam ; Palit, Parag ; Islam, Md Rezaul ; Rashid, Humaira ; Das, Subhasish ; Mahfuz, Mustafa ; Ahmeed, Tahmeed</creatorcontrib><description>Shigella spp. constitutes some of the key pathogens responsible for the global burden of diarrhoeal disease. With over 164 million reported cases per annum, shigellosis accounts for 1.1 million deaths each year. Majority of these cases occur among the children of the developing nations and the emergence of multi-drug resistance Shigella strains in clinical isolates demands the development of better/new drugs against this pathogen. The genome of Shigella flexneri was extensively analyzed and found 4,362 proteins among which the functions of 674 proteins, termed as hypothetical proteins (HPs) had not been previously elucidated. Amino acid sequences of all these 674 HPs were studied and the functions of a total of 39 HPs have been assigned with high level of confidence. Here we have utilized a combination of the latest versions of databases to assign the precise function of HPs for which no experimental information is available. These HPs were found to belong to various classes of proteins such as enzymes, binding proteins, signal transducers, lipoprotein, transporters, virulence and other proteins. Evaluation of the performance of the various computational tools conducted using receiver operating characteristic curve analysis and a resoundingly high average accuracy of 93.6% were obtained. Our comprehensive analysis will help to gain greater understanding for the development of many novel potential therapeutic interventions to defeat Shigella infection.</description><identifier>ISSN: 1598-866X</identifier><identifier>ISSN: 2234-0742</identifier><identifier>EISSN: 2234-0742</identifier><identifier>DOI: 10.5808/GI.2018.16.4.e26</identifier><identifier>PMID: 30602087</identifier><language>eng</language><publisher>Korea (South): Korea Genome Organization</publisher><subject>Original ; 자연과학일반</subject><ispartof>Genomics &amp; Informatics, 2018, 16(4), , pp.26-26</ispartof><rights>Copyright © 2018 by Korea Genome Organization 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3</citedby><cites>FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3</cites><orcidid>0000-0002-7852-6569 ; 0000-0002-7821-2455 ; 0000-0002-3627-202X ; 0000-0002-3286-7536 ; 0000-0002-0392-9646 ; 0000-0001-7863-2639 ; 0000-0001-8607-573X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6440662/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6440662/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30602087$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002427048$$DAccess content in National Research Foundation of Korea (NRF)$$Hfree_for_read</backlink></links><search><creatorcontrib>Gazi, Md Amran</creatorcontrib><creatorcontrib>Mahmud, Sultan</creatorcontrib><creatorcontrib>Fahim, Shah Mohammad</creatorcontrib><creatorcontrib>Kibria, Mohammad Golam</creatorcontrib><creatorcontrib>Palit, Parag</creatorcontrib><creatorcontrib>Islam, Md Rezaul</creatorcontrib><creatorcontrib>Rashid, Humaira</creatorcontrib><creatorcontrib>Das, Subhasish</creatorcontrib><creatorcontrib>Mahfuz, Mustafa</creatorcontrib><creatorcontrib>Ahmeed, Tahmeed</creatorcontrib><title>Functional Prediction of Hypothetical Proteins from Shigella flexneri and Validation of the Predicted Models by Using ROC Curve Analysis</title><title>Genomics &amp; informatics</title><addtitle>Genomics Inform</addtitle><description>Shigella spp. constitutes some of the key pathogens responsible for the global burden of diarrhoeal disease. With over 164 million reported cases per annum, shigellosis accounts for 1.1 million deaths each year. Majority of these cases occur among the children of the developing nations and the emergence of multi-drug resistance Shigella strains in clinical isolates demands the development of better/new drugs against this pathogen. The genome of Shigella flexneri was extensively analyzed and found 4,362 proteins among which the functions of 674 proteins, termed as hypothetical proteins (HPs) had not been previously elucidated. Amino acid sequences of all these 674 HPs were studied and the functions of a total of 39 HPs have been assigned with high level of confidence. Here we have utilized a combination of the latest versions of databases to assign the precise function of HPs for which no experimental information is available. These HPs were found to belong to various classes of proteins such as enzymes, binding proteins, signal transducers, lipoprotein, transporters, virulence and other proteins. Evaluation of the performance of the various computational tools conducted using receiver operating characteristic curve analysis and a resoundingly high average accuracy of 93.6% were obtained. Our comprehensive analysis will help to gain greater understanding for the development of many novel potential therapeutic interventions to defeat Shigella infection.</description><subject>Original</subject><subject>자연과학일반</subject><issn>1598-866X</issn><issn>2234-0742</issn><issn>2234-0742</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNpVkU9vEzEQxS0EoqFw54R8hMMuttdrOxekKKJppKKi0iJultd_EtONHezdinwDPjZO0lTgy8ia934zmgfAW4zqViDxcbGsCcKixqymtSXsGZgQ0tAKcUqegwlup6ISjP04A69y_okQow2nL8FZgxgiSPAJ-HMxBj34GFQPvyZr_OEDo4OXu20c1nbw-tCKg_UhQ5fiBn5b-5XtewVdb38HmzxUwcDvqvdGnezFegJaA79EY_sMux28yz6s4M31HM7H9GDhrEzeZZ9fgxdO9dm-eazn4O7i8-38srq6Xizns6tKE96yynDUKeKQaMtjziHUcUud5pYbTF1LMbVo2nBONDMCc46xYK0SqusI1lQ35-DDkRuSk_fay6j8oa6ivE9ydnO7lM2UTgWhRfvpqN2O3cYabcOQVC-3yW9U2h2c_3eCXxfOg2SUIsZIAbx_BKT4a7R5kBuf9f50wcYxS4JZgzApwRQpOkp1ijkn657GYCT3YcvFUu7DlphJKkvYxfLu3_WeDKd0m7_YmKcp</recordid><startdate>20181201</startdate><enddate>20181201</enddate><creator>Gazi, Md Amran</creator><creator>Mahmud, Sultan</creator><creator>Fahim, Shah Mohammad</creator><creator>Kibria, Mohammad Golam</creator><creator>Palit, Parag</creator><creator>Islam, Md Rezaul</creator><creator>Rashid, Humaira</creator><creator>Das, Subhasish</creator><creator>Mahfuz, Mustafa</creator><creator>Ahmeed, Tahmeed</creator><general>Korea Genome Organization</general><general>한국유전체학회</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><scope>ACYCR</scope><orcidid>https://orcid.org/0000-0002-7852-6569</orcidid><orcidid>https://orcid.org/0000-0002-7821-2455</orcidid><orcidid>https://orcid.org/0000-0002-3627-202X</orcidid><orcidid>https://orcid.org/0000-0002-3286-7536</orcidid><orcidid>https://orcid.org/0000-0002-0392-9646</orcidid><orcidid>https://orcid.org/0000-0001-7863-2639</orcidid><orcidid>https://orcid.org/0000-0001-8607-573X</orcidid></search><sort><creationdate>20181201</creationdate><title>Functional Prediction of Hypothetical Proteins from Shigella flexneri and Validation of the Predicted Models by Using ROC Curve Analysis</title><author>Gazi, Md Amran ; Mahmud, Sultan ; Fahim, Shah Mohammad ; Kibria, Mohammad Golam ; Palit, Parag ; Islam, Md Rezaul ; Rashid, Humaira ; Das, Subhasish ; Mahfuz, Mustafa ; Ahmeed, Tahmeed</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Original</topic><topic>자연과학일반</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gazi, Md Amran</creatorcontrib><creatorcontrib>Mahmud, Sultan</creatorcontrib><creatorcontrib>Fahim, Shah Mohammad</creatorcontrib><creatorcontrib>Kibria, Mohammad Golam</creatorcontrib><creatorcontrib>Palit, Parag</creatorcontrib><creatorcontrib>Islam, Md Rezaul</creatorcontrib><creatorcontrib>Rashid, Humaira</creatorcontrib><creatorcontrib>Das, Subhasish</creatorcontrib><creatorcontrib>Mahfuz, Mustafa</creatorcontrib><creatorcontrib>Ahmeed, Tahmeed</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Korean Citation Index</collection><jtitle>Genomics &amp; informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gazi, Md Amran</au><au>Mahmud, Sultan</au><au>Fahim, Shah Mohammad</au><au>Kibria, Mohammad Golam</au><au>Palit, Parag</au><au>Islam, Md Rezaul</au><au>Rashid, Humaira</au><au>Das, Subhasish</au><au>Mahfuz, Mustafa</au><au>Ahmeed, Tahmeed</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Functional Prediction of Hypothetical Proteins from Shigella flexneri and Validation of the Predicted Models by Using ROC Curve Analysis</atitle><jtitle>Genomics &amp; informatics</jtitle><addtitle>Genomics Inform</addtitle><date>2018-12-01</date><risdate>2018</risdate><volume>16</volume><issue>4</issue><spage>e26</spage><epage>e26</epage><pages>e26-e26</pages><issn>1598-866X</issn><issn>2234-0742</issn><eissn>2234-0742</eissn><abstract>Shigella spp. constitutes some of the key pathogens responsible for the global burden of diarrhoeal disease. With over 164 million reported cases per annum, shigellosis accounts for 1.1 million deaths each year. Majority of these cases occur among the children of the developing nations and the emergence of multi-drug resistance Shigella strains in clinical isolates demands the development of better/new drugs against this pathogen. The genome of Shigella flexneri was extensively analyzed and found 4,362 proteins among which the functions of 674 proteins, termed as hypothetical proteins (HPs) had not been previously elucidated. Amino acid sequences of all these 674 HPs were studied and the functions of a total of 39 HPs have been assigned with high level of confidence. Here we have utilized a combination of the latest versions of databases to assign the precise function of HPs for which no experimental information is available. These HPs were found to belong to various classes of proteins such as enzymes, binding proteins, signal transducers, lipoprotein, transporters, virulence and other proteins. Evaluation of the performance of the various computational tools conducted using receiver operating characteristic curve analysis and a resoundingly high average accuracy of 93.6% were obtained. Our comprehensive analysis will help to gain greater understanding for the development of many novel potential therapeutic interventions to defeat Shigella infection.</abstract><cop>Korea (South)</cop><pub>Korea Genome Organization</pub><pmid>30602087</pmid><doi>10.5808/GI.2018.16.4.e26</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-7852-6569</orcidid><orcidid>https://orcid.org/0000-0002-7821-2455</orcidid><orcidid>https://orcid.org/0000-0002-3627-202X</orcidid><orcidid>https://orcid.org/0000-0002-3286-7536</orcidid><orcidid>https://orcid.org/0000-0002-0392-9646</orcidid><orcidid>https://orcid.org/0000-0001-7863-2639</orcidid><orcidid>https://orcid.org/0000-0001-8607-573X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1598-866X
ispartof Genomics & Informatics, 2018, 16(4), , pp.26-26
issn 1598-866X
2234-0742
2234-0742
language eng
recordid cdi_nrf_kci_oai_kci_go_kr_ARTI_3949824
source Open Access: PubMed Central
subjects Original
자연과학일반
title Functional Prediction of Hypothetical Proteins from Shigella flexneri and Validation of the Predicted Models by Using ROC Curve Analysis
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T00%3A47%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_nrf_k&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Functional%20Prediction%20of%20Hypothetical%20Proteins%20from%20Shigella%20flexneri%20and%20Validation%20of%20the%20Predicted%20Models%20by%20Using%20ROC%20Curve%20Analysis&rft.jtitle=Genomics%20&%20informatics&rft.au=Gazi,%20Md%20Amran&rft.date=2018-12-01&rft.volume=16&rft.issue=4&rft.spage=e26&rft.epage=e26&rft.pages=e26-e26&rft.issn=1598-866X&rft.eissn=2234-0742&rft_id=info:doi/10.5808/GI.2018.16.4.e26&rft_dat=%3Cproquest_nrf_k%3E2163012006%3C/proquest_nrf_k%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c2756-d70ba2f0855556ff00b7e4fc7e7d14f5414e093772c6d817711865a8abb21c4c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2163012006&rft_id=info:pmid/30602087&rfr_iscdi=true