Loading…

Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods

One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of...

Full description

Saved in:
Bibliographic Details
Published in:PloS one 2022-06, Vol.17 (6), p.e0267714-e0267714
Main Authors: Mokoatle, Mpho, Mapiye, Darlington, Marivate, Vukosi, Hayes, Vanessa M, Bornman, Riana
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13
cites cdi_FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13
container_end_page e0267714
container_issue 6
container_start_page e0267714
container_title PloS one
container_volume 17
creator Mokoatle, Mpho
Mapiye, Darlington
Marivate, Vukosi
Hayes, Vanessa M
Bornman, Riana
description One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of the pathologist and tends to suffer from the lack of reproducibility of biopsy outcomes across pathologists. As a result, computational approaches are being sought and machine learning has been gaining momentum in the prediction of the Gleason grade group. To date, machine learning literature has addressed this problem by using features from magnetic resonance imaging images, whole slide images, tissue microarrays, gene expression data, and clinical features. However, there is a gap with regards to predicting the Gleason grade group using DNA sequences as the only input source to the machine learning models. In this work, using whole genome sequence data from South African prostate cancer patients, an application of machine learning and biological experiments were combined to understand the challenges that are associated with the prediction of the Gleason grade group. A series of machine learning binary classifiers (XGBoost, LSTM, GRU, LR, RF) were created only relying on DNA sequences input features. All the models were not able to adequately discriminate between the DNA sequences of the studied Gleason grade groups (Gleason grade group 1 and 5). However, the models were further evaluated in the prediction of tumor DNA sequences from matched-normal DNA sequences, given DNA sequences as the only input source. In this new problem, the models performed acceptably better than before with the XGBoost model achieving the highest accuracy of 74 ± 01, F1 score of 79 ± 01, recall of 99 ± 0.0, and precision of 66 ± 0.1.
doi_str_mv 10.1371/journal.pone.0267714
format article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2686269974</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A706532808</galeid><doaj_id>oai_doaj_org_article_adcfa77d380d43c6b1bac65189ccf6c9</doaj_id><sourcerecordid>A706532808</sourcerecordid><originalsourceid>FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13</originalsourceid><addsrcrecordid>eNqNk1uL1DAYhoso7rr6D0QLgujFjDm0OXghDKuuAwsLnm7D1zTtZGiTbtKK--9NndllRvZCAm1JnvdNv1OWPcdoiSnH77Z-Cg665eCdWSLCOMfFg-wUS0oWjCD68OD7JHsS4xahkgrGHmcntGRcEoFOs-6jjTrY3joYfbjJLzoD0bu8DVCb9PTTkEfbptMpmJj7Jh-CjyOMJtfgtAnv85XLYRg6q2G0SZmQHvTGOpMnr-Csa_PejBtfx6fZowa6aJ7t32fZj8-fvp9_WVxeXazPV5cLzSQZFwBaMNNoXRgsCimJLKuK1gJpxkiBhRBSAG0wrbgQhiANlUCooQXhaVWYnmUvd75D56PaJyoqwgQjTEpeJGK9I2oPWzWkBEC4UR6s-rvhQ6sgjFZ3RkGtG-C8pgLVBdWswhVoVmIhtW6Ylsnrw_62qepNrY0bA3RHpscnzm5U638piQUhkieDN3uD4K8nE0fVp6KYrgNn_DT_Ny94SRGdI3v1D3p_dHuqhRSAdY1P9-rZVK04YiVNpReJWt5DpVWb3urUVY1N-0eCt0eCxIzm99jCFKNaf_v6_-zVz2P29QG7MdCNm-i7aW6neAwWO1CnHozBNHdJxkjNQ3GbDTUPhdoPRZK9OCzQneh2CugfL_gIWQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2686269974</pqid></control><display><type>article</type><title>Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods</title><source>Publicly Available Content Database</source><source>PubMed Central</source><creator>Mokoatle, Mpho ; Mapiye, Darlington ; Marivate, Vukosi ; Hayes, Vanessa M ; Bornman, Riana</creator><contributor>V E, Sathishkumar</contributor><creatorcontrib>Mokoatle, Mpho ; Mapiye, Darlington ; Marivate, Vukosi ; Hayes, Vanessa M ; Bornman, Riana ; V E, Sathishkumar</creatorcontrib><description>One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of the pathologist and tends to suffer from the lack of reproducibility of biopsy outcomes across pathologists. As a result, computational approaches are being sought and machine learning has been gaining momentum in the prediction of the Gleason grade group. To date, machine learning literature has addressed this problem by using features from magnetic resonance imaging images, whole slide images, tissue microarrays, gene expression data, and clinical features. However, there is a gap with regards to predicting the Gleason grade group using DNA sequences as the only input source to the machine learning models. In this work, using whole genome sequence data from South African prostate cancer patients, an application of machine learning and biological experiments were combined to understand the challenges that are associated with the prediction of the Gleason grade group. A series of machine learning binary classifiers (XGBoost, LSTM, GRU, LR, RF) were created only relying on DNA sequences input features. All the models were not able to adequately discriminate between the DNA sequences of the studied Gleason grade groups (Gleason grade group 1 and 5). However, the models were further evaluated in the prediction of tumor DNA sequences from matched-normal DNA sequences, given DNA sequences as the only input source. In this new problem, the models performed acceptably better than before with the XGBoost model achieving the highest accuracy of 74 ± 01, F1 score of 79 ± 01, recall of 99 ± 0.0, and precision of 66 ± 0.1.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0267714</identifier><identifier>PMID: 35679280</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Awards &amp; honors ; Biology and Life Sciences ; Biopsy ; Colorectal cancer ; Computer and Information Sciences ; Computer applications ; Deep learning ; Deoxyribonucleic acid ; Diagnosis ; DNA ; DNA microarrays ; Evaluation ; Gene expression ; Gene sequencing ; Genetic aspects ; Genomes ; Learning algorithms ; Machine learning ; Magnetic resonance ; Magnetic resonance imaging ; Medical imaging ; Medical research ; Medicine and Health Sciences ; Modelling ; Mutation ; Neural networks ; Nucleotide sequence ; Predictions ; Probability distribution ; Prostate cancer ; Tumors</subject><ispartof>PloS one, 2022-06, Vol.17 (6), p.e0267714-e0267714</ispartof><rights>COPYRIGHT 2022 Public Library of Science</rights><rights>2022 Mokoatle et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 Mokoatle et al 2022 Mokoatle et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13</citedby><cites>FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13</cites><orcidid>0000-0001-7689-2840</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2686269974/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2686269974?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25732,27903,27904,36991,36992,44569,53769,53771,74872</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35679280$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>V E, Sathishkumar</contributor><creatorcontrib>Mokoatle, Mpho</creatorcontrib><creatorcontrib>Mapiye, Darlington</creatorcontrib><creatorcontrib>Marivate, Vukosi</creatorcontrib><creatorcontrib>Hayes, Vanessa M</creatorcontrib><creatorcontrib>Bornman, Riana</creatorcontrib><title>Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of the pathologist and tends to suffer from the lack of reproducibility of biopsy outcomes across pathologists. As a result, computational approaches are being sought and machine learning has been gaining momentum in the prediction of the Gleason grade group. To date, machine learning literature has addressed this problem by using features from magnetic resonance imaging images, whole slide images, tissue microarrays, gene expression data, and clinical features. However, there is a gap with regards to predicting the Gleason grade group using DNA sequences as the only input source to the machine learning models. In this work, using whole genome sequence data from South African prostate cancer patients, an application of machine learning and biological experiments were combined to understand the challenges that are associated with the prediction of the Gleason grade group. A series of machine learning binary classifiers (XGBoost, LSTM, GRU, LR, RF) were created only relying on DNA sequences input features. All the models were not able to adequately discriminate between the DNA sequences of the studied Gleason grade groups (Gleason grade group 1 and 5). However, the models were further evaluated in the prediction of tumor DNA sequences from matched-normal DNA sequences, given DNA sequences as the only input source. In this new problem, the models performed acceptably better than before with the XGBoost model achieving the highest accuracy of 74 ± 01, F1 score of 79 ± 01, recall of 99 ± 0.0, and precision of 66 ± 0.1.</description><subject>Awards &amp; honors</subject><subject>Biology and Life Sciences</subject><subject>Biopsy</subject><subject>Colorectal cancer</subject><subject>Computer and Information Sciences</subject><subject>Computer applications</subject><subject>Deep learning</subject><subject>Deoxyribonucleic acid</subject><subject>Diagnosis</subject><subject>DNA</subject><subject>DNA microarrays</subject><subject>Evaluation</subject><subject>Gene expression</subject><subject>Gene sequencing</subject><subject>Genetic aspects</subject><subject>Genomes</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Magnetic resonance</subject><subject>Magnetic resonance imaging</subject><subject>Medical imaging</subject><subject>Medical research</subject><subject>Medicine and Health Sciences</subject><subject>Modelling</subject><subject>Mutation</subject><subject>Neural networks</subject><subject>Nucleotide sequence</subject><subject>Predictions</subject><subject>Probability distribution</subject><subject>Prostate cancer</subject><subject>Tumors</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNqNk1uL1DAYhoso7rr6D0QLgujFjDm0OXghDKuuAwsLnm7D1zTtZGiTbtKK--9NndllRvZCAm1JnvdNv1OWPcdoiSnH77Z-Cg665eCdWSLCOMfFg-wUS0oWjCD68OD7JHsS4xahkgrGHmcntGRcEoFOs-6jjTrY3joYfbjJLzoD0bu8DVCb9PTTkEfbptMpmJj7Jh-CjyOMJtfgtAnv85XLYRg6q2G0SZmQHvTGOpMnr-Csa_PejBtfx6fZowa6aJ7t32fZj8-fvp9_WVxeXazPV5cLzSQZFwBaMNNoXRgsCimJLKuK1gJpxkiBhRBSAG0wrbgQhiANlUCooQXhaVWYnmUvd75D56PaJyoqwgQjTEpeJGK9I2oPWzWkBEC4UR6s-rvhQ6sgjFZ3RkGtG-C8pgLVBdWswhVoVmIhtW6Ylsnrw_62qepNrY0bA3RHpscnzm5U638piQUhkieDN3uD4K8nE0fVp6KYrgNn_DT_Ny94SRGdI3v1D3p_dHuqhRSAdY1P9-rZVK04YiVNpReJWt5DpVWb3urUVY1N-0eCt0eCxIzm99jCFKNaf_v6_-zVz2P29QG7MdCNm-i7aW6neAwWO1CnHozBNHdJxkjNQ3GbDTUPhdoPRZK9OCzQneh2CugfL_gIWQ</recordid><startdate>20220609</startdate><enddate>20220609</enddate><creator>Mokoatle, Mpho</creator><creator>Mapiye, Darlington</creator><creator>Marivate, Vukosi</creator><creator>Hayes, Vanessa M</creator><creator>Bornman, Riana</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-7689-2840</orcidid></search><sort><creationdate>20220609</creationdate><title>Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods</title><author>Mokoatle, Mpho ; Mapiye, Darlington ; Marivate, Vukosi ; Hayes, Vanessa M ; Bornman, Riana</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Awards &amp; honors</topic><topic>Biology and Life Sciences</topic><topic>Biopsy</topic><topic>Colorectal cancer</topic><topic>Computer and Information Sciences</topic><topic>Computer applications</topic><topic>Deep learning</topic><topic>Deoxyribonucleic acid</topic><topic>Diagnosis</topic><topic>DNA</topic><topic>DNA microarrays</topic><topic>Evaluation</topic><topic>Gene expression</topic><topic>Gene sequencing</topic><topic>Genetic aspects</topic><topic>Genomes</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Magnetic resonance</topic><topic>Magnetic resonance imaging</topic><topic>Medical imaging</topic><topic>Medical research</topic><topic>Medicine and Health Sciences</topic><topic>Modelling</topic><topic>Mutation</topic><topic>Neural networks</topic><topic>Nucleotide sequence</topic><topic>Predictions</topic><topic>Probability distribution</topic><topic>Prostate cancer</topic><topic>Tumors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mokoatle, Mpho</creatorcontrib><creatorcontrib>Mapiye, Darlington</creatorcontrib><creatorcontrib>Marivate, Vukosi</creatorcontrib><creatorcontrib>Hayes, Vanessa M</creatorcontrib><creatorcontrib>Bornman, Riana</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale_Opposing Viewpoints In Context</collection><collection>Science in Context</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing &amp; Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials Science Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mokoatle, Mpho</au><au>Mapiye, Darlington</au><au>Marivate, Vukosi</au><au>Hayes, Vanessa M</au><au>Bornman, Riana</au><au>V E, Sathishkumar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2022-06-09</date><risdate>2022</risdate><volume>17</volume><issue>6</issue><spage>e0267714</spage><epage>e0267714</epage><pages>e0267714-e0267714</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of the pathologist and tends to suffer from the lack of reproducibility of biopsy outcomes across pathologists. As a result, computational approaches are being sought and machine learning has been gaining momentum in the prediction of the Gleason grade group. To date, machine learning literature has addressed this problem by using features from magnetic resonance imaging images, whole slide images, tissue microarrays, gene expression data, and clinical features. However, there is a gap with regards to predicting the Gleason grade group using DNA sequences as the only input source to the machine learning models. In this work, using whole genome sequence data from South African prostate cancer patients, an application of machine learning and biological experiments were combined to understand the challenges that are associated with the prediction of the Gleason grade group. A series of machine learning binary classifiers (XGBoost, LSTM, GRU, LR, RF) were created only relying on DNA sequences input features. All the models were not able to adequately discriminate between the DNA sequences of the studied Gleason grade groups (Gleason grade group 1 and 5). However, the models were further evaluated in the prediction of tumor DNA sequences from matched-normal DNA sequences, given DNA sequences as the only input source. In this new problem, the models performed acceptably better than before with the XGBoost model achieving the highest accuracy of 74 ± 01, F1 score of 79 ± 01, recall of 99 ± 0.0, and precision of 66 ± 0.1.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>35679280</pmid><doi>10.1371/journal.pone.0267714</doi><tpages>e0267714</tpages><orcidid>https://orcid.org/0000-0001-7689-2840</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2022-06, Vol.17 (6), p.e0267714-e0267714
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_2686269974
source Publicly Available Content Database; PubMed Central
subjects Awards & honors
Biology and Life Sciences
Biopsy
Colorectal cancer
Computer and Information Sciences
Computer applications
Deep learning
Deoxyribonucleic acid
Diagnosis
DNA
DNA microarrays
Evaluation
Gene expression
Gene sequencing
Genetic aspects
Genomes
Learning algorithms
Machine learning
Magnetic resonance
Magnetic resonance imaging
Medical imaging
Medical research
Medicine and Health Sciences
Modelling
Mutation
Neural networks
Nucleotide sequence
Predictions
Probability distribution
Prostate cancer
Tumors
title Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T12%3A26%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Discriminatory%20Gleason%20grade%20group%20signatures%20of%20prostate%20cancer:%20An%20application%20of%20machine%20learning%20methods&rft.jtitle=PloS%20one&rft.au=Mokoatle,%20Mpho&rft.date=2022-06-09&rft.volume=17&rft.issue=6&rft.spage=e0267714&rft.epage=e0267714&rft.pages=e0267714-e0267714&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0267714&rft_dat=%3Cgale_plos_%3EA706532808%3C/gale_plos_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c692t-aac86efcc4e18499295bb3d80c6624188898a3f13b788e20cab800f3427272b13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2686269974&rft_id=info:pmid/35679280&rft_galeid=A706532808&rfr_iscdi=true