Loading…

Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations

PurposeRapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) i...

Full description

Saved in:
Bibliographic Details
Published in:Polish journal of radiology 2023, Vol.88, p.430-434
Main Authors: Kufel, Jakub, Paszkiewicz, Iga, Bielówka, Michał, Bartnikowska, Wiktoria, Janik, Michał, Stencel, Magdalena, Czogalik, Łukasz, Gruszczyńska, Katarzyna, Mielcarska, Sylwia
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403
cites cdi_FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403
container_end_page 434
container_issue
container_start_page 430
container_title Polish journal of radiology
container_volume 88
creator Kufel, Jakub
Paszkiewicz, Iga
Bielówka, Michał
Bartnikowska, Wiktoria
Janik, Michał
Stencel, Magdalena
Czogalik, Łukasz
Gruszczyńska, Katarzyna
Mielcarska, Sylwia
description PurposeRapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) in radiology and imaging diagnostics within Polish education system. Additionally, the study aimed to identify the strengths and limitations of the model through a detailed analysis of issues raised by exam questions.Material and methodsThe present study utilized a PES exam consisting of 120 questions, provided by Medical Exami-nations Center in Lodz. Questions were administered using openai.com platform that grants free access to GPT-3.5 model. All questions were categorized according to Bloom's taxonomy to assess their complexity and difficulty. Following the answer to each exam question, ChatGPT was asked to rate its confidence on a scale of 1 to 5 to evaluate the accuracy of its response.ResultsChatGPT did not reach the pass rate threshold of PES exam (52%); however, it was close in certain question categories. No significant differences were observed in the percentage of correct answers across question types and sub-types.ConclusionsThe performance of the ChatGPT model in the pass rate of PES exam in radiology and imaging diagnostics in Poland is yet to be determined, which requires further research on improved versions of ChatGPT.
doi_str_mv 10.5114/pjr.2023.131215
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10551734</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2874838108</sourcerecordid><originalsourceid>FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403</originalsourceid><addsrcrecordid>eNpVkU1LxDAQhosoKOrZa45edk2afqQnkcWPBcE9rHgMs2nSRtKkZrLi_nurK6JzmYF5eGfgybILRuclY8XV-BrnOc35nHGWs_IgO2GiaWa0qerDP_Nxdo74SqeqGK-K4iT7eLHOkUUP6X61JiMgktRrsgrOYk9w1MqCSzuiP2Ag1pMIrQ0udDsCviWthc4HTFYRO0BnfXdNlh5t1yec6BQIpqh9l3r85p0dbIJkg8ez7MiAQ33-00-z57vb9eJh9vh0v1zcPM4Ur1ma5YIDzU21aUzDtRKqaY2oTStUqTinut40ZVFv6rxSUEHVGkXB6Aky1ChWUH6aXe9zx-1m0K3SPkVwcozTw3EnA1j5f-NtL7vwLhktS1bzYkq4_EmI4W2rMcnBotLOgddhizIXdSG4YFRM6NUeVTEgRm1-7zAqv0TJSZT8EiX3ovgnGDeK4w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2874838108</pqid></control><display><type>article</type><title>Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations</title><source>PubMed Central</source><creator>Kufel, Jakub ; Paszkiewicz, Iga ; Bielówka, Michał ; Bartnikowska, Wiktoria ; Janik, Michał ; Stencel, Magdalena ; Czogalik, Łukasz ; Gruszczyńska, Katarzyna ; Mielcarska, Sylwia</creator><creatorcontrib>Kufel, Jakub ; Paszkiewicz, Iga ; Bielówka, Michał ; Bartnikowska, Wiktoria ; Janik, Michał ; Stencel, Magdalena ; Czogalik, Łukasz ; Gruszczyńska, Katarzyna ; Mielcarska, Sylwia</creatorcontrib><description>PurposeRapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) in radiology and imaging diagnostics within Polish education system. Additionally, the study aimed to identify the strengths and limitations of the model through a detailed analysis of issues raised by exam questions.Material and methodsThe present study utilized a PES exam consisting of 120 questions, provided by Medical Exami-nations Center in Lodz. Questions were administered using openai.com platform that grants free access to GPT-3.5 model. All questions were categorized according to Bloom's taxonomy to assess their complexity and difficulty. Following the answer to each exam question, ChatGPT was asked to rate its confidence on a scale of 1 to 5 to evaluate the accuracy of its response.ResultsChatGPT did not reach the pass rate threshold of PES exam (52%); however, it was close in certain question categories. No significant differences were observed in the percentage of correct answers across question types and sub-types.ConclusionsThe performance of the ChatGPT model in the pass rate of PES exam in radiology and imaging diagnostics in Poland is yet to be determined, which requires further research on improved versions of ChatGPT.</description><identifier>ISSN: 1899-0967</identifier><identifier>ISSN: 1733-134X</identifier><identifier>EISSN: 1899-0967</identifier><identifier>DOI: 10.5114/pjr.2023.131215</identifier><language>eng</language><publisher>Termedia Publishing House</publisher><subject>Original Paper</subject><ispartof>Polish journal of radiology, 2023, Vol.88, p.430-434</ispartof><rights>Pol J Radiol 2023 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403</citedby><cites>FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551734/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551734/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,4024,27923,27924,27925,53791,53793</link.rule.ids></links><search><creatorcontrib>Kufel, Jakub</creatorcontrib><creatorcontrib>Paszkiewicz, Iga</creatorcontrib><creatorcontrib>Bielówka, Michał</creatorcontrib><creatorcontrib>Bartnikowska, Wiktoria</creatorcontrib><creatorcontrib>Janik, Michał</creatorcontrib><creatorcontrib>Stencel, Magdalena</creatorcontrib><creatorcontrib>Czogalik, Łukasz</creatorcontrib><creatorcontrib>Gruszczyńska, Katarzyna</creatorcontrib><creatorcontrib>Mielcarska, Sylwia</creatorcontrib><title>Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations</title><title>Polish journal of radiology</title><description>PurposeRapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) in radiology and imaging diagnostics within Polish education system. Additionally, the study aimed to identify the strengths and limitations of the model through a detailed analysis of issues raised by exam questions.Material and methodsThe present study utilized a PES exam consisting of 120 questions, provided by Medical Exami-nations Center in Lodz. Questions were administered using openai.com platform that grants free access to GPT-3.5 model. All questions were categorized according to Bloom's taxonomy to assess their complexity and difficulty. Following the answer to each exam question, ChatGPT was asked to rate its confidence on a scale of 1 to 5 to evaluate the accuracy of its response.ResultsChatGPT did not reach the pass rate threshold of PES exam (52%); however, it was close in certain question categories. No significant differences were observed in the percentage of correct answers across question types and sub-types.ConclusionsThe performance of the ChatGPT model in the pass rate of PES exam in radiology and imaging diagnostics in Poland is yet to be determined, which requires further research on improved versions of ChatGPT.</description><subject>Original Paper</subject><issn>1899-0967</issn><issn>1733-134X</issn><issn>1899-0967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpVkU1LxDAQhosoKOrZa45edk2afqQnkcWPBcE9rHgMs2nSRtKkZrLi_nurK6JzmYF5eGfgybILRuclY8XV-BrnOc35nHGWs_IgO2GiaWa0qerDP_Nxdo74SqeqGK-K4iT7eLHOkUUP6X61JiMgktRrsgrOYk9w1MqCSzuiP2Ag1pMIrQ0udDsCviWthc4HTFYRO0BnfXdNlh5t1yec6BQIpqh9l3r85p0dbIJkg8ez7MiAQ33-00-z57vb9eJh9vh0v1zcPM4Ur1ma5YIDzU21aUzDtRKqaY2oTStUqTinut40ZVFv6rxSUEHVGkXB6Aky1ChWUH6aXe9zx-1m0K3SPkVwcozTw3EnA1j5f-NtL7vwLhktS1bzYkq4_EmI4W2rMcnBotLOgddhizIXdSG4YFRM6NUeVTEgRm1-7zAqv0TJSZT8EiX3ovgnGDeK4w</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Kufel, Jakub</creator><creator>Paszkiewicz, Iga</creator><creator>Bielówka, Michał</creator><creator>Bartnikowska, Wiktoria</creator><creator>Janik, Michał</creator><creator>Stencel, Magdalena</creator><creator>Czogalik, Łukasz</creator><creator>Gruszczyńska, Katarzyna</creator><creator>Mielcarska, Sylwia</creator><general>Termedia Publishing House</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>2023</creationdate><title>Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations</title><author>Kufel, Jakub ; Paszkiewicz, Iga ; Bielówka, Michał ; Bartnikowska, Wiktoria ; Janik, Michał ; Stencel, Magdalena ; Czogalik, Łukasz ; Gruszczyńska, Katarzyna ; Mielcarska, Sylwia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Original Paper</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kufel, Jakub</creatorcontrib><creatorcontrib>Paszkiewicz, Iga</creatorcontrib><creatorcontrib>Bielówka, Michał</creatorcontrib><creatorcontrib>Bartnikowska, Wiktoria</creatorcontrib><creatorcontrib>Janik, Michał</creatorcontrib><creatorcontrib>Stencel, Magdalena</creatorcontrib><creatorcontrib>Czogalik, Łukasz</creatorcontrib><creatorcontrib>Gruszczyńska, Katarzyna</creatorcontrib><creatorcontrib>Mielcarska, Sylwia</creatorcontrib><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Polish journal of radiology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kufel, Jakub</au><au>Paszkiewicz, Iga</au><au>Bielówka, Michał</au><au>Bartnikowska, Wiktoria</au><au>Janik, Michał</au><au>Stencel, Magdalena</au><au>Czogalik, Łukasz</au><au>Gruszczyńska, Katarzyna</au><au>Mielcarska, Sylwia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations</atitle><jtitle>Polish journal of radiology</jtitle><date>2023</date><risdate>2023</risdate><volume>88</volume><spage>430</spage><epage>434</epage><pages>430-434</pages><issn>1899-0967</issn><issn>1733-134X</issn><eissn>1899-0967</eissn><abstract>PurposeRapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) in radiology and imaging diagnostics within Polish education system. Additionally, the study aimed to identify the strengths and limitations of the model through a detailed analysis of issues raised by exam questions.Material and methodsThe present study utilized a PES exam consisting of 120 questions, provided by Medical Exami-nations Center in Lodz. Questions were administered using openai.com platform that grants free access to GPT-3.5 model. All questions were categorized according to Bloom's taxonomy to assess their complexity and difficulty. Following the answer to each exam question, ChatGPT was asked to rate its confidence on a scale of 1 to 5 to evaluate the accuracy of its response.ResultsChatGPT did not reach the pass rate threshold of PES exam (52%); however, it was close in certain question categories. No significant differences were observed in the percentage of correct answers across question types and sub-types.ConclusionsThe performance of the ChatGPT model in the pass rate of PES exam in radiology and imaging diagnostics in Poland is yet to be determined, which requires further research on improved versions of ChatGPT.</abstract><pub>Termedia Publishing House</pub><doi>10.5114/pjr.2023.131215</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1899-0967
ispartof Polish journal of radiology, 2023, Vol.88, p.430-434
issn 1899-0967
1733-134X
1899-0967
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10551734
source PubMed Central
subjects Original Paper
title Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T21%3A25%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Will%20ChatGPT%20pass%20the%20Polish%20specialty%20exam%20in%20radiology%20and%20diagnostic%20imaging?%20Insights%20into%20strengths%20and%20limitations&rft.jtitle=Polish%20journal%20of%20radiology&rft.au=Kufel,%20Jakub&rft.date=2023&rft.volume=88&rft.spage=430&rft.epage=434&rft.pages=430-434&rft.issn=1899-0967&rft.eissn=1899-0967&rft_id=info:doi/10.5114/pjr.2023.131215&rft_dat=%3Cproquest_pubme%3E2874838108%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c371t-283a02f6b9f93ec8c9df87fd8c5c330e7b9547b726ca6a6dfc0afe9dff0fc1403%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2874838108&rft_id=info:pmid/&rfr_iscdi=true