Loading…

A cross-sectional comparative study: ChatGPT 3.5 versus diverse levels of medical experts in the diagnosis of ENT diseases

Purpose With recent advances in artificial intelligence (AI), it has become crucial to thoroughly evaluate its applicability in healthcare. This study aimed to assess the accuracy of ChatGPT in diagnosing ear, nose, and throat (ENT) pathology, and comparing its performance to that of medical experts...

Full description

Saved in:

Bibliographic Details
Published in:	European archives of oto-rhino-laryngology 2024-05, Vol.281 (5), p.2717-2721
Main Authors:	Makhoul, Mikhael, Melkane, Antoine E., Khoury, Patrick El, Hadi, Christopher El, Matar, Nayla
Format:	Article
Language:	English
Subjects:	Head and Neck Surgery Medicine Medicine & Public Health Miscellaneous Neurosurgery Otorhinolaryngology
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Purpose With recent advances in artificial intelligence (AI), it has become crucial to thoroughly evaluate its applicability in healthcare. This study aimed to assess the accuracy of ChatGPT in diagnosing ear, nose, and throat (ENT) pathology, and comparing its performance to that of medical experts. Methods We conducted a cross-sectional comparative study where 32 ENT cases were presented to ChatGPT 3.5, ENT physicians, ENT residents, family medicine (FM) specialists, second-year medical students (Med2), and third-year medical students (Med3). Each participant provided three differential diagnoses. The study analyzed diagnostic accuracy rates and inter-rater agreement within and between participant groups and ChatGPT. Results The accuracy rate of ChatGPT was 70.8%, being not significantly different from ENT physicians or ENT residents. However, a significant difference in correctness rate existed between ChatGPT and FM specialists (49.8%, p
ISSN:	0937-4477 1434-4726
DOI:	10.1007/s00405-024-08509-z