Loading…
Detection and localisation of hip fractures on anteroposterior radiographs with artificial intelligence: proof of concept
To investigate the feasibility of applying a deep convolutional neural network (CNN) for detection/localisation of acute proximal femoral fractures (APFFs) on hip radiographs. This study had institutional review board approval. Radiographs of 307 patients with APFFs and 310 normal patients were iden...
Saved in:
Published in: | Clinical radiology 2020-03, Vol.75 (3), p.237.e1-237.e9 |
---|---|
Main Authors: | , , , , , , , , , , |
Format: | Article |
Language: | English |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | To investigate the feasibility of applying a deep convolutional neural network (CNN) for detection/localisation of acute proximal femoral fractures (APFFs) on hip radiographs.
This study had institutional review board approval. Radiographs of 307 patients with APFFs and 310 normal patients were identified. A split ratio of 3/1/1 was used to create training, validation, and test datasets. To test the validity of the proposed model, a 20-fold cross-validation was performed. The anonymised images from the test cohort were shown to two groups of radiologists: musculoskeletal radiologists and diagnostic radiology residents. Each reader was asked to assess if there was a fracture and localise it if one was detected. The area under the receiver operator characteristics curve (AUC), sensitivity, and specificity were calculated for the CNN and readers.
The mean AUC was 0.9944 with a standard deviation of 0.0036. Mean sensitivity and specificity for fracture detection was 97.1% (81.5/84) and 96.7% (118/122), respectively. There was good concordance with saliency maps for lesion identification, but sensitivity was lower for characterising location (subcapital/transcervical, 84.1%; basicervical/intertrochanteric, 77%; subtrochanteric, 20%). Musculoskeletal radiologists showed a sensitivity and specificity for fracture detection of 100% and 100% respectively, while residents showed 100% and 96.8%, respectively. For fracture localisation, the performance decreased slightly for human readers.
The proposed CNN algorithm showed high accuracy for detection of APFFs, but the performance was lower for fracture localisation. Overall performance of the CNN was lower than that of radiologists, especially in localizing fracture location.
•Hip fractures cause increased morbidity/mortality, and a high economic burden.•People > 65 years who are vulnerable to fractures are expected to double by 2030.•Application of artificial intelligence for the detection of hip fractures is feasible.•A CNN algorithm can achieve a high level of accuracy for fracture localization.•AI may facilitate earlier fracture detection, and delivery of timely patient care. |
---|---|
ISSN: | 0009-9260 1365-229X |
DOI: | 10.1016/j.crad.2019.10.022 |