Loading…

RADCURE: An open‐source head and neck cancer CT dataset for clinical radiation therapy insights

Purpose This manuscript presents RADCURE, one of the most extensive head and neck cancer (HNC) imaging datasets accessible to the public. Initially collected for clinical radiation therapy (RT) treatment planning, this dataset has been retrospectively reconstructed for use in imaging research. Acqui...

Full description

Saved in:
Bibliographic Details
Published in:Medical physics (Lancaster) 2024-04, Vol.51 (4), p.3101-3109
Main Authors: Welch, Mattea L., Kim, Sejin, Hope, Andrew J., Huang, Shao Hui, Lu, Zhibin, Marsilla, Joseph, Kazmierski, Michal, Rey‐McIntyre, Katrina, Patel, Tirth, O'Sullivan, Brian, Waldron, John, Bratman, Scott, Haibe‐Kains, Benjamin, Tadic, Tony
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Purpose This manuscript presents RADCURE, one of the most extensive head and neck cancer (HNC) imaging datasets accessible to the public. Initially collected for clinical radiation therapy (RT) treatment planning, this dataset has been retrospectively reconstructed for use in imaging research. Acquisition and Validation Methods RADCURE encompasses data from 3346 patients, featuring computed tomography (CT) RT simulation images with corresponding target and organ‐at‐risk contours. These CT scans were collected using systems from three different manufacturers. Standard clinical imaging protocols were followed, and contours were manually generated and reviewed at weekly RT quality assurance rounds. RADCURE imaging and structure set data was extracted from our institution's radiation treatment planning and oncology information systems using a custom‐built data mining and processing system. Furthermore, images were linked to our clinical anthology of outcomes data for each patient and includes demographic, clinical and treatment information based on the 7th edition TNM staging system (Tumor‐Node‐Metastasis Classification System of Malignant Tumors). The median patient age is 63, with the final dataset including 80% males. Half of the cohort is diagnosed with oropharyngeal cancer, while laryngeal, nasopharyngeal, and hypopharyngeal cancers account for 25%, 12%, and 5% of cases, respectively. The median duration of follow‐up is five years, with 60% of the cohort surviving until the last follow‐up point. Data Format and Usage Notes The dataset provides images and contours in DICOM CT and RT‐STRUCT formats, respectively. We have standardized the nomenclature for individual contours—such as the gross primary tumor, gross nodal volumes, and 19 organs‐at‐risk—to enhance the RT‐STRUCT files’ utility. Accompanying demographic, clinical, and treatment data are supplied in a comma‐separated values (CSV) file format. This comprehensive dataset is publicly accessible via The Cancer Imaging Archive. Potential Applications RADCURE's amalgamation of imaging, clinical, demographic, and treatment data renders it an invaluable resource for a broad spectrum of radiomics image analysis research endeavors. Researchers can utilize this dataset to advance routine clinical procedures using machine learning or artificial intelligence, to identify new non‐invasive biomarkers, or to forge prognostic models.
ISSN:0094-2405
2473-4209
DOI:10.1002/mp.16972