Loading…
Reproducibility of the diagnosis of dysplasia in Barrett esophagus: A reaffirmation
Morphologic assessment of dysplasia in Barrett esophagus, despite limitations, remains the basis of treatment. We rigorously tested modified 1988 criteria, assessing intraobserver and interobserver reproducibility. Participants submitted slides of Barrett mucosa negative (BE) and indefinite (IND) fo...
Saved in:
Published in: | Human pathology 2001-04, Vol.32 (4), p.368-378 |
---|---|
Main Authors: | , , , , , , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Morphologic assessment of dysplasia in Barrett esophagus, despite limitations, remains the basis of treatment. We rigorously tested modified 1988 criteria, assessing intraobserver and interobserver reproducibility. Participants submitted slides of Barrett mucosa negative (BE) and indefinite (IND) for dysplasia, with low-grade dysplasia (LGD) and high-grade dysplasia (HGD), and with carcinoma. Two hundred fifty slides were divided into 2 groups. The first 125 slides were reviewed, without knowledge of the prior diagnoses, on 2 occasions by 12 gastrointestinal pathologists without prior discussion of criteria. Results were analyzed by κ statistics, which correct for agreement by chance. A consensus meeting was then held, establishing, by group review of the index 125 slides, the criteria outlined herein. The second 125-slide set was then reviewed twice by each of the same 12 pathologists, and follow-up κ statistics were calculated. When statistical analysis was performed using 2 broad diagnostic categories (BE, IND, and LG v HG and carcinoma), intraobserver agreement was near perfect both before and after the consensus meeting (mean κ = 0.82 and 0.80). Interobserver agreement was substantial (κ = 0.66) and improved after the consensus meeting (κ = 0.70; P =.02). When statistical analysis was performed using 4 clinically relevant separations (BE; IND and LGD; HGD; carcinoma), mean intraobserver κ improved from 0.64 to 0.68 (both substantial) after the consensus meeting, and mean interobserver κ improved from 0.43 to 0.46 (both moderate agreement). When statistical analysis was performed using 4 diagnostic categories that required distinction between LGD and IND (BE; IND; LGD; HGD and carcinoma), the pre-consensus meeting mean intraobserver κ was 0.60 (substantial agreement), improving to 0.65 after the meeting (P |
---|---|
ISSN: | 0046-8177 1532-8392 |
DOI: | 10.1053/hupa.2001.23510 |