Loading…

The interobserver reliability of clinical relevance in medical research

•There is weak interobserver reliability of interpreting clinical difference based on observed difference and the confidence interval.•The interpretation of the p-value did not differentiate physician reported clinical relevance.•The novel OD/CI ratio, showed a significant difference for differentia...

Full description

Saved in:

Bibliographic Details
Published in:	Injury 2023-05, Vol.54, p.S66-S68
Main Authors:	Strage, Katya, Stacey, Stephen, Mauffrey, Cyril, Parry, Joshua A
Format:	Article
Language:	English
Subjects:	Biomedical Research Clinical Relevance Humans Reproducibility of Results Sensitivity and Specificity
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	•There is weak interobserver reliability of interpreting clinical difference based on observed difference and the confidence interval.•The interpretation of the p-value did not differentiate physician reported clinical relevance.•The novel OD/CI ratio, showed a significant difference for differentiating clinical relevance.•The OD/CI maximized the sensitivity (SN) and specificity (SP) for identifying clinical relevance. A measure of effect size, such as observed difference (OD) and its 95% confidence interval (CI), is necessary to determine clinical relevance (CR) of research findings. The purpose of this paper is to (1) determine the interobserver reliability (IOR) of determining CR when presented with only the OD and CI and (2) to determine if a ratio of OD over CI (OD/CI) had a stronger association with CR than the p-value. A survey including the OD and CI results from 21 studies was sent to 36 physicians, of which 21 responded. Respondents were asked to determine if the results were clinically relevant or not clinically relevant. Twenty-one (58%) physicians responded. The IOR of interpreting CR based on OD and the CI was weak (kappa=0.13, CI 0.10 to 0.15). The p-value did not differ between CR and non-CR results (median difference -0.001, CI -0.005 to 0.0, p = 0.07). The OD/CI however, was greater for CR vs. non-CR results (median difference 0.5, CI 0.09 to 0.95, p = 0.02). The area under the curve for the p-value and OD/CI receiver-operator characteristic curve was 0.70 and 0.80. The p-value and OD/CI that maximized the sensitivity (SN) and specificity (SP) for identifying CR was 0.001 (SN 88%, SP 59%) and 0.95 (SN 88%, SP 84%). Determining CR from the OD and CI alone had weak interobserver reliability. The OD/CI ratio had a stronger association with CR than the p-value making it potentially useful in evaluating the CR of research findings.
ISSN:	0020-1383 1879-0267
DOI:	10.1016/j.injury.2021.12.044