Loading…

A protocol for the Hamilton Rating Scale for Depression: Item scoring rules, Rater training, and outcome accuracy with data on its application in a clinical trial

Abstract Background We present a fully articulated protocol for the Hamilton Rating Scale for Depression (HAM-D), including item scoring rules, rater training procedures, and a data management algorithm to increase accuracy of scores prior to outcome analyses. The latter involves identifying potenti...

Full description

Saved in:
Bibliographic Details
Published in:Journal of affective disorders 2016-08, Vol.200, p.111-118
Main Authors: Rohan, Kelly J, Rough, Jennifer N, Evans, Maggie, Ho, Sheau-Yan, Meyerhoff, Jonah, Roberts, Lorinda M, Vacek, Pamela M
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Background We present a fully articulated protocol for the Hamilton Rating Scale for Depression (HAM-D), including item scoring rules, rater training procedures, and a data management algorithm to increase accuracy of scores prior to outcome analyses. The latter involves identifying potentially inaccurate scores as interviews with discrepancies between two independent raters on the basis of either scores >=5-point difference) or meeting threshold for depression recurrence status, a long-term treatment outcome with public health significance. Discrepancies are resolved by assigning two new raters, identifying items with disagreement per an algorithm, and reaching consensus on the most accurate scores for those items. Methods These methods were applied in a clinical trial where the primary outcome was the Structured Interview Guide for the Hamilton Rating Scale for Depression–Seasonal Affective Disorder version (SIGH-SAD), which includes the 21-item HAM-D and 8 items assessing atypical symptoms. 177 seasonally depressed adult patients were enrolled and interviewed at 10 time points across treatment and the 2-year followup interval for a total of 1589 completed interviews with 1535 (96.6%) archived. Results Inter-rater reliability ranged from ICCs of .923–.967. Only 86 (5.6%) interviews met criteria for a between-rater discrepancy. HAM-D items “Depressed Mood”, “Work and Activities”, “Middle Insomnia”, and “Hypochondriasis” and Atypical items “Fatigability” and “Hypersomnia” contributed most to discrepancies. Limitations Generalizability beyond well-trained, experienced raters in a clinical trial is unknown. Conclusions Researchers might want to consider adopting this protocol in part or full. Clinicians might want to tailor it to their needs.
ISSN:0165-0327
1573-2517
DOI:10.1016/j.jad.2016.01.051