Loading…
Empirical calibration of time series monitoring methods using receiver operating characteristic curves
Time series monitoring methods, such as the Brown and Trigg methods, have the purpose of detecting pattern breaks (or “signals”) in time series data reliably and in a timely fashion. Traditionally, researchers have used the average run length (ARL) statistic on results from generated signal occurren...
Saved in:
Published in: | International journal of forecasting 2009-07, Vol.25 (3), p.484-497 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Time series monitoring methods, such as the Brown and Trigg methods, have the purpose of detecting pattern breaks (or “signals”) in time series data reliably and in a timely fashion. Traditionally, researchers have used the average run length (ARL) statistic on results from generated signal occurrences in simulated time series data to calibrate and evaluate these methods, with a focus on timeliness of signal detection. This paper investigates the receiver operating characteristic (ROC) framework, well-known in the diagnostic decision making literature, as an alternative to ARL analysis for time series monitoring methods. ROC analysis traditionally uses real data to address the inherent tradeoff in signal detection between the true and false positive rates when varying control limits. We illustrate ROC analysis using time series data on crime at the patrol district level in two cities, and use the concept of Pareto frontier ROC curves and reverse functions for methods such as Brown’s and Trigg’s that have parameters affecting signal-detection performance. We compare the Brown and Trigg methods to three benchmark methods, including one commonly used in practice. The Brown and Trigg methods collapse to the same simple method on the Pareto frontier and dominate the benchmark methods under most conditions. The worst method is the one commonly used in practice. |
---|---|
ISSN: | 0169-2070 1872-8200 |
DOI: | 10.1016/j.ijforecast.2008.11.007 |