An overview of unsupervised drift detection methods

Bibliographic Details
Published in:Wiley interdisciplinary reviews. Data mining and knowledge discovery 2020-11, Vol.10 (6), p.e1381-n/a
Main Authors: Gemaque, Rosana Noronha, Costa, Albert França Josuá, Giusti, Rafael, Santos, Eulanda Miranda
Format: Article
Language:English
Description
Summary: Practical applications involving big data, such as weather monitoring, identification of customer preferences, Internet log analysis, and sensor warnings, require challenging data analysis, since these are examples of problems whose data are generated in streams and usually demand real‐time analytics. Patterns in such data stream problems may change quickly. Consequently, machine learning models that operate in this context must be updated over time. This phenomenon is called concept drift in the machine learning and data mining literature. Several different directions have been pursued to learn from data streams and to deal with concept drift. However, most drift detection methods assume that an instance's class label is available right after its prediction, since these methods work by monitoring the prediction results of a base classifier or an ensemble of classifiers. This assumption is unrealistic in several practical problems. To cope with it, some works focus on proposing efficient unsupervised or semi‐supervised concept drift detectors. While recent overview papers dedicated to supervised drift detectors have been published, the same is not true of unsupervised methods. Therefore, this work presents a comprehensive overview of approaches that tackle concept drift in classification problems in an unsupervised manner. An additional contribution is a proposed taxonomy of state‐of‐the‐art approaches for concept drift detection based on unsupervised strategies. This article is categorized under: Technologies > Classification; Technologies > Machine Learning.
Proposed taxonomy of unsupervised concept drift detection methods.
ISSN:1942-4787, 1942-4795
DOI:10.1002/widm.1381