Loading…

Sliding window based weighted maximal frequent pattern mining over data streams

•We introduce a novel algorithm mining WMFPs with only one scan over sliding window-based data stream environment.•We also provide a strategy which can prune unnecessary operations causing meaningless pattern generation in single paths.•In performance evaluation, we show that our approach presents b...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2014-02, Vol.41 (2), p.694-708
Main Authors: Lee, Gangin, Yun, Unil, Ryu, Keun Ho
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•We introduce a novel algorithm mining WMFPs with only one scan over sliding window-based data stream environment.•We also provide a strategy which can prune unnecessary operations causing meaningless pattern generation in single paths.•In performance evaluation, we show that our approach presents better performance than previous algorithms. As data have been accumulated more quickly in recent years, corresponding databases have also become huger, and thus, general frequent pattern mining methods have been faced with limitations that do not appropriately respond to the massive data. To overcome this problem, data mining researchers have studied methods which can conduct more efficient and immediate mining tasks by scanning databases only once. Thereafter, the sliding window model, which can perform mining operations focusing on recently accumulated parts over data streams, was proposed, and a variety of mining approaches related to this have been suggested. However, it is hard to mine all of the frequent patterns in the data stream environment since generated patterns are remarkably increased as data streams are continuously extended. Thus, methods for efficiently compressing generated patterns are needed in order to solve that problem. In addition, since not only support conditions but also weight constraints expressing items’ importance are one of the important factors in the pattern mining, we need to consider them in mining process. Motivated by these issues, we propose a novel algorithm, weighted maximal frequent pattern mining over data streams based on sliding window model (WMFP-SW) to obtain weighted maximal frequent patterns reflecting recent information over data streams. Performance experiments report that MWFP-SW outperforms previous algorithms in terms of runtime, memory usage, and scalability.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2013.07.094