Loading…
General incremental sliding-window aggregation
Stream processing is gaining importance as more data becomes available in the form of continuous streams and companies compete to promptly extract insights from them. In such applications, sliding-window aggregation is a central operator, and incremental aggregation helps avoid the performance penal...
Saved in:
Published in: | Proceedings of the VLDB Endowment 2015-02, Vol.8 (7), p.702-713 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Stream processing is gaining importance as more data becomes available in the form of continuous streams and companies compete to promptly extract insights from them. In such applications, sliding-window aggregation is a central operator, and incremental aggregation helps avoid the performance penalty of re-aggregating from scratch for each window change.
This paper presents Reactive Aggregator (RA), a new framework for incremental sliding-window aggregation. RA is general in that it does not require aggregation functions to be invertible or commutative, and it does not require windows to be FIFO. We implemented RA as a drop-in replacement for the Aggregate operator of a commercial streaming engine. Given
m
updates on a window of size
n
, RA has an algorithmic complexity of
O
(
m
+
m
log (
n/m
)), rivaling the best prior algorithms for any
m
. Furthermore, RA's implementation minimizes overheads from allocation and pointer traversals by using a single flat array. |
---|---|
ISSN: | 2150-8097 2150-8097 |
DOI: | 10.14778/2752939.2752940 |