Loading…

Better Streaming Algorithms for the Maximum Coverage Problem

We study the classic NP-Hard problem of finding the maximum k -set coverage in the data stream model: given a set system of m sets that are subsets of a universe { 1 , ... , n } , find the k sets that cover the most number of distinct elements. The problem can be approximated up to a factor 1 − 1 /...

Full description

Saved in:
Bibliographic Details
Published in:Theory of computing systems 2019-10, Vol.63 (7), p.1595-1619
Main Authors: McGregor, Andrew, Vu, Hoa T.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We study the classic NP-Hard problem of finding the maximum k -set coverage in the data stream model: given a set system of m sets that are subsets of a universe { 1 , ... , n } , find the k sets that cover the most number of distinct elements. The problem can be approximated up to a factor 1 − 1 / e in polynomial time. In the streaming-set model, the sets and their elements are revealed online. The main goal of our work is to design algorithms, with approximation guarantees as close as possible to 1 − 1 / e , that use sublinear space o ( mn ) . Our main results are: Two ( 1 − 1 / e − ε ) approximation algorithms: One uses O ( ε − 1 ) passes and Õ ( ε − 2 k ) space whereas the other uses only a single pass but Õ ( ε − 2 m ) space. Õ ( ⋅ ) suppresses polylog factors. We show that any approximation factor better than ( 1 − ( 1 − 1 / k ) k ) ≈ 1 − 1 / e in constant passes requires Ω ( m ) space for constant k even if the algorithm is allowed unbounded processing time. We also demonstrate a single-pass , ( 1 − ε ) approximation algorithm using Õ ε − 2 m ⋅ min ( k , ε − 1 ) space. We also study the maximum k -vertex coverage problem in the dynamic graph stream model. In this model, the stream consists of edge insertions and deletions of a graph on N vertices. The goal is to find k vertices that cover the most number of distinct edges. We show that any constant approximation in constant passes requires Ω ( N ) space for constant k whereas Õ ( ε − 2 N ) space is sufficient for a ( 1 − ε ) approximation and arbitrary k in a single pass. For regular graphs, we show that Õ ( ε − 3 k ) space is sufficient for a ( 1 − ε ) approximation in a single pass. We generalize this to a ( κ − ε ) approximation when the ratio between the minimum and maximum degree is bounded below by κ .
ISSN:1432-4350
1433-0490
DOI:10.1007/s00224-018-9878-x