
An analysis of query-agnostic sampling for interactive data exploration

Bibliographic Details
Published in: Communications in Statistics. Theory and Methods, 2018-08, Vol. 47 (16), p. 3820-3837
Main Authors: Liu, Wenzhao; Diao, Yanlei; Liu, Anna
Format: Article
Language: English
Description
Summary: Data analysts often explore a large database to identify the data of interest, but may not be able to specify the exact query to send to the database. Manual data exploration is labor-intensive and time-consuming. In the new paradigm of system-aided interactive data exploration, the Database Management System presents samples to the user and engages the user in an interactive exploration process to identify the user's interest. In this article, we examine a number of initial sampling techniques for identifying at least one positive (i.e., interesting) sample and compare them both theoretically and empirically.
ISSN: 0361-0926, 1532-415X
DOI: 10.1080/03610926.2017.1363231