Loading…

DataSynth: generating synthetic data using declarative constraints

A variety of scenarios such as database system and application testing, data masking, and benchmarking require synthetic database instances, often having complex data characteristics. We present DataSynth , a flexible tool for generating synthetic databases. DataSynth uses a simple and powerful decl...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings of the VLDB Endowment 2011-08, Vol.4 (12), p.1418-1421
Main Authors: Arasu, Arvind, Kaushik, Raghav, Li, Jian
Format: Article
Language:English
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A variety of scenarios such as database system and application testing, data masking, and benchmarking require synthetic database instances, often having complex data characteristics. We present DataSynth , a flexible tool for generating synthetic databases. DataSynth uses a simple and powerful declarative abstraction based on cardinality constraints to specify data characteristics, and uses sophisticated algorithms to efficiently generate database instances satisfying the specified characteristics. The demo will showcase various features of DataSynth using two real-world data generation scenarios.
ISSN:2150-8097
2150-8097
DOI:10.14778/3402755.3402785