Loading…

GraphScope: a unified engine for big graph processing

GraphScope is a system and a set of language extensions that enable a new programming interface for large-scale distributed graph computing. It generalizes previous graph processing frameworks (e.g. , Pregel, GraphX) and distributed graph databases ( e.g ., Janus-Graph, Neptune) in two important way...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings of the VLDB Endowment 2021-08, Vol.14 (12), p.2879-2892
Main Authors: Fan, Wenfei, He, Tao, Lai, Longbin, Li, Xue, Li, Yong, Li, Zhao, Qian, Zhengping, Tian, Chao, Wang, Lei, Xu, Jingbo, Yao, Youyang, Yin, Qiang, Yu, Wenyuan, Zhou, Jingren, Zhu, Diwen, Zhu, Rong
Format: Article
Language:English
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:GraphScope is a system and a set of language extensions that enable a new programming interface for large-scale distributed graph computing. It generalizes previous graph processing frameworks (e.g. , Pregel, GraphX) and distributed graph databases ( e.g ., Janus-Graph, Neptune) in two important ways: by exposing a unified programming interface to a wide variety of graph computations such as graph traversal, pattern matching, iterative algorithms and graph neural networks within a high-level programming language; and by supporting the seamless integration of a highly optimized graph engine in a general purpose data-parallel computing system. A GraphScope program is a sequential program composed of declarative data-parallel operators, and can be written using standard Python development tools. The system automatically handles the parallelization and distributed execution of programs on a cluster of machines. It outperforms current state-of-the-art systems by enabling a separate optimization (or family of optimizations) for each graph operation in one carefully designed coherent framework. We describe the design and implementation of GraphScope and evaluate system performance using several real-world applications.
ISSN:2150-8097
2150-8097
DOI:10.14778/3476311.3476369