Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture

RDF is being increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary track with multip...

Bibliographic Details
Published in: IEEE transactions on knowledge and data engineering, 2022-03, Vol. 34 (3), p. 1370-1389
Main Authors: Santana, Luiz Henrique Zambom; Mello, Ronaldo dos Santos
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043
cites cdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043
container_end_page 1389
container_issue 3
container_start_page 1370
container_title IEEE transactions on knowledge and data engineering
container_volume 34
creator Santana, Luiz Henrique Zambom
Mello, Ronaldo dos Santos
description RDF is being increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary path, with multiple challenges, before RDF reaches the scale of the most recent Big Data-intensive applications (e.g., Smart Cities, Sensor Networks, eHealth, Internet of Things). In this survey, we review the use of NoSQL databases for storing large RDF graphs by revisiting the latest surveys and expanding their findings, updating proposals and shedding light on aspects such as model mapping between RDF and NoSQL, triple indexing and partitioning, graph fragmentation, and data caching. Moreover, we explain how the surveyed works extended the RDF capabilities so that datasets can benefit from the scalability, schemaless data, and better overall performance of NoSQL databases. The survey summarizes the current state of the art, discusses open problems, and proposes a Reference Architecture (RA). To the best of our knowledge, this is the first survey that focuses solely on papers that use one or more NoSQL systems for RDF persistence.
doi_str_mv 10.1109/TKDE.2020.2994521
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_proquest_journals_2625366587</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9093172</ieee_id><sourcerecordid>2625366587</sourcerecordid><originalsourceid>FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043</originalsourceid><addsrcrecordid>eNo9kMtOwzAQAC0EEqXwAYiLJc4pXttJbG5VH1BR8WjL2XLtjUgFSbETpP49Ka047R5mdqUh5BrYAIDpu9XTeDLgjLMB11qmHE5ID9JUJRw0nHY7k5BIIfNzchHjhjGmcgU9MnvFEMvYYOWQ1gVdjKd0bBtLy6qp6XO9fJvf0yFdtuEHd9RWnlq6wALDnzAM7qNs0DVtwEtyVtjPiFfH2Sfv08lq9JjMXx5mo-E8cVyLJim0EmufK5v7LFNaFzJXPkut11Y6QL32IkWpvPbccikBJQpfyGwNQjvJpOiT28Pdbai_W4yN2dRtqLqXhmc8FVmWqryj4EC5UMcYsDDbUH7ZsDPAzL6Y2Rcz-2LmWKxzbg5OiYj_vGZaQM7FL-4wZSQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2625366587</pqid></control><display><type>article</type><title>Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture</title><source>IEEE Xplore (Online service)</source><creator>Santana, Luiz Henrique Zambom ; Mello, Ronaldo dos Santos</creator><creatorcontrib>Santana, Luiz Henrique Zambom ; Mello, Ronaldo dos Santos</creatorcontrib><description>RDF is being increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary track with multiple challenges for RDF reaching the scale of the most recent Big Data intensive applications (e.g., Smart Cities, Sensor Networks, eHealth, Internet of Things). 
In this survey, we review the usage of NoSQL databases to the storage of large RDF graphs by rehearsing the latest surveys and expanding their findings by updating proposals and bringing light to aspects such as model mapping between RDF and NoSQL, triple indexing and partitioning, graph fragmentation and data caching. Moreover, we explain how the surveyed works extended the RDF capabilities so the datasets can benefit of the characteristics of scalability, schemaless data, and better overall performance of NoSQL databases. The survey summarizes the current state of art, discusses open problems, and proposes a Reference Architecture (RA). For the best of our knowledge, this is the first survey where the focus is solely on papers that use one or more NoSQL systems for the RDF persistence.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2020.2994521</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Benchmark testing ; Big Data ; Computer architecture ; Data models ; Indexing ; Information management ; Internet of Things ; NoSQL ; NoSQL databases ; RDF ; Resource description framework ; Scalability ; Semantic Web ; SPARQL</subject><ispartof>IEEE transactions on knowledge and data engineering, 2022-03, Vol.34 (3), p.1370-1389</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043</citedby><cites>FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043</cites><orcidid>0000-0003-4262-5474 ; 0000-0001-5626-9859</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9093172$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Santana, Luiz Henrique Zambom</creatorcontrib><creatorcontrib>Mello, Ronaldo dos Santos</creatorcontrib><title>Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>RDF is being increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary track with multiple challenges for RDF reaching the scale of the most recent Big Data intensive applications (e.g., Smart Cities, Sensor Networks, eHealth, Internet of Things). In this survey, we review the usage of NoSQL databases to the storage of large RDF graphs by rehearsing the latest surveys and expanding their findings by updating proposals and bringing light to aspects such as model mapping between RDF and NoSQL, triple indexing and partitioning, graph fragmentation and data caching. 
Moreover, we explain how the surveyed works extended the RDF capabilities so the datasets can benefit of the characteristics of scalability, schemaless data, and better overall performance of NoSQL databases. The survey summarizes the current state of art, discusses open problems, and proposes a Reference Architecture (RA). For the best of our knowledge, this is the first survey where the focus is solely on papers that use one or more NoSQL systems for the RDF persistence.</description><subject>Benchmark testing</subject><subject>Big Data</subject><subject>Computer architecture</subject><subject>Data models</subject><subject>Indexing</subject><subject>Information management</subject><subject>Internet of Things</subject><subject>NoSQL</subject><subject>NoSQL databases</subject><subject>RDF</subject><subject>Resource description framework</subject><subject>Scalability</subject><subject>Semantic Web</subject><subject>SPARQL</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNo9kMtOwzAQAC0EEqXwAYiLJc4pXttJbG5VH1BR8WjL2XLtjUgFSbETpP49Ka047R5mdqUh5BrYAIDpu9XTeDLgjLMB11qmHE5ID9JUJRw0nHY7k5BIIfNzchHjhjGmcgU9MnvFEMvYYOWQ1gVdjKd0bBtLy6qp6XO9fJvf0yFdtuEHd9RWnlq6wALDnzAM7qNs0DVtwEtyVtjPiFfH2Sfv08lq9JjMXx5mo-E8cVyLJim0EmufK5v7LFNaFzJXPkut11Y6QL32IkWpvPbccikBJQpfyGwNQjvJpOiT28Pdbai_W4yN2dRtqLqXhmc8FVmWqryj4EC5UMcYsDDbUH7ZsDPAzL6Y2Rcz-2LmWKxzbg5OiYj_vGZaQM7FL-4wZSQ</recordid><startdate>20220301</startdate><enddate>20220301</enddate><creator>Santana, Luiz Henrique Zambom</creator><creator>Mello, Ronaldo dos Santos</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-4262-5474</orcidid><orcidid>https://orcid.org/0000-0001-5626-9859</orcidid></search><sort><creationdate>20220301</creationdate><title>Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture</title><author>Santana, Luiz Henrique Zambom ; Mello, Ronaldo dos Santos</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Benchmark testing</topic><topic>Big Data</topic><topic>Computer architecture</topic><topic>Data models</topic><topic>Indexing</topic><topic>Information management</topic><topic>Internet of Things</topic><topic>NoSQL</topic><topic>NoSQL databases</topic><topic>RDF</topic><topic>Resource description framework</topic><topic>Scalability</topic><topic>Semantic Web</topic><topic>SPARQL</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Santana, Luiz Henrique Zambom</creatorcontrib><creatorcontrib>Mello, Ronaldo dos Santos</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEL</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and 
Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Santana, Luiz Henrique Zambom</au><au>Mello, Ronaldo dos Santos</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2022-03-01</date><risdate>2022</risdate><volume>34</volume><issue>3</issue><spage>1370</spage><epage>1389</epage><pages>1370-1389</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>RDF is being increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary track with multiple challenges for RDF reaching the scale of the most recent Big Data intensive applications (e.g., Smart Cities, Sensor Networks, eHealth, Internet of Things). In this survey, we review the usage of NoSQL databases to the storage of large RDF graphs by rehearsing the latest surveys and expanding their findings by updating proposals and bringing light to aspects such as model mapping between RDF and NoSQL, triple indexing and partitioning, graph fragmentation and data caching. Moreover, we explain how the surveyed works extended the RDF capabilities so the datasets can benefit of the characteristics of scalability, schemaless data, and better overall performance of NoSQL databases. The survey summarizes the current state of art, discusses open problems, and proposes a Reference Architecture (RA). 
For the best of our knowledge, this is the first survey where the focus is solely on papers that use one or more NoSQL systems for the RDF persistence.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TKDE.2020.2994521</doi><tpages>20</tpages><orcidid>https://orcid.org/0000-0003-4262-5474</orcidid><orcidid>https://orcid.org/0000-0001-5626-9859</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1041-4347
ispartof IEEE transactions on knowledge and data engineering, 2022-03, Vol.34 (3), p.1370-1389
issn 1041-4347
1558-2191
language eng
recordid cdi_proquest_journals_2625366587
source IEEE Xplore (Online service)
subjects Benchmark testing
Big Data
Computer architecture
Data models
Indexing
Information management
Internet of Things
NoSQL
NoSQL databases
RDF
Resource description framework
Scalability
Semantic Web
SPARQL
title Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T07%3A39%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Persistence%20of%20RDF%20Data%20into%20NoSQL:%20A%20Survey%20and%20a%20Reference%20Architecture&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Santana,%20Luiz%20Henrique%20Zambom&rft.date=2022-03-01&rft.volume=34&rft.issue=3&rft.spage=1370&rft.epage=1389&rft.pages=1370-1389&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2020.2994521&rft_dat=%3Cproquest_ieee_%3E2625366587%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2625366587&rft_id=info:pmid/&rft_ieee_id=9093172&rfr_iscdi=true