Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture
RDF is being increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary track with multip...
Published in: | IEEE transactions on knowledge and data engineering 2022-03, Vol.34 (3), p.1370-1389 |
---|---|
Main Authors: | Santana, Luiz Henrique Zambom ; Mello, Ronaldo dos Santos |
Format: | Article |
Language: | English |
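Subjects: | Benchmark testing ; Big Data ; Computer architecture ; Data models ; Indexing ; Information management ; Internet of Things ; NoSQL ; NoSQL databases ; RDF ; Resource description framework ; Scalability ; Semantic Web ; SPARQL |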
cited_by | cdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043 |
---|---|
cites | cdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043 |
container_end_page | 1389 |
container_issue | 3 |
container_start_page | 1370 |
container_title | IEEE transactions on knowledge and data engineering |
container_volume | 34 |
creator | Santana, Luiz Henrique Zambom ; Mello, Ronaldo dos Santos |
description | RDF is increasingly considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, there is still a long evolutionary track, with multiple challenges, before RDF reaches the scale of the most recent Big Data-intensive applications (e.g., Smart Cities, Sensor Networks, eHealth, Internet of Things). In this survey, we review the use of NoSQL databases for the storage of large RDF graphs by revisiting the latest surveys and expanding their findings, updating the proposals and shedding light on aspects such as model mapping between RDF and NoSQL, triple indexing and partitioning, graph fragmentation, and data caching. Moreover, we explain how the surveyed works extended RDF capabilities so that datasets can benefit from the scalability, schemaless data, and better overall performance of NoSQL databases. The survey summarizes the current state of the art, discusses open problems, and proposes a Reference Architecture (RA). To the best of our knowledge, this is the first survey focused solely on papers that use one or more NoSQL systems for RDF persistence. |
doi_str_mv | 10.1109/TKDE.2020.2994521 |
format | article |
publisher | New York: IEEE |
eissn | 1558-2191 |
coden | ITKEEH |
orcid | 0000-0003-4262-5474 ; 0000-0001-5626-9859 |
rights | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022 |
tpages | 20 |
fulltext | fulltext |
identifier | ISSN: 1041-4347 |
ispartof | IEEE transactions on knowledge and data engineering, 2022-03, Vol.34 (3), p.1370-1389 |
issn | 1041-4347 ; 1558-2191 |
language | eng |
recordid | cdi_proquest_journals_2625366587 |
source | IEEE Xplore (Online service) |
subjects | Benchmark testing ; Big Data ; Computer architecture ; Data models ; Indexing ; Information management ; Internet of Things ; NoSQL ; NoSQL databases ; RDF ; Resource description framework ; Scalability ; Semantic Web ; SPARQL |
title | Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T07%3A39%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Persistence%20of%20RDF%20Data%20into%20NoSQL:%20A%20Survey%20and%20a%20Reference%20Architecture&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Santana,%20Luiz%20Henrique%20Zambom&rft.date=2022-03-01&rft.volume=34&rft.issue=3&rft.spage=1370&rft.epage=1389&rft.pages=1370-1389&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2020.2994521&rft_dat=%3Cproquest_ieee_%3E2625366587%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c293t-f983bd78a7d66899f478d65ad9a4c1e9bd35e48d9d2a2441e4e3df46b139c4043%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2625366587&rft_id=info:pmid/&rft_ieee_id=9093172&rfr_iscdi=true |