Loading…

Logical Foundations of Linked Data Anonymisation

The widespread adoption of the Linked Data paradigm has been driven by the increasing demand for information exchange between organisations, as well as by regulations in domains such as health care and governance that require certain data to be published. In this setting, sensitive information is at...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of artificial intelligence research 2019-02, Vol.64, p.253-314
Main Authors: Grau, Bernardo Cuenca, Kostylev, Egor V.
Format: Article
Language:English
Citations: Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c235t-a1e9ddabb4d2def34621edfae5d369127ecec688fbd58159b40aa663f496db573
cites
container_end_page 314
container_issue
container_start_page 253
container_title The Journal of artificial intelligence research
container_volume 64
creator Grau, Bernardo Cuenca
Kostylev, Egor V.
description The widespread adoption of the Linked Data paradigm has been driven by the increasing demand for information exchange between organisations, as well as by regulations in domains such as health care and governance that require certain data to be published. In this setting, sensitive information is at high risk of disclosure since published data can be often seamlessly linkedwith arbitrary external data sources.In this paper we lay the logical foundations of anonymisation in the context of Linked Data. We consider anonymisations of RDF graphs (and, more generally, relational datasets with labelled nulls) and define notions of policy-compliant and linkage-safe anonymisations. Policy compliance ensures that an anonymised dataset does not reveal any sensitive information as specified by a policy query. Linkage safety ensures that an anonymised dataset remains compliant even if it is linked to (possibly unknown) external datasets available on the Web, thus providing provable protection guarantees against data linkage attacks. We establish the computational complexity of the underpinning decision problems both under the open-world semantics inherent to RDF and under the assumption that an attacker has complete, closed-world knowledge over some parts of the original data.
doi_str_mv 10.1613/jair.1.11355
format article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1613_jair_1_11355</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1613_jair_1_11355</sourcerecordid><originalsourceid>FETCH-LOGICAL-c235t-a1e9ddabb4d2def34621edfae5d369127ecec688fbd58159b40aa663f496db573</originalsourceid><addsrcrecordid>eNpNz8FKAzEUBdAgCtbqzg-YD3DGvGSSmSxLtSoMuNF1eJmXSGo7kWRc9O-l1YWre-HChcPYLfAGNMj7LcbcQAMglTpjC-Cdrk2nuvN__ZJdlbLlHEwr-gXjQ_qII-6qTfqeCOeYplKlUA1x-vRUPeCM1WpK02Efy2m9ZhcBd8Xf_OWSvW8e39bP9fD69LJeDfUopJprBG-I0LmWBPkgWy3AU0CvSGoDovOjH3XfB0eqB2VcyxG1lqE1mpzq5JLd_f6OOZWSfbBfOe4xHyxwe9Tao9aCPWnlD0MBSQ4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Logical Foundations of Linked Data Anonymisation</title><source>Publicly Available Content Database</source><creator>Grau, Bernardo Cuenca ; Kostylev, Egor V.</creator><creatorcontrib>Grau, Bernardo Cuenca ; Kostylev, Egor V.</creatorcontrib><description>The widespread adoption of the Linked Data paradigm has been driven by the increasing demand for information exchange between organisations, as well as by regulations in domains such as health care and governance that require certain data to be published. In this setting, sensitive information is at high risk of disclosure since published data can be often seamlessly linkedwith arbitrary external data sources.In this paper we lay the logical foundations of anonymisation in the context of Linked Data. We consider anonymisations of RDF graphs (and, more generally, relational datasets with labelled nulls) and define notions of policy-compliant and linkage-safe anonymisations. Policy compliance ensures that an anonymised dataset does not reveal any sensitive information as specified by a policy query. Linkage safety ensures that an anonymised dataset remains compliant even if it is linked to (possibly unknown) external datasets available on the Web, thus providing provable protection guarantees against data linkage attacks. We establish the computational complexity of the underpinning decision problems both under the open-world semantics inherent to RDF and under the assumption that an attacker has complete, closed-world knowledge over some parts of the original data.</description><identifier>ISSN: 1076-9757</identifier><identifier>EISSN: 1076-9757</identifier><identifier>DOI: 10.1613/jair.1.11355</identifier><language>eng</language><ispartof>The Journal of artificial intelligence research, 2019-02, Vol.64, p.253-314</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c235t-a1e9ddabb4d2def34621edfae5d369127ecec688fbd58159b40aa663f496db573</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Grau, Bernardo Cuenca</creatorcontrib><creatorcontrib>Kostylev, Egor V.</creatorcontrib><title>Logical Foundations of Linked Data Anonymisation</title><title>The Journal of artificial intelligence research</title><description>The widespread adoption of the Linked Data paradigm has been driven by the increasing demand for information exchange between organisations, as well as by regulations in domains such as health care and governance that require certain data to be published. In this setting, sensitive information is at high risk of disclosure since published data can be often seamlessly linkedwith arbitrary external data sources.In this paper we lay the logical foundations of anonymisation in the context of Linked Data. We consider anonymisations of RDF graphs (and, more generally, relational datasets with labelled nulls) and define notions of policy-compliant and linkage-safe anonymisations. Policy compliance ensures that an anonymised dataset does not reveal any sensitive information as specified by a policy query. Linkage safety ensures that an anonymised dataset remains compliant even if it is linked to (possibly unknown) external datasets available on the Web, thus providing provable protection guarantees against data linkage attacks. We establish the computational complexity of the underpinning decision problems both under the open-world semantics inherent to RDF and under the assumption that an attacker has complete, closed-world knowledge over some parts of the original data.</description><issn>1076-9757</issn><issn>1076-9757</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNpNz8FKAzEUBdAgCtbqzg-YD3DGvGSSmSxLtSoMuNF1eJmXSGo7kWRc9O-l1YWre-HChcPYLfAGNMj7LcbcQAMglTpjC-Cdrk2nuvN__ZJdlbLlHEwr-gXjQ_qII-6qTfqeCOeYplKlUA1x-vRUPeCM1WpK02Efy2m9ZhcBd8Xf_OWSvW8e39bP9fD69LJeDfUopJprBG-I0LmWBPkgWy3AU0CvSGoDovOjH3XfB0eqB2VcyxG1lqE1mpzq5JLd_f6OOZWSfbBfOe4xHyxwe9Tao9aCPWnlD0MBSQ4</recordid><startdate>20190216</startdate><enddate>20190216</enddate><creator>Grau, Bernardo Cuenca</creator><creator>Kostylev, Egor V.</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20190216</creationdate><title>Logical Foundations of Linked Data Anonymisation</title><author>Grau, Bernardo Cuenca ; Kostylev, Egor V.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c235t-a1e9ddabb4d2def34621edfae5d369127ecec688fbd58159b40aa663f496db573</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Grau, Bernardo Cuenca</creatorcontrib><creatorcontrib>Kostylev, Egor V.</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of artificial intelligence research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Grau, Bernardo Cuenca</au><au>Kostylev, Egor V.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Logical Foundations of Linked Data Anonymisation</atitle><jtitle>The Journal of artificial intelligence research</jtitle><date>2019-02-16</date><risdate>2019</risdate><volume>64</volume><spage>253</spage><epage>314</epage><pages>253-314</pages><issn>1076-9757</issn><eissn>1076-9757</eissn><abstract>The widespread adoption of the Linked Data paradigm has been driven by the increasing demand for information exchange between organisations, as well as by regulations in domains such as health care and governance that require certain data to be published. In this setting, sensitive information is at high risk of disclosure since published data can be often seamlessly linkedwith arbitrary external data sources.In this paper we lay the logical foundations of anonymisation in the context of Linked Data. We consider anonymisations of RDF graphs (and, more generally, relational datasets with labelled nulls) and define notions of policy-compliant and linkage-safe anonymisations. Policy compliance ensures that an anonymised dataset does not reveal any sensitive information as specified by a policy query. Linkage safety ensures that an anonymised dataset remains compliant even if it is linked to (possibly unknown) external datasets available on the Web, thus providing provable protection guarantees against data linkage attacks. We establish the computational complexity of the underpinning decision problems both under the open-world semantics inherent to RDF and under the assumption that an attacker has complete, closed-world knowledge over some parts of the original data.</abstract><doi>10.1613/jair.1.11355</doi><tpages>62</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1076-9757
ispartof The Journal of artificial intelligence research, 2019-02, Vol.64, p.253-314
issn 1076-9757
1076-9757
language eng
recordid cdi_crossref_primary_10_1613_jair_1_11355
source Publicly Available Content Database
title Logical Foundations of Linked Data Anonymisation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T11%3A23%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Logical%20Foundations%20of%20Linked%20Data%20Anonymisation&rft.jtitle=The%20Journal%20of%20artificial%20intelligence%20research&rft.au=Grau,%20Bernardo%20Cuenca&rft.date=2019-02-16&rft.volume=64&rft.spage=253&rft.epage=314&rft.pages=253-314&rft.issn=1076-9757&rft.eissn=1076-9757&rft_id=info:doi/10.1613/jair.1.11355&rft_dat=%3Ccrossref%3E10_1613_jair_1_11355%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c235t-a1e9ddabb4d2def34621edfae5d369127ecec688fbd58159b40aa663f496db573%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true