Loading…

Grandma Karl is 27 years old -- research agenda for pseudonymization of research data

Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2023-08
Main Authors: Volodina, Elena, Dobnik, Simon, Therese Lindström Tiedemann, Xuan-Son Vu
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Volodina, Elena
Dobnik, Simon
Therese Lindström Tiedemann
Xuan-Son Vu
description Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2859362699</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2859362699</sourcerecordid><originalsourceid>FETCH-proquest_journals_28593626993</originalsourceid><addsrcrecordid>eNqNissKwjAQAIMgWLT_sOA5UDf2dRYf4FXPZWlSbWmTmm0P9evtQfDqaRhmFiJApXYy2yOuRMjcRFGESYpxrAJxP3uyuiO4km-hZsAUJkOewbUapARveNbyCfQwVhNUzkPPZtTOTl39pqF2Flz1-zQNtBHLilo24ZdrsT0db4eL7L17jYaHonGjt3MqMItzlWCS5-q_6wMAJT_I</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2859362699</pqid></control><display><type>article</type><title>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Volodina, Elena ; Dobnik, Simon ; Therese Lindström Tiedemann ; Xuan-Son Vu</creator><creatorcontrib>Volodina, Elena ; Dobnik, Simon ; Therese Lindström Tiedemann ; Xuan-Son Vu</creatorcontrib><description>Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Unstructured data</subject><ispartof>arXiv.org, 2023-08</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2859362699?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Volodina, Elena</creatorcontrib><creatorcontrib>Dobnik, Simon</creatorcontrib><creatorcontrib>Therese Lindström Tiedemann</creatorcontrib><creatorcontrib>Xuan-Son Vu</creatorcontrib><title>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</title><title>arXiv.org</title><description>Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.</description><subject>Algorithms</subject><subject>Unstructured data</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNissKwjAQAIMgWLT_sOA5UDf2dRYf4FXPZWlSbWmTmm0P9evtQfDqaRhmFiJApXYy2yOuRMjcRFGESYpxrAJxP3uyuiO4km-hZsAUJkOewbUapARveNbyCfQwVhNUzkPPZtTOTl39pqF2Flz1-zQNtBHLilo24ZdrsT0db4eL7L17jYaHonGjt3MqMItzlWCS5-q_6wMAJT_I</recordid><startdate>20230830</startdate><enddate>20230830</enddate><creator>Volodina, Elena</creator><creator>Dobnik, Simon</creator><creator>Therese Lindström Tiedemann</creator><creator>Xuan-Son Vu</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope></search><sort><creationdate>20230830</creationdate><title>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</title><author>Volodina, Elena ; Dobnik, Simon ; Therese Lindström Tiedemann ; Xuan-Son Vu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_28593626993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Unstructured data</topic><toplevel>online_resources</toplevel><creatorcontrib>Volodina, Elena</creatorcontrib><creatorcontrib>Dobnik, Simon</creatorcontrib><creatorcontrib>Therese Lindström Tiedemann</creatorcontrib><creatorcontrib>Xuan-Son Vu</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Volodina, Elena</au><au>Dobnik, Simon</au><au>Therese Lindström Tiedemann</au><au>Xuan-Son Vu</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</atitle><jtitle>arXiv.org</jtitle><date>2023-08-30</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-08
issn 2331-8422
language eng
recordid cdi_proquest_journals_2859362699
source Publicly Available Content Database (Proquest) (PQ_SDU_P3)
subjects Algorithms
Unstructured data
title Grandma Karl is 27 years old -- research agenda for pseudonymization of research data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T06%3A27%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Grandma%20Karl%20is%2027%20years%20old%20--%20research%20agenda%20for%20pseudonymization%20of%20research%20data&rft.jtitle=arXiv.org&rft.au=Volodina,%20Elena&rft.date=2023-08-30&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2859362699%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_28593626993%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2859362699&rft_id=info:pmid/&rfr_iscdi=true