Loading…
Grandma Karl is 27 years old -- research agenda for pseudonymization of research data
Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution...
Saved in:
Published in: | arXiv.org 2023-08 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Volodina, Elena Dobnik, Simon Therese Lindström Tiedemann Xuan-Son Vu |
description | Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2859362699</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2859362699</sourcerecordid><originalsourceid>FETCH-proquest_journals_28593626993</originalsourceid><addsrcrecordid>eNqNissKwjAQAIMgWLT_sOA5UDf2dRYf4FXPZWlSbWmTmm0P9evtQfDqaRhmFiJApXYy2yOuRMjcRFGESYpxrAJxP3uyuiO4km-hZsAUJkOewbUapARveNbyCfQwVhNUzkPPZtTOTl39pqF2Flz1-zQNtBHLilo24ZdrsT0db4eL7L17jYaHonGjt3MqMItzlWCS5-q_6wMAJT_I</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2859362699</pqid></control><display><type>article</type><title>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Volodina, Elena ; Dobnik, Simon ; Therese Lindström Tiedemann ; Xuan-Son Vu</creator><creatorcontrib>Volodina, Elena ; Dobnik, Simon ; Therese Lindström Tiedemann ; Xuan-Son Vu</creatorcontrib><description>Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Unstructured data</subject><ispartof>arXiv.org, 2023-08</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2859362699?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Volodina, Elena</creatorcontrib><creatorcontrib>Dobnik, Simon</creatorcontrib><creatorcontrib>Therese Lindström Tiedemann</creatorcontrib><creatorcontrib>Xuan-Son Vu</creatorcontrib><title>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</title><title>arXiv.org</title><description>Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.</description><subject>Algorithms</subject><subject>Unstructured data</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNissKwjAQAIMgWLT_sOA5UDf2dRYf4FXPZWlSbWmTmm0P9evtQfDqaRhmFiJApXYy2yOuRMjcRFGESYpxrAJxP3uyuiO4km-hZsAUJkOewbUapARveNbyCfQwVhNUzkPPZtTOTl39pqF2Flz1-zQNtBHLilo24ZdrsT0db4eL7L17jYaHonGjt3MqMItzlWCS5-q_6wMAJT_I</recordid><startdate>20230830</startdate><enddate>20230830</enddate><creator>Volodina, Elena</creator><creator>Dobnik, Simon</creator><creator>Therese Lindström Tiedemann</creator><creator>Xuan-Son Vu</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope></search><sort><creationdate>20230830</creationdate><title>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</title><author>Volodina, Elena ; Dobnik, Simon ; Therese Lindström Tiedemann ; Xuan-Son Vu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_28593626993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Unstructured data</topic><toplevel>online_resources</toplevel><creatorcontrib>Volodina, Elena</creatorcontrib><creatorcontrib>Dobnik, Simon</creatorcontrib><creatorcontrib>Therese Lindström Tiedemann</creatorcontrib><creatorcontrib>Xuan-Son Vu</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Volodina, Elena</au><au>Dobnik, Simon</au><au>Therese Lindström Tiedemann</au><au>Xuan-Son Vu</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Grandma Karl is 27 years old -- research agenda for pseudonymization of research data</atitle><jtitle>arXiv.org</jtitle><date>2023-08-30</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2023-08 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2859362699 |
source | Publicly Available Content Database (Proquest) (PQ_SDU_P3) |
subjects | Algorithms Unstructured data |
title | Grandma Karl is 27 years old -- research agenda for pseudonymization of research data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T06%3A27%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Grandma%20Karl%20is%2027%20years%20old%20--%20research%20agenda%20for%20pseudonymization%20of%20research%20data&rft.jtitle=arXiv.org&rft.au=Volodina,%20Elena&rft.date=2023-08-30&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2859362699%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_28593626993%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2859362699&rft_id=info:pmid/&rfr_iscdi=true |