Loading…

Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics

Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus o...

Full description

Saved in:
Bibliographic Details
Published in:IEEE access 2022, Vol.10, p.76268-76280
Main Authors: Darwish, Saad M., Essa, Reham M., Osman, Mohamed A., Ismail, Ahmed A.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3
cites cdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3
container_end_page 76280
container_issue
container_start_page 76268
container_title IEEE access
container_volume 10
creator Darwish, Saad M.
Essa, Reham M.
Osman, Mohamed A.
Ismail, Ahmed A.
description Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).
doi_str_mv 10.1109/ACCESS.2022.3192447
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2022_3192447</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9832893</ieee_id><doaj_id>oai_doaj_org_article_88f0183a18db47d88df4dd7a61aa2543</doaj_id><sourcerecordid>2695144431</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</originalsourceid><addsrcrecordid>eNpNUcFOGzEQXVWtVAR8ARdLPSe1Pd5db2-rFEokoKjA2Zqsx6nTzTrYm1T8PU4Xofripzfz3ozmFcWF4HMhePO1XSwuHx7mkks5B9FIpeoPxYkUVTODEqqP_-HPxXlKG56fzlRZnxTP99EfsHth95ESxYMf1uw7jshu_XDEVxG39DfEP8yFyO5ojaM_EGtTCp3POAzs176n9I21A2t3u953EzsGdk3Yj787jMSWQ5Zvc6VLZ8Unh32i87f_tHi6unxcXM9ufv5YLtqbWae4Hme2riyQhFoBF6TcSjrHS1nXQsumlFWpGme1dlLVVq9KB65zknjjECoU4OC0WE6-NuDG7KLfYnwxAb35R4S4NhjzQj2ZbMOFBhTaro522jplbY2VQJSlguz1ZfLaxfC8pzSaTdjHIa9vZL6jUEqByF0wdXUxpBTJvU8V3ByjMlNU5hiVeYsqqy4mlSeid0WjQeoG4BW8JI-7</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2695144431</pqid></control><display><type>article</type><title>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</title><source>IEEE Open Access Journals</source><creator>Darwish, Saad M. ; Essa, Reham M. ; Osman, Mohamed A. ; Ismail, Ahmed A.</creator><creatorcontrib>Darwish, Saad M. ; Essa, Reham M. ; Osman, Mohamed A. ; Ismail, Ahmed A.</creatorcontrib><description>Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2022.3192447</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Data integrity ; Data mining ; Data privacy ; evolutionary computation ; Genetic algorithms ; Health care ; healthcare data ; Heuristic methods ; Medical services ; Optimization ; Physicians ; Privacy ; Privacy-preserving data mining ; sanitization process ; Signs and symptoms ; Task analysis</subject><ispartof>IEEE access, 2022, Vol.10, p.76268-76280</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</citedby><cites>FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</cites><orcidid>0000-0003-2723-1549</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9832893$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Darwish, Saad M.</creatorcontrib><creatorcontrib>Essa, Reham M.</creatorcontrib><creatorcontrib>Osman, Mohamed A.</creatorcontrib><creatorcontrib>Ismail, Ahmed A.</creatorcontrib><title>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</title><title>IEEE access</title><addtitle>Access</addtitle><description>Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).</description><subject>Data integrity</subject><subject>Data mining</subject><subject>Data privacy</subject><subject>evolutionary computation</subject><subject>Genetic algorithms</subject><subject>Health care</subject><subject>healthcare data</subject><subject>Heuristic methods</subject><subject>Medical services</subject><subject>Optimization</subject><subject>Physicians</subject><subject>Privacy</subject><subject>Privacy-preserving data mining</subject><subject>sanitization process</subject><subject>Signs and symptoms</subject><subject>Task analysis</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>DOA</sourceid><recordid>eNpNUcFOGzEQXVWtVAR8ARdLPSe1Pd5db2-rFEokoKjA2Zqsx6nTzTrYm1T8PU4Xofripzfz3ozmFcWF4HMhePO1XSwuHx7mkks5B9FIpeoPxYkUVTODEqqP_-HPxXlKG56fzlRZnxTP99EfsHth95ESxYMf1uw7jshu_XDEVxG39DfEP8yFyO5ojaM_EGtTCp3POAzs176n9I21A2t3u953EzsGdk3Yj787jMSWQ5Zvc6VLZ8Unh32i87f_tHi6unxcXM9ufv5YLtqbWae4Hme2riyQhFoBF6TcSjrHS1nXQsumlFWpGme1dlLVVq9KB65zknjjECoU4OC0WE6-NuDG7KLfYnwxAb35R4S4NhjzQj2ZbMOFBhTaro522jplbY2VQJSlguz1ZfLaxfC8pzSaTdjHIa9vZL6jUEqByF0wdXUxpBTJvU8V3ByjMlNU5hiVeYsqqy4mlSeid0WjQeoG4BW8JI-7</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Darwish, Saad M.</creator><creator>Essa, Reham M.</creator><creator>Osman, Mohamed A.</creator><creator>Ismail, Ahmed A.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2723-1549</orcidid></search><sort><creationdate>2022</creationdate><title>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</title><author>Darwish, Saad M. ; Essa, Reham M. ; Osman, Mohamed A. ; Ismail, Ahmed A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Data integrity</topic><topic>Data mining</topic><topic>Data privacy</topic><topic>evolutionary computation</topic><topic>Genetic algorithms</topic><topic>Health care</topic><topic>healthcare data</topic><topic>Heuristic methods</topic><topic>Medical services</topic><topic>Optimization</topic><topic>Physicians</topic><topic>Privacy</topic><topic>Privacy-preserving data mining</topic><topic>sanitization process</topic><topic>Signs and symptoms</topic><topic>Task analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Darwish, Saad M.</creatorcontrib><creatorcontrib>Essa, Reham M.</creatorcontrib><creatorcontrib>Osman, Mohamed A.</creatorcontrib><creatorcontrib>Ismail, Ahmed A.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Darwish, Saad M.</au><au>Essa, Reham M.</au><au>Osman, Mohamed A.</au><au>Ismail, Ahmed A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2022</date><risdate>2022</risdate><volume>10</volume><spage>76268</spage><epage>76280</epage><pages>76268-76280</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2022.3192447</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0003-2723-1549</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2022, Vol.10, p.76268-76280
issn 2169-3536
2169-3536
language eng
recordid cdi_crossref_primary_10_1109_ACCESS_2022_3192447
source IEEE Open Access Journals
subjects Data integrity
Data mining
Data privacy
evolutionary computation
Genetic algorithms
Health care
healthcare data
Heuristic methods
Medical services
Optimization
Physicians
Privacy
Privacy-preserving data mining
sanitization process
Signs and symptoms
Task analysis
title Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T04%3A04%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Privacy%20Preserving%20Data%20Mining%20Framework%20for%20Negative%20Association%20Rules:%20An%20Application%20to%20Healthcare%20Informatics&rft.jtitle=IEEE%20access&rft.au=Darwish,%20Saad%20M.&rft.date=2022&rft.volume=10&rft.spage=76268&rft.epage=76280&rft.pages=76268-76280&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2022.3192447&rft_dat=%3Cproquest_cross%3E2695144431%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2695144431&rft_id=info:pmid/&rft_ieee_id=9832893&rfr_iscdi=true