Loading…
Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics
Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus o...
Saved in:
Published in: | IEEE access 2022, Vol.10, p.76268-76280 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3 |
---|---|
cites | cdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3 |
container_end_page | 76280 |
container_issue | |
container_start_page | 76268 |
container_title | IEEE access |
container_volume | 10 |
creator | Darwish, Saad M. Essa, Reham M. Osman, Mohamed A. Ismail, Ahmed A. |
description | Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR). |
doi_str_mv | 10.1109/ACCESS.2022.3192447 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2022_3192447</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9832893</ieee_id><doaj_id>oai_doaj_org_article_88f0183a18db47d88df4dd7a61aa2543</doaj_id><sourcerecordid>2695144431</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</originalsourceid><addsrcrecordid>eNpNUcFOGzEQXVWtVAR8ARdLPSe1Pd5db2-rFEokoKjA2Zqsx6nTzTrYm1T8PU4Xofripzfz3ozmFcWF4HMhePO1XSwuHx7mkks5B9FIpeoPxYkUVTODEqqP_-HPxXlKG56fzlRZnxTP99EfsHth95ESxYMf1uw7jshu_XDEVxG39DfEP8yFyO5ojaM_EGtTCp3POAzs176n9I21A2t3u953EzsGdk3Yj787jMSWQ5Zvc6VLZ8Unh32i87f_tHi6unxcXM9ufv5YLtqbWae4Hme2riyQhFoBF6TcSjrHS1nXQsumlFWpGme1dlLVVq9KB65zknjjECoU4OC0WE6-NuDG7KLfYnwxAb35R4S4NhjzQj2ZbMOFBhTaro522jplbY2VQJSlguz1ZfLaxfC8pzSaTdjHIa9vZL6jUEqByF0wdXUxpBTJvU8V3ByjMlNU5hiVeYsqqy4mlSeid0WjQeoG4BW8JI-7</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2695144431</pqid></control><display><type>article</type><title>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</title><source>IEEE Open Access Journals</source><creator>Darwish, Saad M. ; Essa, Reham M. ; Osman, Mohamed A. ; Ismail, Ahmed A.</creator><creatorcontrib>Darwish, Saad M. ; Essa, Reham M. ; Osman, Mohamed A. ; Ismail, Ahmed A.</creatorcontrib><description>Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2022.3192447</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Data integrity ; Data mining ; Data privacy ; evolutionary computation ; Genetic algorithms ; Health care ; healthcare data ; Heuristic methods ; Medical services ; Optimization ; Physicians ; Privacy ; Privacy-preserving data mining ; sanitization process ; Signs and symptoms ; Task analysis</subject><ispartof>IEEE access, 2022, Vol.10, p.76268-76280</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</citedby><cites>FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</cites><orcidid>0000-0003-2723-1549</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9832893$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Darwish, Saad M.</creatorcontrib><creatorcontrib>Essa, Reham M.</creatorcontrib><creatorcontrib>Osman, Mohamed A.</creatorcontrib><creatorcontrib>Ismail, Ahmed A.</creatorcontrib><title>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</title><title>IEEE access</title><addtitle>Access</addtitle><description>Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).</description><subject>Data integrity</subject><subject>Data mining</subject><subject>Data privacy</subject><subject>evolutionary computation</subject><subject>Genetic algorithms</subject><subject>Health care</subject><subject>healthcare data</subject><subject>Heuristic methods</subject><subject>Medical services</subject><subject>Optimization</subject><subject>Physicians</subject><subject>Privacy</subject><subject>Privacy-preserving data mining</subject><subject>sanitization process</subject><subject>Signs and symptoms</subject><subject>Task analysis</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>DOA</sourceid><recordid>eNpNUcFOGzEQXVWtVAR8ARdLPSe1Pd5db2-rFEokoKjA2Zqsx6nTzTrYm1T8PU4Xofripzfz3ozmFcWF4HMhePO1XSwuHx7mkks5B9FIpeoPxYkUVTODEqqP_-HPxXlKG56fzlRZnxTP99EfsHth95ESxYMf1uw7jshu_XDEVxG39DfEP8yFyO5ojaM_EGtTCp3POAzs176n9I21A2t3u953EzsGdk3Yj787jMSWQ5Zvc6VLZ8Unh32i87f_tHi6unxcXM9ufv5YLtqbWae4Hme2riyQhFoBF6TcSjrHS1nXQsumlFWpGme1dlLVVq9KB65zknjjECoU4OC0WE6-NuDG7KLfYnwxAb35R4S4NhjzQj2ZbMOFBhTaro522jplbY2VQJSlguz1ZfLaxfC8pzSaTdjHIa9vZL6jUEqByF0wdXUxpBTJvU8V3ByjMlNU5hiVeYsqqy4mlSeid0WjQeoG4BW8JI-7</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Darwish, Saad M.</creator><creator>Essa, Reham M.</creator><creator>Osman, Mohamed A.</creator><creator>Ismail, Ahmed A.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2723-1549</orcidid></search><sort><creationdate>2022</creationdate><title>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</title><author>Darwish, Saad M. ; Essa, Reham M. ; Osman, Mohamed A. ; Ismail, Ahmed A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Data integrity</topic><topic>Data mining</topic><topic>Data privacy</topic><topic>evolutionary computation</topic><topic>Genetic algorithms</topic><topic>Health care</topic><topic>healthcare data</topic><topic>Heuristic methods</topic><topic>Medical services</topic><topic>Optimization</topic><topic>Physicians</topic><topic>Privacy</topic><topic>Privacy-preserving data mining</topic><topic>sanitization process</topic><topic>Signs and symptoms</topic><topic>Task analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Darwish, Saad M.</creatorcontrib><creatorcontrib>Essa, Reham M.</creatorcontrib><creatorcontrib>Osman, Mohamed A.</creatorcontrib><creatorcontrib>Ismail, Ahmed A.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Darwish, Saad M.</au><au>Essa, Reham M.</au><au>Osman, Mohamed A.</au><au>Ismail, Ahmed A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2022</date><risdate>2022</risdate><volume>10</volume><spage>76268</spage><epage>76280</epage><pages>76268-76280</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Protecting the privacy of healthcare information is an important part of encouraging data custodians to give accurate records so that mining may proceed with confidence. The application of association rule mining in healthcare data has been widespread to this point in time. Most applications focus on positive association rules, ignoring the negative consequences of particular diagnostic techniques. When it comes to bridging divergent diseases and drugs, negative association rules may give more helpful information than positive ones. This is especially true when it comes to physicians and social organizations (e.g., a certain symptom will not arise when certain symptoms exist). Data mining in healthcare must be done in a way that protects the identity of patients, especially when dealing with sensitive information. However, revealing this information puts it at risk of attack. Healthcare data privacy protection has lately been addressed by technologies that disrupt data (data sanitization) and reconstruct aggregate distributions in the interest of doing research in data mining. In this study, metaheuristic-based data sanitization for healthcare data mining is investigated in order to keep patient privacy protected. It is hoped that by using the Tabu-genetic algorithm as an optimization tool, the suggested technique chooses item sets to be sanitized (modified) from transactions that satisfy sensitive negative criteria with the goal of minimizing changes to the original database. Experiments with benchmark healthcare datasets show that the suggested privacy preserving data mining (PPDM) method outperforms existing algorithms in terms of Hiding Failure (HF), Artificial Rule Generation (AR), and Lost Rules (LR).</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2022.3192447</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0003-2723-1549</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2022, Vol.10, p.76268-76280 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_crossref_primary_10_1109_ACCESS_2022_3192447 |
source | IEEE Open Access Journals |
subjects | Data integrity Data mining Data privacy evolutionary computation Genetic algorithms Health care healthcare data Heuristic methods Medical services Optimization Physicians Privacy Privacy-preserving data mining sanitization process Signs and symptoms Task analysis |
title | Privacy Preserving Data Mining Framework for Negative Association Rules: An Application to Healthcare Informatics |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T04%3A04%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Privacy%20Preserving%20Data%20Mining%20Framework%20for%20Negative%20Association%20Rules:%20An%20Application%20to%20Healthcare%20Informatics&rft.jtitle=IEEE%20access&rft.au=Darwish,%20Saad%20M.&rft.date=2022&rft.volume=10&rft.spage=76268&rft.epage=76280&rft.pages=76268-76280&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2022.3192447&rft_dat=%3Cproquest_cross%3E2695144431%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c408t-d76d3e2374301e4fb2ff052771829526549fd88f247d8b5f3fcf2e09fa36a13f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2695144431&rft_id=info:pmid/&rft_ieee_id=9832893&rfr_iscdi=true |