Loading…

A compression strategy for an efficient TSP-based microaggregation

The advent of decentralised systems and the continuous collection of personal data managed by public and private entities require the application of measures to guarantee the privacy of individuals. Due to the necessity to preserve both the privacy and the utility of such data, different techniques...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2023-03, Vol.213, p.118980, Article 118980
Main Authors: Maya-López, Armando, Martínez-Ballesté, Antoni, Casino, Fran
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The advent of decentralised systems and the continuous collection of personal data managed by public and private entities require the application of measures to guarantee the privacy of individuals. Due to the necessity to preserve both the privacy and the utility of such data, different techniques have been proposed in the literature. Microaggregation, a family of data perturbation methods, relies on the principle of k-anonymity to aggregate personal data records. While several microaggregation heuristics exist, those based on the Travelling Salesman Problem (TSP) have been shown to outperform the state of the art when considering the trade-off between privacy protection and data utility. However, TSP-based heuristics suffer from scalability issues. Intuitively, methods that may reduce the computational time of TSP-based heuristics may incur a higher information loss. Nevertheless, in this article, we propose a method that improves the performance of TSP-based heuristics and can be used in both small and large datasets effectively. Moreover, instead of focusing only on the computational time perspective, our method can preserve and sometimes reduce the information loss resulting from the microaggregation. Extensive experiments with different benchmarks show how our method is able to outperform the current state of the art, considering the trade-off between information loss and computational time. •We improve the performance of TSP-based heuristics regardless of the size of data.•According to data, we can reduce the computational time and the introduced error.•Extensive experiments show how our method is able to outperform the current soa.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2022.118980