TY - GEN
T1 - Utility Promises of Self-Organising Maps in Privacy Preserving Data Mining
AU - Mohammed, Kabiru
AU - Ayesh, Aladdin
AU - Boiten, Eerke
N1 - International Workshop on Data Privacy Management; International Workshop on Cryptocurrencies and Blockchain Technology; Data Privacy Management, Cryptocurrencies and Blockchain Technology, ESORICS 2020 International Workshops, DPM 2020 and CBT 2020, Revised Selected Papers
Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Data mining techniques are highly efficient in sifting through big data to extract hidden knowledge and assist evidence-based decisions. However, it poses severe threats to individuals’ privacy because it can be exploited to allow inferences to be made on sensitive data. Researchers have proposed several privacy-preserving data mining techniques to address this challenge. One unique method is by extending anonymisation privacy models in data mining processes to enhance privacy and utility. Several published works in this area have utilised clustering techniques to enforce anonymisation models on private data, which work by grouping the data into clusters using a quality measure and then generalise the data in each group separately to achieve an anonymisation threshold. Although they are highly efficient and practical, however guaranteeing adequate balance between data utility and privacy protection remains a challenge. In addition to this, existing approaches do not work well with high-dimensional data, since it is difficult to develop good groupings without incurring excessive information loss. Our work aims to overcome these challenges by proposing a hybrid approach, combining self organising maps with conventional privacy based clustering algorithms. The main contribution of this paper is to show that, dimensionality reduction techniques can improve the anonymisation process by incurring less information loss, thus producing a more desirable balance between privacy and utility properties.
AB - Data mining techniques are highly efficient in sifting through big data to extract hidden knowledge and assist evidence-based decisions. However, it poses severe threats to individuals’ privacy because it can be exploited to allow inferences to be made on sensitive data. Researchers have proposed several privacy-preserving data mining techniques to address this challenge. One unique method is by extending anonymisation privacy models in data mining processes to enhance privacy and utility. Several published works in this area have utilised clustering techniques to enforce anonymisation models on private data, which work by grouping the data into clusters using a quality measure and then generalise the data in each group separately to achieve an anonymisation threshold. Although they are highly efficient and practical, however guaranteeing adequate balance between data utility and privacy protection remains a challenge. In addition to this, existing approaches do not work well with high-dimensional data, since it is difficult to develop good groupings without incurring excessive information loss. Our work aims to overcome these challenges by proposing a hybrid approach, combining self organising maps with conventional privacy based clustering algorithms. The main contribution of this paper is to show that, dimensionality reduction techniques can improve the anonymisation process by incurring less information loss, thus producing a more desirable balance between privacy and utility properties.
KW - Clustering
KW - k-anonymity
KW - Privacy Preserving Data Mining
KW - Self Organising Map
UR - http://www.scopus.com/inward/record.url?scp=85101826165&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-66172-4_4
DO - 10.1007/978-3-030-66172-4_4
M3 - Published conference contribution
AN - SCOPUS:85101826165
SN - 9783030661717
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 55
EP - 72
BT - Data Privacy Management, Cryptocurrencies and Blockchain Technology
A2 - Garcia-Alfaro, Joaquin
A2 - Navarro-Arribas, Guillermo
A2 - Herrera-Joancomarti, Jordi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 15th International Workshop on Data Privacy Management, DPM 2020 and 4th International Workshop on Cryptocurrencies and Blockchain Technology, CBT 2020 held in conjunction with 25th European Symposium on Research in Computer Security, ESORICS 2020
Y2 - 17 September 2020 through 18 September 2020
ER -