A Personalized (l, c)-Anonymity Algorithm Based on Clustering

Computer Engineering and Applications, 2012, Vol. 48, Issue (23): 16-20.

WANG Pingshui 1,2, WANG Jiandong 1

1. College of Computer Science and Technology, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China
2. School of Management Science and Engineering, Anhui University of Finance & Economics, Bengbu, Anhui 233030, China

Online: 2012-08-11 Published: 2012-08-21

Abstract

Abstract: Most existing l-diversity anonymity algorithms treat all sensitive attribute values equally, without considering their degree of sensitivity or their actual distribution, which leaves them vulnerable to similarity attacks and skewness attacks. In addition, they apply full-domain generalization when building equivalence classes, which causes high information loss. This paper proposes a personalized (l, c)-anonymity algorithm based on clustering. It improves the security of data release by defining a maximal ratio threshold and a sensitivity for each sensitive attribute value, and it reduces information loss by using clustering to generate the equivalence classes. Theoretical analysis and experimental results indicate that the method is effective and feasible.

Key words: data release, privacy preservation, l-diversity, similarity attack, skewness attack
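
The abstract does not give the formal definition of the (l, c) constraint, so the following is only a minimal sketch under stated assumptions. It assumes that an equivalence class is acceptable when it contains at least l distinct sensitive values and when no single sensitive value accounts for more than a fraction c of its records (one plausible reading of the "maximal ratio threshold"). The function name `satisfies_l_c` and the sample values are hypothetical, and the per-value sensitivity mentioned in the abstract is not modeled here.

```python
from collections import Counter


def satisfies_l_c(sensitive_values, l, c):
    """Check a hypothetical (l, c) condition on one equivalence class.

    Assumption (not taken from the paper): the class passes when it has
    at least `l` distinct sensitive values and the most frequent value
    covers at most a fraction `c` of the records.
    """
    if not sensitive_values:
        return False  # an empty class cannot satisfy any diversity requirement
    counts = Counter(sensitive_values)
    if len(counts) < l:
        return False  # too few distinct sensitive values
    max_ratio = max(counts.values()) / len(sensitive_values)
    return max_ratio <= c  # limits skew toward a single value


# A class dominated by one value fails the ratio check even though
# it contains two distinct values.
print(satisfies_l_c(["flu", "flu", "flu", "cancer"], l=2, c=0.5))
print(satisfies_l_c(["flu", "flu", "cancer", "hiv"], l=3, c=0.5))
```

Bounding the ratio of the most frequent value is a common way to limit skewness attacks. Guarding against similarity attacks would additionally require grouping sensitive values by sensitivity, which this sketch leaves out.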

WANG Pingshui, WANG Jiandong. Personalized (l, c)-anonymity algorithm based on clustering[J]. Computer Engineering and Applications, 2012, 48(23): 16-20.
