Ensemble Detection Method for Shilling Attacks Based on Deep Sparse Autoencoder

doi:10.3778/j.issn.1002-8331.1809-0297

Computer Engineering and Applications, 2019, Vol. 55, Issue (1): 9-22. DOI: 10.3778/j.issn.1002-8331.1809-0297

HAO Yaojun1,2, ZHANG Fuzhi2

1. Department of Computer, Xinzhou Teachers University, Xinzhou, Shanxi 034000, China
2. School of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China

Online: 2019-01-01 Published: 2019-01-07

Abstract

Abstract: In recommender systems based on collaborative filtering, malicious users can bias the recommendation output by injecting a large number of fake profiles, thereby achieving their attack goals. To detect such shilling attacks, existing methods extract features of attack profiles from different views, mainly based on users' ratings or on the hypothesis that attacks are concentrated in a short time. However, the performance of these methods relies heavily on the quality of manual feature engineering. Moreover, hand-crafted detection features are mostly limited to specific types of attack, and extracting them requires costly domain knowledge. To address these problems, this paper starts from users' temporal preferences for the items they rate and proposes an ensemble detection method for shilling attacks that uses a deep sparse autoencoder to extract detection features automatically. First, the wavelet transform is used to divide item popularity in different time intervals into several grades, and the rating data are preprocessed to obtain the user-item temporal popularity grade matrix. Second, the deep sparse autoencoder automatically extracts features from this matrix, yielding low-level feature representations of user rating patterns and eliminating manual feature engineering. Finally, with SVM as the base classifier, attacks are detected using the features extracted at each layer of the deep sparse autoencoder, and the final detection result is generated by voting over the per-layer results. Experimental results on the Netflix dataset indicate that the proposed method achieves good detection performance under average attack, AoP attack, shifting attack, power item attack, and power user attack.

Key words: collaborative filtering, shilling attacks, shilling attack detection, deep sparse autoencoder, item temporal popularity grade
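
The final voting step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the exact voting rule, so simple majority voting is assumed here, with ties resolved as "genuine"; the function name `majority_vote`, the label encoding (1 = attack profile, 0 = genuine profile), and the sample predictions are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def majority_vote(layer_predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine per-layer labels (1 = attack, 0 = genuine) by majority vote.

    layer_predictions[k][u] is the label that the classifier trained on the
    features of autoencoder layer k assigns to user u. Ties are resolved as
    genuine (0); this tie rule is an assumption of the sketch.
    """
    if not layer_predictions:
        raise ValueError("need predictions from at least one layer")
    n_users = len(layer_predictions[0])
    if any(len(p) != n_users for p in layer_predictions):
        raise ValueError("every layer must label the same set of users")

    ensemble = []
    # zip(*...) regroups the labels so that each tuple holds one user's votes.
    for user_votes in zip(*layer_predictions):
        attack_votes = sum(user_votes)
        ensemble.append(1 if 2 * attack_votes > len(user_votes) else 0)
    return ensemble


# Hypothetical outputs of three layer-wise classifiers for four users.
layer_predictions = [
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 1],
]
print(majority_vote(layer_predictions))  # [1, 0, 1, 0]
```

In this sketch a user is flagged as an attack profile only when more than half of the layer-wise classifiers agree, so one layer's false alarm (the fourth user above) does not by itself trigger a detection.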

HAO Yaojun, ZHANG Fuzhi. Ensemble Detection Method for Shilling Attacks Based on Deep Sparse Autoencoder[J]. Computer Engineering and Applications, 2019, 55(1): 9-22.
