计算机工程与应用 ›› 2020, Vol. 56 ›› Issue (7): 24-29.DOI: 10.3778/j.issn.1002-8331.1909-0129

• 热点与综述 • 上一篇    下一篇

融合用户兴趣和评分差异的协同过滤推荐算法

陆航,师智斌,刘忠宝   

  1. 1.中北大学 大数据学院 大数据与网络安全研究所,太原 030051
    2.中北大学 软件学院,太原 030051
  • 出版日期:2020-04-01 发布日期:2020-03-28

Collaboration Filtering Recommendation Algorithm Based on User Interest and Ratings Difference

LU Hang, SHI Zhibin, LIU Zhongbao   

  1. 1.Research Institute of Big Data and Network Security, School of Big Data, North University of China, Taiyuan 030051, China
    2.School of Software, North University of China, Taiyuan 030051, China
  • Online:2020-04-01 Published:2020-03-28

摘要:

针对传统的协同过滤算法中单一评分相似性计算不准确的问题,提出融合用户兴趣和评分差异的协同过滤推荐算法。将TF-IDF思想运用到用户对标签的权重计算中,并使用指数衰减函数和时间窗口捕捉用户兴趣的变化;根据历史评分矩阵,充分考虑用户评分值差异、评判准则差异、影响力差异和项目影响差异等影响因子,定义了一种评分差异相似性度量算法;最后将用户兴趣相似性和评分差异相似性进行加权融合,获取更加准确的用户邻居,从而预测项目评分并进行推荐。在数据集Movielens的实验表明,提出的算法能有效提高推荐精度。

关键词: 协同过滤, TF-IDF, 指数衰减函数, 时间窗口, 差异性

Abstract:

Aiming at the inaccuracy of single rating similarity calculation in traditional collaborative filtering algorithm, a collaborative filtering recommendation algorithm based on user interest and the ratings difference is proposed. Firstly, the TF-IDF is integrated into the tag weight calculation. An exponential decay function and a time window are used to capture the change of user interest. Secondly, according to the rating matrix, ratings difference similarity algorithm is defined by considering the difference of ratings, evaluation criteria, user and item influence. Finally, user interest similarity and ratings difference similarity are weighted and fused to obtain the nearest neighbors more accurately, which predicts item rating and makes recommendations. Experiments on the Movielens dataset show that the proposed algorithm can effectively improve the recommendation accuracy.

Key words: collaborative filtering, TF-IDF, exponential decay function, time window, difference