计算机工程与应用 ›› 2016, Vol. 52 ›› Issue (22): 99-104.

• 大数据与云计算 • 上一篇    下一篇

基于用户浏览日志的上下文相关新闻推荐

张骁逸1,苏  宇2,晏小辉3   

  1. 1.北京邮电大学 理学院,北京 100876
    2.新华社技术局 技术实验室,北京 100803
    3.中国科学院 计算技术研究所,北京 100190
  • 出版日期:2016-11-15 发布日期:2016-12-02

Context-awareness recommendation based on user browsing log

ZHANG Xiaoyi1, SU Yu2, YAN Xiaohui3   

  1. 1.School of Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2.Laboratory of Xinhua News Agency and Technology Bureau, Beijing 100803, China
    3.Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
  • Online:2016-11-15 Published:2016-12-02

摘要: 个性化新闻推荐的难点在于用户在浏览新闻时目的性不强,容易受各种环境因素的影响,导致其浏览行为难以预测。以往的研究仅仅强调推荐内容相关的或者和用户长期兴趣相匹配的新闻,忽视了环境因素的影响。为此,需要研究上下文相关的新闻推荐算法。具体做法是从用户的浏览日志中提取上下文相关特征,然后训练一个Logistic回归模型来预测用户接下来最可能阅读的新闻。真实数据上的实验结果表明,上下文相关新闻推荐方法效果明显好于传统方法,也验证了上下文信息对用户浏览行为的重要影响。

关键词: 新闻推荐, 上下文相关, Logistic回归

Abstract: The challenge of personalized news recommendation lies in that users browse news without clear purposes. Thus they are susceptible to various environmental factors, making their browsing behavior hard to predict. Previous studies focus on recommending news based on the content similarity or users’ long-term interests, which ignores the impact of environmental factors. Therefore, there is a need to study context-aware news recommendation algorithm. Specifically, contextual features are extracted from the user’s browser log, and then are used to train a Logistic regression classifier to predict the most possible news users are going to read. Experimental results on real-world data set suggest that the context-aware method outperforms substantially traditional methods, and verify the important impact of contextual information on users’ browsing behavior.

Key words: news recommendation, context awareness, Logistic regression