计算机工程与应用 ›› 2010, Vol. 46 ›› Issue (10): 209-212.DOI: 10.3778/j.issn.1002-8331.2010.10.065

• 工程与应用 • 上一篇    下一篇

一种新的预测用户浏览模式的度量方法

陈 佳,吴军华   

  1. 南京工业大学 信息科学与工程学院,南京 210009
  • 收稿日期:2008-09-23 修回日期:2008-12-15 出版日期:2010-04-01 发布日期:2010-04-01
  • 通讯作者: 陈 佳

New measuring method to predict users’ browsing patterns

CHEN Jia,WU Jun-hua   

  1. College of Information Science and Engineering,Nanjing University of Technology,Nanjing 210009,China
  • Received:2008-09-23 Revised:2008-12-15 Online:2010-04-01 Published:2010-04-01
  • Contact: CHEN Jia

摘要: 在Web环境中,度量用户的浏览模式对Web站点结构的改进是有益的。挖掘和度量Web日志能够识别用户的访问模式模型,Web站点管理者能够应用这些模型研究用户的访问偏爱度,由此改进站点的体系结构以及分析这些改进带来的影响。因此,提出用户群偏爱度这样一个新概念,并使用了基于用户群的模糊聚类算法(UGFC),然后根据聚类结果,即具有相似访问习惯的用户群体,度量用户群偏爱度,再基于用户群偏爱度,利用混合阶Markov模型(HOMM)进行预测。实验表明,这种新的度量预测方法(UGFC-HOMM)比传统Markov模型(TMM)预测更准确,并且实验用精确率、覆盖率和运行时间这3个度量评价值对预测性能进行评估。

关键词: Web日志, 用户群偏爱度, 模糊聚类算法, 混合阶Markov模型, 预测

Abstract: In the Web environment,measuring users’ browsing patterns can benefit the improvement of framework of Web sites.Mining and measuring Web logs are able to identify users’ navigation patterns models,Web-masters can apply these models for studying users’ access favoritism to improve site organization and analyze the effects of changes to their Web sites.So,in this paper,a new conception of user group favoritism is proposed,and Fuzzy Clustering algorithm based on User Group(UGFC) is used,and then,according to clustering result—user group having similar access habit,user group favoritism is measured,based on which the Hybrid-Order Markov Model(HOMM) is used to predict.This new prediction metrics approach(UGFC-HOMM) shows that it is superior to Traditional Markov Model(TMM) in users’ access prediction.Three evaluation metrics are applied to evaluate performance of prediction,namely,precision,coverage and runtime.

Key words: Web logs, user group favoritism, fuzzy clustering algorithm, hybrid-order Markov model, prediction

中图分类号: