计算机工程与应用 ›› 2010, Vol. 46 ›› Issue (33): 130-131.DOI: 10.3778/j.issn.1002-8331.2010.33.037

• 数据库、信号与信息处理 • 上一篇    下一篇

基于频繁访问页组的路径聚类研究

吴俊杰1,刘耀军1,陈俊杰2   

  1. 1.太原师范学院 计算机系,太原 030012
    2.太原理工大学 计算机与软件学院,太原 030024
  • 收稿日期:2009-04-02 修回日期:2009-06-17 出版日期:2010-11-21 发布日期:2010-11-21
  • 通讯作者: 吴俊杰

Research of path clustering based on frequently visited page groups

WU Jun-jie1,LIU Yao-jun1,CHEN Jun-jie2   

  1. 1.Department of Computer,Taiyuan Normal University,Taiyuan 030012,China
    2.College of Computer and Software,Taiyuan University of Technology,Taiyuan 030024,China
  • Received:2009-04-02 Revised:2009-06-17 Online:2010-11-21 Published:2010-11-21
  • Contact: WU Jun-jie

摘要: 基于用户会话的页面聚类算法旨在发现用户在浏览过程中频繁访问的页组,为站点管理员优化站点结构提供有力的依据。将介绍一种改进的基于频繁访问页组的路径聚类算法K-PathPlus,其中定义了新的兴趣度、内容链接因子。最后采用龙城热线网站日志进行真实测试,实验的结果是成功的。

关键词: 访问兴趣, 聚类, 路径聚类, 数据挖掘, 兴趣度, 内容链接因子

Abstract: The page clustering based on user sessions is to group the frequently visited pages,which can help the webmaster to optimize the site topology.This paper will introduce an improved clustering algorithm based on users’ access interest.K-PathPlus defines new interest degree,content-link ratio.In the end a true experiment is done by using www.ty.sx.cn log file.The result of experiment is successful.

Key words: access interest, clustering, path clustering, data mining, interest degree, content-link ratio

中图分类号: