Computer Engineering and Applications ›› 2010, Vol. 46 ›› Issue (33): 130-131.DOI: 10.3778/j.issn.1002-8331.2010.33.037

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Research of path clustering based on frequently visited page groups

WU Jun-jie1,LIU Yao-jun1,CHEN Jun-jie2   

  1. 1.Department of Computer,Taiyuan Normal University,Taiyuan 030012,China
    2.College of Computer and Software,Taiyuan University of Technology,Taiyuan 030024,China
  • Received:2009-04-02 Revised:2009-06-17 Online:2010-11-21 Published:2010-11-21
  • Contact: WU Jun-jie

基于频繁访问页组的路径聚类研究

吴俊杰1,刘耀军1,陈俊杰2   

  1. 1.太原师范学院 计算机系,太原 030012
    2.太原理工大学 计算机与软件学院,太原 030024
  • 通讯作者: 吴俊杰

Abstract: The page clustering based on user sessions is to group the frequently visited pages,which can help the webmaster to optimize the site topology.This paper will introduce an improved clustering algorithm based on users’ access interest.K-PathPlus defines new interest degree,content-link ratio.In the end a true experiment is done by using www.ty.sx.cn log file.The result of experiment is successful.

Key words: access interest, clustering, path clustering, data mining, interest degree, content-link ratio

摘要: 基于用户会话的页面聚类算法旨在发现用户在浏览过程中频繁访问的页组,为站点管理员优化站点结构提供有力的依据。将介绍一种改进的基于频繁访问页组的路径聚类算法K-PathPlus,其中定义了新的兴趣度、内容链接因子。最后采用龙城热线网站日志进行真实测试,实验的结果是成功的。

关键词: 访问兴趣, 聚类, 路径聚类, 数据挖掘, 兴趣度, 内容链接因子

CLC Number: