计算机工程与应用 ›› 2007, Vol. 43 ›› Issue (7): 191-194.

• 数据库与信息处理 • 上一篇    下一篇

Web挖掘中基于GITC算法发现用户频繁访问模式

欧阳一鸣 郭维 郭骏 孙超超   

  1. 合肥工业大学 计算机与信息学院 合肥工业大学计算机与信息学院 合肥工业大学计算机与信息系
  • 收稿日期:2006-03-27 修回日期:1900-01-01 出版日期:2007-03-01 发布日期:2007-03-01
  • 通讯作者: 欧阳一鸣

Discovery of User Frequent Access Patterns Based on GITC Algorithm on Web Mining

Ouyang Yiming,Guo Wei,Guo Jun , Sun Chao Chao   

  1. School of Computer and Information, Hefei University of Technology, Hefei,230009
  • Received:2006-03-27 Revised:1900-01-01 Online:2007-03-01 Published:2007-03-01

摘要: 用户频繁访问模式的发现是Web日志挖掘的重要研究内容。本文提出了一种先求两两用户访问模式的交集结果再生成候选频繁访问模式,然后扫描数据库,统计各个候选频繁访问模式的支持度计数的GITC算法。经过理论分析和实验验证,该算法能有效地发现用户频繁访问模式。

Abstract: The user frequent access patterns discovery is an important task of Web log mining study. The paper proposes GITC algorithm. The algorithm first gets the intersections of each two user access patterns and gives birth to candidate frequent access patterns, then takes count of the number of each candidate frequent access pattern by scanning the original database. Theory analysis and experimental results show that the GITC algorithm can discover user frequent access patterns effectively.