计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (24): 148-150.DOI: 10.3778/j.issn.1002-8331.2008.24.044

• 数据库、信号与信息处理 • 上一篇    下一篇

一种基于OUS的最大频繁项集挖掘算法

张忠平,宋晓辉,宋少英   

  1. 燕山大学 信息科学与工程学院,河北 秦皇岛 066004
  • 收稿日期:2008-03-12 修回日期:2008-05-19 出版日期:2008-08-21 发布日期:2008-08-21
  • 通讯作者: 张忠平

Algorithm for mining maximal frequent itemsets based on OUS

ZHANG Zhong-ping,SONG Xiao-hui,SONG Shao-ying   

  1. College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China
  • Received:2008-03-12 Revised:2008-05-19 Online:2008-08-21 Published:2008-08-21
  • Contact: ZHANG Zhong-ping

摘要: 分析实际应用中有效访问序列的特点,提出了一种采用自底向上策略快速挖掘最大频繁项集的OUS算法。该算法首先对用户项集进行重叠操作统计浏览次数,然后合并,依据用户给出的最小支持度删除原项集中的非频繁页面元素,并对两两用户项集筛选生成候选频繁项集,最后扫描数据库,统计各个候选频繁项集的支持度计数。实验结果表明,该算法能有效地发现用户最大频繁项集。

关键词: 有效访问序列, 重叠, 筛选, 合并, 最大频繁项集

Abstract: The characteristics of effective access sequence in the actual application are analyzed and an efficient algorithm OUS based bottom-up strategy is proposed for mining maximal frequent itemsets.The algorithm first takes count of the browse number of each access sequence by overlapping operation,then unites and deletes the unfrequent page items according to minimum support degree given by users,afterwards sifts getting the intersections of each two user access pattern and gives birth to candidate grequent access patterns,at last,adds up the number of each candidate frequent access pattern by scanning the original database.Experimental results show that the OUS algorithm can discover user maximal frequent access patterns effectively.

Key words: effective access sequence, overlap, sift, unite, maximal frequent itemset