Mining frequent access patterns on Web mining based on BIPL algorithm

doi:10.3778/j.issn.1002-8331.2008.23.042

Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (23): 136-138.DOI: 10.3778/j.issn.1002-8331.2008.23.042

• 数据库、信号与信息处理 • Previous Articles Next Articles

Mining frequent access patterns on Web mining based on BIPL algorithm

WU Ya-shuang,ZHANG Dong-zhan

Department of Computer Science，Xiamen University，Xiamen，Fujian 361005，China

Received:2008-02-22 Revised:2008-04-29 Online:2008-08-11 Published:2008-08-11
Contact: WU Ya-shuang

基于BIPL的Web频繁访问模式挖掘

吴雅双,张东站

厦门大学计算机科学系，福建厦门 361005

通讯作者: 吴雅双

Abstract

Abstract: Mining frequent access patterns is an important task of Web log mining.In connection with the shortage of the similar Apriori algorithm and the GITC algorithm，the paper presents BIPL algorithm which is used to mine the Web frequent access patterns.The algorithm is based on parents list and intersection，and requests to scan the database only one times.It first gets the intersections of each two access patterns and gives the birth to candidate access patterns.And the parents access patterns of each candidate access pattern are saved in the process of intersection.Then the counts of all the candidate access patterns can be calculated easily through add operational.Finally，the algorithm is proved to be stable and efficient through theoretical analysis and experimental proof.

Key words: Web log mining, intersection relation, frequent access pattern

摘要： 挖掘频繁访问模式是Web日志挖掘的一个重要任务。针对类Apriori算法和GITC算法的不足，提出了基于双亲链的单次扫描求交的Web频繁访问模式挖掘算法—BIPL，该算法首先对用户的访问模式两两进行交集运算，生成候选访问模式，并在求交集过程中保存各个候选访问模式的双亲模式，然后通过简单的求和运算，计算出各个候选访问模式的支持数。最后通过理论分析和实验验证，该算法是稳定的和高效的。

关键词: Web日志挖掘, 交集关系, 频繁访问模式

WU Ya-shuang,ZHANG Dong-zhan. Mining frequent access patterns on Web mining based on BIPL algorithm[J]. Computer Engineering and Applications, 2008, 44(23): 136-138.

吴雅双,张东站. 基于BIPL的Web频繁访问模式挖掘[J]. 计算机工程与应用, 2008, 44(23): 136-138.

[1]	LI Bin, LIU Lili. Weblog mining based on MapReduce [J]. Computer Engineering and Applications, 2012, 48(22): 95-98.
[2]	LI Xue-jun¹,LI Long-shu²,XU Yi¹. Research on Web user’s behavior prediction based on rough set [J]. Computer Engineering and Applications, 2008, 44(13): 134-136.
[3]	Ouyang Yiming，Guo Wei，Guo Jun , Sun Chao Chao. Discovery of User Frequent Access Patterns Based on GITC Algorithm on Web Mining [J]. Computer Engineering and Applications, 2007, 43(7期): 191-194.

Mining frequent access patterns on Web mining based on BIPL algorithm

基于BIPL的Web频繁访问模式挖掘

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 3

Recommended Articles

Metrics