Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (33): 143-145.

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Incremental sequential patterns updating algorithm based on sequence tree

LIU Jiaxin1,YAN Shuting2,HE Chunliang3,REN Jiadong4   

  1. 1.Library of Yanshan University,Qinhuangdao,Hebei 066004,China
    2.Science and Technology Administration Office,Yanshan University,Qinhuangdao,Hebei 066004,China
    3.Library Network Centre,Yanshan University,Qinhuangdao,Hebei 066004,China
    4.College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-11-21 Published:2011-11-21

基于序列树的增量式序列模式更新算法

刘佳新1,严书亭2,贺春亮3,任家东4   

  1. 1.燕山大学 图书馆,河北 秦皇岛 066004
    2.燕山大学 科学技术研究院,河北 秦皇岛 066004
    3.燕山大学 图书馆网络中心,河北 秦皇岛 066004
    4.燕山大学 信息科学与工程学院,河北 秦皇岛 066004

Abstract: When the sequence database is updated,the existing incremental mining algorithms of sequential patterns only mention insert operation and append operation of sequences,and do not mention the delete operation.An incremental sequential patterns updating algorithm based on sequence tree is proposed,called ISPST.When the database is updated,ISPST only constructs the projected databases for the sequences associated with the deleted sequences,and completes the updated operations of the sequence tree,and finds all the sequential patterns through using depth-first search strategy to traverse the sequence tree.Experiments show that when the support is changed,ISPST outperforms PrefixSpan and IncSpan in time cost.

Key words: sequential patterns, incremental mining, projected database, sequence tree

摘要: 在序列数据库更新时,现有的增量式序列模式挖掘算法只提到序列的插入操作和序列的扩展操作两种情况,没有针对序列删除操作。提出了一种基于序列树的增量式序列模式更新算法(ISPST)。当数据库更新时,ISPST算法只需要对与删除序列有关的序列构造投影数据库,实现对序列树的更新操作,通过深度优先遍历序列树得到更新后数据库中的所有序列模式。实验结果表明,当支持度发生变化时,ISPST算法在时间性能上优于PrefixSpan算法和IncSpan算法。

关键词: 序列模式, 增量式挖掘, 投影数据库, 序列树