计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (33): 143-145.

• 数据库、信号与信息处理 • 上一篇    下一篇

基于序列树的增量式序列模式更新算法

刘佳新1,严书亭2,贺春亮3,任家东4   

  1. 1.燕山大学 图书馆,河北 秦皇岛 066004
    2.燕山大学 科学技术研究院,河北 秦皇岛 066004
    3.燕山大学 图书馆网络中心,河北 秦皇岛 066004
    4.燕山大学 信息科学与工程学院,河北 秦皇岛 066004
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-11-21 发布日期:2011-11-21

Incremental sequential patterns updating algorithm based on sequence tree

LIU Jiaxin1,YAN Shuting2,HE Chunliang3,REN Jiadong4   

  1. 1.Library of Yanshan University,Qinhuangdao,Hebei 066004,China
    2.Science and Technology Administration Office,Yanshan University,Qinhuangdao,Hebei 066004,China
    3.Library Network Centre,Yanshan University,Qinhuangdao,Hebei 066004,China
    4.College of Information Science and Engineering,Yanshan University,Qinhuangdao,Hebei 066004,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-11-21 Published:2011-11-21

摘要: 在序列数据库更新时,现有的增量式序列模式挖掘算法只提到序列的插入操作和序列的扩展操作两种情况,没有针对序列删除操作。提出了一种基于序列树的增量式序列模式更新算法(ISPST)。当数据库更新时,ISPST算法只需要对与删除序列有关的序列构造投影数据库,实现对序列树的更新操作,通过深度优先遍历序列树得到更新后数据库中的所有序列模式。实验结果表明,当支持度发生变化时,ISPST算法在时间性能上优于PrefixSpan算法和IncSpan算法。

关键词: 序列模式, 增量式挖掘, 投影数据库, 序列树

Abstract: When the sequence database is updated,the existing incremental mining algorithms of sequential patterns only mention insert operation and append operation of sequences,and do not mention the delete operation.An incremental sequential patterns updating algorithm based on sequence tree is proposed,called ISPST.When the database is updated,ISPST only constructs the projected databases for the sequences associated with the deleted sequences,and completes the updated operations of the sequence tree,and finds all the sequential patterns through using depth-first search strategy to traverse the sequence tree.Experiments show that when the support is changed,ISPST outperforms PrefixSpan and IncSpan in time cost.

Key words: sequential patterns, incremental mining, projected database, sequence tree