计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (34): 161-163.

• 数据库、信号与信息处理 • 上一篇    下一篇

TRDFA:一种用于事务识别的DFA

李文骏1,黄金晶2   

  1. 1.苏州大学 图书馆,江苏 苏州 215006
    2.苏州工业职业技术学院 信息工程系,江苏 苏州 215104
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-12-01 发布日期:2011-12-01

TRDFA:transaction recognition based on Deterministic Finite Automata

LI Wenjun1,HUANG Jinjing2   

  1. 1.Library of Soochow University,Suzhou,Jiangsu 215006,China
    2.Department of Information?and?Engineering,Suzhou Institute of Industrial Technology,Suzhou,Jiangsu 215104,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-12-01 Published:2011-12-01

摘要: 在Web使用挖掘的预处理中,会话识别结束后需要经过路径补充将那些由于缓存而遗漏的页面补全,而后进行事务识别。构造了一个TRDFA,能从未经路径补充的会话中直接提取事务,并考虑了多窗口方式打开页面的情形。该方法对于只关心最终事务的用户而言,节省了整个预处理的时间,提高了效率。

关键词: 会话识别, 路径补充, 事务识别

Abstract: In the preprocessing of Web usage mining,it is necessary to do path supplement for those missing pages after session recognition because of cache.After path supplement,transaction recognition should be done.This paper constructs a Deterministic Finite Automata(DFA) called TRDFA,which can reconstruct transactions directly from sessions without doing path supplement.At the same time,TRDFA considers this situation that some pages are opened from the new window.For those users who just concern on transactions,this method saves time for the preprocessing.

Key words: session recognition, path supplement, transaction recognition