计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (36): 146-149.DOI: 10.3778/j.issn.1002-8331.2008.36.041

• 数据库、信号与信息处理 • 上一篇    下一篇

解决数据样本不平衡性的频繁子图挖掘算法

谢 玓,尚学群,王 淼,张延园   

  1. 西北工业大学 计算机学院,西安 710072
  • 收稿日期:2008-06-02 修回日期:2008-08-11 出版日期:2008-12-21 发布日期:2008-12-21
  • 通讯作者: 谢 玓

Algorithm considering imbalance across datasets for mining frequent subgraphs

XIE Di,SHANG Xue-qun,WANG Miao,ZHANG Yan-yuan   

  1. School of Computer,Northwestern Polytechnical University,Xi’an 710072,China
  • Received:2008-06-02 Revised:2008-08-11 Online:2008-12-21 Published:2008-12-21
  • Contact: XIE Di

摘要: 传统的图挖掘算法应用到生物数据上有其局限性。根据生物网络的特性,通过引入相对支持度的概念,提出了一种解决数据样本间不平衡性的频繁子图挖掘算法——IFS算法。通过对真实的蛋白质互作网络进行处理,证明该算法是可行的。

关键词: 不平衡性, 节点支持度, 相对支持度

Abstract: Traditional algorithms for frequent subgraphs mining have limits when dealing with biological datasets.Biological network has its own characters.Based on these characters,authors propose a new algorithm considering imbalance across datasets,called IFS(Iterated Function System),for mining frequent subgaraphs by relative support.Through dealing with the real protein interaction networks,it is proved that the algorithm is feasible.

Key words: imbalance, vertex support, relative support