Joined pattern segment-based sequential patternmining algorithm for biological datasets

Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (2): 190-193.

• 数据库与信息处理 • Previous Articles Next Articles

Joined pattern segment-based sequential patternmining algorithm for biological datasets

WANG Miao,SHANG Xue-qun,XUE He

School of Computer，Northwestern Polytechnical University，Xi’an 710072，China

Received:1900-01-01 Revised:1900-01-01 Online:2008-01-11 Published:2008-01-11
Contact: WANG Miao

基于相邻模式段组合的生物序列模式挖掘算法

王淼,尚学群,薛贺

西北工业大学计算机学院，西安 710072

通讯作者: 王淼

Abstract

Abstract: Traditional algorithms for sequential pattern mining have limits when dealing with biological datasets.Biology sequence has its own characters.Based on these characters，the author develops Joined frequent Pattern Segment approach，JPS，for mining biological sequences.First，the joined frequent pattern segments are produced.Then，longer frequent patterns can be obtained by combining the above segments.The experiment shows JPS has better performance than PrefixSpan.Through dealing with the real protein family database，it is proved that the algorithm can deal with biology sequence data efficiently.

Key words: prefix, frequent set, joined frequent pattern segment, pattern combination

摘要： 传统的序列模式挖掘算法应用在生物序列上有其局限性，根据生物序列的特点，提出了基于相邻频繁模式段的模式挖掘算法－JPS。首先产生相邻频繁模式段，然后对这些频繁模式段进行组合，产生新的频繁模式。通过实验分析，该方法在相似性很强的序列数据库中比传统的PrefixSpan算法效率高。通过对真实的蛋白质序列家族库的处理，证明该算法能有效处理生物序列数据。

关键词: 前缀, 频繁集, 相邻频繁模式段, 模式组合

WANG Miao,SHANG Xue-qun,XUE He. Joined pattern segment-based sequential patternmining algorithm for biological datasets[J]. Computer Engineering and Applications, 2008, 44(2): 190-193.

王淼,尚学群,薛贺. 基于相邻模式段组合的生物序列模式挖掘算法[J]. 计算机工程与应用, 2008, 44(2): 190-193.

[1]	HONG Zheng, TIAN Yifan, ZHANG Hongze, WU Lifa. Extended prefix tree based protocol format inference [J]. Computer Engineering and Applications, 2018, 54(12): 14-20.
[2]	ZHU Shumei, WANG Cheng. Image categorization based on maximum frequent item-sets [J]. Computer Engineering and Applications, 2016, 52(23): 181-184.
[3]	YANG Xiaofei, NIU Cuicui, DING Zhipeng, ZHANG Hongyu. Fast greedy name lookup strategy for NDN [J]. Computer Engineering and Applications, 2016, 52(11): 44-49.
[4]	DING Bangxu, HUANG Yongqing. Algorithm of matrix and Prefix-tree for mining frequent itemsets [J]. Computer Engineering and Applications, 2015, 51(22): 154-157.
[5]	ZENG Dangquan. Chinese text compression algorithm based on PDC coding [J]. Computer Engineering and Applications, 2015, 51(17): 205-209.
[6]	Maimaitiyiming Hasimu1，2, Wushour Silamu1, Weinila Mushajiang1. Design and implementation of Uighur generalized suffix tree construction algorithm [J]. Computer Engineering and Applications, 2013, 49(8): 9-11.
[7]	GAO Haiyang1，2, SHEN Qiang1, ZHANG Xuanyi1, ZHAO Zhijun1. Improved Apriori based on data compression [J]. Computer Engineering and Applications, 2013, 49(14): 117-120.
[8]	YANG Ning, CHEN Qun. Dewey encoding storage for XML keyword search [J]. Computer Engineering and Applications, 2013, 49(1): 137-140.
[9]	YUAN Hejin. Modified PrefixSpan algorithm for video target trajectory analysis [J]. Computer Engineering and Applications, 2011, 47(32): 7-10.
[10]	PU Bin，ZHAO Haijun，LI Mingdong. Realization of symbol demarcation synchronous algorithm using cyclic prefix in OFDM [J]. Computer Engineering and Applications, 2011, 47(29): 146-148.
[11]	LIANG Gang¹，ZHAO Wei²，ZHANG Xun-ying³. Research and design of parallel architecture of distributed arithmetic [J]. Computer Engineering and Applications, 2010, 46(12): 75-78.
[12]	XIE Zhi-qiang¹,GAO Peng-fei¹,YANG Jing². Research on improved algorithm of DES based on prefix codes [J]. Computer Engineering and Applications, 2009, 45(9): 92-94.
[13]	CHEN Bo,WANG Le,DONG Peng. New algorithm for mining maximum frequent itemsets based on datasets iteration [J]. Computer Engineering and Applications, 2009, 45(6): 141-144.
[14]	GONG Guo-qiang,GE Wan-cheng. Symbol synchronization method for mobile digital television receiver [J]. Computer Engineering and Applications, 2009, 45(3): 113-115.
[15]	DAI Zu-xu，CHEN Jing. Prefix code based random number generator [J]. Computer Engineering and Applications, 2009, 45(29): 82-83.

Joined pattern segment-based sequential patternmining algorithm for biological datasets

基于相邻模式段组合的生物序列模式挖掘算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics