Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (8): 9-11.

Previous Articles     Next Articles

Design and implementation of Uighur generalized suffix tree construction algorithm

Maimaitiyiming Hasimu1,2, Wushour Silamu1, Weinila Mushajiang1   

  1. 1.School of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
    2.Department of Computer Science, Hotan Teachers College, Hotan, Xinjiang 848000, China
  • Online:2013-04-15 Published:2013-04-15

维吾尔文后缀树构造算法的设计与实现

买买提依明·哈斯木1,2,吾守尔·斯拉木1,维尼拉·木沙江1   

  1. 1.新疆大学 信息科学与工程学院,乌鲁木齐 830046
    2.和田师范专科学校 计算机科学系,新疆 和田 848000

Abstract: Suffix Tree Clustering(STC) have been applied to web page clustering problems. In order to use the STC algorithm to cluster Uighur page, this paper analyzes the characteristics of the generalized suffix tree and Uighur features to design the Uighur generalized suffix tree construction algorithm. The experimental result shows that the method can construct Uighur suffix tree in linear time range, and it can be used to cluster Uighur web page.

Key words: suffix, suffix tree, generalized suffix tree, node, prefix

摘要: 为用后缀树聚类算法对维吾尔文网页进行聚类,通过分析可扩展后缀树和维吾尔文的特点设计了维吾尔文后缀树构造算法。实验结果证明该方法能够在线性的时间范围内构造维吾尔文后缀树,并用它来对维吾尔文网页进行聚类。

关键词: 后缀, 后缀树, 可扩展后缀树, 节点, 公共前缀