Web text clustering based on concept lattice

doi:10.3778/j.issn.1002-8331.2008.23.052

Computer Engineering and Applications, 2008, Vol. 44, Issue (23): 169-171.

Database, Signal and Information Processing

LI Yun, TIAN Su-fang, LI Tuo, XU Tao

Institute of Information Engineering, Yangzhou University, Yangzhou, Jiangsu 225009, China

Received: 2007-10-09 Revised: 2007-12-17 Online: 2008-08-11 Published: 2008-08-11
Contact: LI Yun


Abstract: Most Web text clustering methods are based on the vector space model for text representation. This model ignores the semantic relations among the terms in a text, and the term space has very high dimensionality, which leads to a loss of text semantics and an increase in time complexity. In this paper, each text is treated as an object and the terms of the text are abstracted as the corresponding attributes, so that a formal context is built from the texts. Concepts are then extracted from the formal context to represent the texts and to measure the similarity between them. This reduces the dimensionality of the term space and lowers the computational complexity. Theoretical analysis shows that the clustering method is effective.

Key words: Web document, clustering, concept lattice, reduction
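The idea summarized in the abstract can be illustrated with a minimal sketch. It assumes a tiny hypothetical corpus, a brute-force enumeration of formal concepts, and a Jaccard-style similarity over the concepts that contain each text. None of these details come from the paper itself: the names `context`, `formal_concepts`, and `concept_similarity` are illustrative, and the paper's actual reduction and similarity measure may differ.

```python
from itertools import combinations

# Hypothetical toy corpus: each text (object) maps to its set of terms (attributes).
context = {
    "d1": {"lattice", "concept", "cluster"},
    "d2": {"lattice", "concept"},
    "d3": {"cluster", "web"},
    "d4": {"web", "page"},
}

def common_terms(texts):
    """Derivation operator: terms shared by every text in `texts`."""
    sets = [context[t] for t in texts]
    if not sets:
        # By convention, the empty set of texts shares all terms.
        return set().union(*context.values())
    return set.intersection(*sets)

def texts_with(terms):
    """Derivation operator: texts that contain every term in `terms`."""
    return {t for t, attrs in context.items() if terms <= attrs}

def formal_concepts():
    """Enumerate all (extent, intent) pairs by closing every subset of texts.

    Exponential in the number of texts, so suitable for a toy example only.
    """
    found = set()
    names = sorted(context)
    for r in range(len(names) + 1):
        for subset in combinations(names, r):
            intent = common_terms(subset)
            extent = texts_with(intent)
            found.add((frozenset(extent), frozenset(intent)))
    return found

def concept_similarity(a, b, concepts):
    """Assumed measure: Jaccard overlap of the concepts whose extent contains each text."""
    ca = {c for c in concepts if a in c[0]}
    cb = {c for c in concepts if b in c[0]}
    return len(ca & cb) / len(ca | cb)

concepts = formal_concepts()
print(len(concepts))
print(concept_similarity("d1", "d2", concepts))
print(concept_similarity("d1", "d4", concepts))
```

In this sketch, texts that share terms fall into common concepts and therefore score higher than texts that meet only in the top concept (all texts, no shared terms). A practical implementation would replace the brute-force enumeration with a dedicated lattice construction algorithm.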


LI Yun, TIAN Su-fang, LI Tuo, XU Tao. Web text clustering based on concept lattice[J]. Computer Engineering and Applications, 2008, 44(23): 169-171.


