Semantic similarity computation of concepts and documents

doi:10.3778/j.issn.1002-8331.2008.35.049

Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (35): 163-167.DOI: 10.3778/j.issn.1002-8331.2008.35.049

• 数据库、信号与信息处理 • Previous Articles Next Articles

Semantic similarity computation of concepts and documents

SONG Ling¹,GUO Jia-yi²,ZHANG Dong-mei¹,TANG Xiao-bing¹,GAO Nan¹

1.School of Computer Science & Technology，Shandong Jianzhu University，Jinan 250101，China
2.Beijing Information Resource Management Center，Beijing 100082，China

Received:2007-12-19 Revised:2008-03-31 Online:2008-12-11 Published:2008-12-11
Contact: SONG Ling

概念与文档的语义相似度计算

宋玲¹,郭家义²,张冬梅¹,汤晓兵¹,高楠¹

1.山东建筑大学计算机科学与技术学院，济南 250101
2.北京市信息资源管理中心，北京 100082

通讯作者: 宋玲

Abstract

Abstract: A novel method that integrates core ontology as background knowledge into the process of computing similarity of concepts and documents is proposed.Ontology is represented as a graph-based model that reflects semantic relationship between concepts，with which a concept and a document are extended to a semantic fuzzy set.Then fuzzy similarity between two fuzzy sets is computed.Documents comparison is based on concepts comparison.A semantic similarity matrix that exploits semantic relation of the ontology is defined，and fuzzy similarity measure based on shared information content is proposed in the processing of concepts comparison.

Key words: concept similarity, document similarity, ontology, documents clustering

摘要： 将本体作为背景知识引入到概念之间相似度和文档之间相似度的计算中。通过图模型表示本体中概念以及概念之间的语义关系，用来将一个概念和一个文档扩展为一个语义模糊集，并计算模糊集合之间的相似度。文档相似度的计算是在概念相似度计算的基础之上。在概念相似度的计算过程中引入了语义相似度矩阵以及基于共信息理论的模糊相似度方法。

关键词: 概念相似度, 文档相似度, 本体, 文档聚类

SONG Ling¹,GUO Jia-yi²,ZHANG Dong-mei¹,TANG Xiao-bing¹,GAO Nan¹. Semantic similarity computation of concepts and documents[J]. Computer Engineering and Applications, 2008, 44(35): 163-167.

宋玲¹,郭家义²,张冬梅¹,汤晓兵¹,高楠¹. 概念与文档的语义相似度计算[J]. 计算机工程与应用, 2008, 44(35): 163-167.

[1]	YANG Quan. SVM Algorithm for N1+N2 Structure Syntax Relation Determination [J]. Computer Engineering and Applications, 2021, 57(20): 104-108.
[2]	GAO Jian, WANG An. Research on Ontology-Based Network Threat Intelligence Analysis Technology [J]. Computer Engineering and Applications, 2020, 56(11): 112-117.
[3]	LI Jinhai, HE Youshi, MA Yunlei, ZHOU Aiping. Research of Universal Model for Knowledge Representation Based on Multi-layer Domain Ontology [J]. Computer Engineering and Applications, 2020, 56(11): 149-155.
[4]	WANG Siyu, HE Jingsha, TENG Da. Access Control System Supporting Quantification and Protection of Privacy Information [J]. Computer Engineering and Applications, 2019, 55(16): 77-87.
[5]	QU Hanhua1, HUI Jianzhong1, HE Xianfeng2, WANG Muhua1, HE Xiaofeng1, FENG De’en1. Formal concept analysis model of meteorological services [J]. Computer Engineering and Applications, 2018, 54(9): 257-264.
[6]	YOU Yun1，2，3, WAN Changxuan1，3, CHEN Huangye1，3. Diversified commodity recommendation method considering correlation between objects [J]. Computer Engineering and Applications, 2018, 54(7): 70-76.
[7]	WANG Zhen. Engineering change impact analysis based on design knowledge ontology [J]. Computer Engineering and Applications, 2018, 54(5): 247-252.
[8]	ZHU Wenyue，LIU Zongtian. Emergency domain knowledge modeling based on event ontology [J]. Computer Engineering and Applications, 2018, 54(21): 148-155.
[9]	PENG Fei1, ZHANG Tao1, XU Weiguang1, ZHAO Min1, QIN Hengjia2. Security policy generation model of operating system based on ontology [J]. Computer Engineering and Applications, 2018, 54(2): 114-118.
[10]	WANG Hong, ZHANG Hao, SHI Jinchuan. Research on domain ontology concept acquisition method based on Latent Dirichlet Allocation [J]. Computer Engineering and Applications, 2018, 54(13): 252-257.
[11]	WANG Jiahai, CHEN Yu. Data driven Job Shop production scheduling knowledge mining and optimization [J]. Computer Engineering and Applications, 2018, 54(1): 264-270.
[12]	HAN Xueren1, WANG Qingshan1, GUO Yong1, CUI Xingya2. Geographic ontology concept semantic similarity measure model based on BP neural network optimized by PSO [J]. Computer Engineering and Applications, 2017, 53(8): 32-37.
[13]	CHEN Heng1，2, LI Guanyu2, CHEN Xinying2，3. Modular thinking’s application in large scale ontology matching [J]. Computer Engineering and Applications, 2017, 53(8): 149-153.
[14]	BAO Xuguang, WANG Lizhen, CHEN Hongmei. Co-location-based site selection using ontologies [J]. Computer Engineering and Applications, 2017, 53(24): 15-22.
[15]	WANG Qinglin1，2, XUE Huifeng1，3. Virtual knowledge flow generation algorithm supporting complex product systems design [J]. Computer Engineering and Applications, 2017, 53(22): 29-34.

Semantic similarity computation of concepts and documents

概念与文档的语义相似度计算

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics