Research on XML semantic retrieval

Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (11): 121-125.

Previous Articles Next Articles

Research on XML semantic retrieval

MO Zhengbo 1, SONG Ling2, LV Qiang3, DENG Wei4

1.School of Science, Qingdao Technological University, Qingdao, Shandong 266033, China
2.School of Computer Science and Technology, Shandong Jianzhu University, Jinan 250101, China
3.Department of Power Grid Maintenance Training, State Grid of China Technology College, Jinan 250002, China
4.Basic Courses Department, Shandong University of Science and Technolagy, Tai’an, Shandong 271021, China

Online:2013-06-01 Published:2013-06-14

XML文档语义检索方法研究

莫正波1，宋玲2，吕强3，邓薇4

1.青岛理工大学理学院，山东青岛 266033
2.山东建筑大学计算机科学与技术学院，济南 250101
3..国网技术学院电网检修培训部，济南 250002
4.山东科技大学基础课部，山东泰安 271021

Abstract

Abstract: With more and more application of semi-structure data, the research of XML document similarity becomes essential in the database and information retrieval communities. Given set of XML documents D and the user query q, XML retrieval is to find out the XML documents from the D which satisfies q. In order to search efficiently, a new approach is presented to calculate similarity between two XML documents. The approach is divided into three steps. The user’s query q is expanded to q' by including the synonyms of q based on WordNet. q' and each XML document in D are allocated to digital signatures. After eliminating the irrelevant documents in D according to the signatures matching, a subset D' of D is got. Precise matching between q and D' is presented and final results are got.

Key words: Extensive Makeup Language（XML）, semi-structure data, similarity

摘要： 由于半结构文档如XML越来越广泛的应用，在数据库和信息检索领域，对半结构XML数据相似度的研究也变得尤为重要。给定XML文档集D和用户查询q，XML检索即是从D中查找出符合q的XML文档。为了有效地进行XML信息检索，提出了一种新的计算用户查询与XML文档之间相似度的算法。该算法分为三步：基于WordNet对用户查询q进行同义词扩展得到q'；将q'和D中的每一篇XML文档都进行数字签名，并通过签名之间的匹配对D进行有效过滤，除去大量不符合用户查询的文档，得到一个文档子集D'，[D'?D]；对q'与D'中的文档进行精确匹配得到检索结果。

关键词: 可扩展标示语言（XML）, 半结构文档, 相似度

MO Zhengbo 1, SONG Ling2, LV Qiang3, DENG Wei4. Research on XML semantic retrieval[J]. Computer Engineering and Applications, 2013, 49(11): 121-125.

莫正波1，宋玲2，吕强3，邓薇4. XML文档语义检索方法研究[J]. 计算机工程与应用, 2013, 49(11): 121-125.

[1]	ZHANG Qishan, CHEN Lulu. Slope One Algorithm Based on Grey Correlational Analysis by Method of Degree of Balance and Approach [J]. Computer Engineering and Applications, 2021, 57(9): 96-102.
[2]	WANG Yonggui, LI Qianyu. Hybrid Collaborative Filtering Recommendation Algorithm Based on KNN-GBDT [J]. Computer Engineering and Applications, 2021, 57(9): 103-108.
[3]	ZHANG Songcan, PU Jiexin, SI Yanna, SUN Lifan. Adaptive Improved Ant Colony Algorithm Based on Population Similarity and Its Application [J]. Computer Engineering and Applications, 2021, 57(8): 70-77.
[4]	ZHANG Xiaowen, REN Yongfeng. Image Matching Algorithm Combining Sparse Representation and Topological Similarity [J]. Computer Engineering and Applications, 2021, 57(8): 198-203.
[5]	YANG Fang, YIN Xi, SI Jianhui, LIU Hongyuan, WANG Xue. Mathematical Expression Similarity Calculation Method Based on Focus Clustering [J]. Computer Engineering and Applications, 2021, 57(6): 88-93.
[6]	QIAN Yunyun, YANG Wenzhong, YAO Miao, LI Hailei, CHAI Yachuang. Topic Community Discovery Model Incorporating Topic Similarity Weight [J]. Computer Engineering and Applications, 2021, 57(5): 107-114.
[7]	JIANG Bin, LIANG Xiao’an, ZHANG Liang, GAO Yangjun. Evidence Combination Method Based on Improved Modified Weight [J]. Computer Engineering and Applications, 2021, 57(24): 100-106.
[8]	TIAN Wei’an, CHEN Hongmei, ZHOU Lihua. Diversified Recommendation Method Based on Similar Users’Curiosity [J]. Computer Engineering and Applications, 2021, 57(23): 113-121.
[9]	LIANG Tian, CAO Dexin. Improved and Simplified Particle Swarm Optimization Algorithm Based on Levy Flight [J]. Computer Engineering and Applications, 2021, 57(20): 188-196.
[10]	WEI Dingfeng, LI Liang, CHAI Jing. Social Recommendation Algorithm by Fusing Item Information [J]. Computer Engineering and Applications, 2021, 57(19): 198-204.
[11]	LIU Li. Top-N Recommendation Algorithm Based on User Diversity Preference [J]. Computer Engineering and Applications, 2021, 57(17): 116-121.
[12]	YANG Yanjiao, ZHAO Guotao, WANG Pidong. Sentence Similarity Calculation Method Based on Semantics and Emotion [J]. Computer Engineering and Applications, 2021, 57(16): 151-158.
[13]	ZHANG Tao, YU Jiong, LIAO Bin, BI Xuehua. Method for Attributed Graph Summarization Based on Minimum Description Length [J]. Computer Engineering and Applications, 2021, 57(15): 124-132.
[14]	ZHAO Qi, DU Yanhui, LU Tianliang, SHEN Shaoyu. Algorithm of Text Similarity Analysis Based on Capsule-BiGRU [J]. Computer Engineering and Applications, 2021, 57(15): 171-177.
[15]	SHI Chen, ZHANG Yu, HU Bo. Model for Near-Synonym/Synonym Phrase Finding Based on Common Surrounding Context [J]. Computer Engineering and Applications, 2021, 57(14): 142-147.

Research on XML semantic retrieval

XML文档语义检索方法研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics