Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (20): 206-209.

• Engineering and Applications •

Learning ontology of maize pests and diseases from Chinese text

QI Hong (1,2), GUAN Yingying (1,3), LIU Yabo (1,2)

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  2. Key Lab of Symbolic Computation & Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  3. Government Information Management Center, People's Government Office of Jixi, Jixi, Heilongjiang 158100, China
  • Online: 2011-07-11  Published: 2011-07-11

Abstract: Because Chinese and English differ in grammar and syntax, ontology learning from Chinese text still poses difficulties. This paper studies a method for learning an ontology of maize pests and diseases from Chinese text. A character-combining method is proposed and combined with TF-IDF to extract concepts. To extract concept relations, concept similarity is computed as a weighted average of the Euclidean distance and the cosine distance. Using these methods, an ontology of maize pests and diseases is built from 50 domain documents selected from the China Maize Network.

Key words: ontology learning, concept extraction, concept relation extraction, ontology of maize pests and diseases
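
The abstract names two computational steps, TF-IDF weighting and a weighted average of Euclidean and cosine distance. The following is a minimal Python sketch of how they might fit together, not the authors' implementation. Several things are assumptions made for illustration: the TF-IDF variant (relative term frequency times log inverse document frequency), the choice to represent each concept by its TF-IDF weights across the documents, the weight `alpha`, and the sample tokens. The character-combining method is not sketched because the abstract does not describe it.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document (docs are token lists).

    Assumed variant: (count / doc length) * log(N / document frequency).
    """
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(doc))  # count each term once per document
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return weights


def concept_vector(term, weights):
    """Assumed representation: the term's TF-IDF weight in each document."""
    return [w.get(term, 0.0) for w in weights]


def euclidean_distance(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def cosine_distance(u, v):
    """1 - cosine similarity; defined as 1.0 when either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 1.0
    return 1.0 - dot / (norm_u * norm_v)


def combined_distance(u, v, alpha=0.5):
    """Weighted average of the two distances; alpha is a hypothetical weight.

    A smaller value means the two concepts are more similar.
    """
    return alpha * euclidean_distance(u, v) + (1 - alpha) * cosine_distance(u, v)


# Hypothetical, already-segmented documents (tokens invented for illustration).
docs = [
    ["corn_borer", "larva", "stalk", "corn_borer"],
    ["leaf_blight", "lesion", "leaf"],
    ["corn_borer", "larva", "leaf"],
]
weights = tfidf(docs)
borer = concept_vector("corn_borer", weights)
larva = concept_vector("larva", weights)
blight = concept_vector("leaf_blight", weights)
print(combined_distance(borer, larva), combined_distance(borer, blight))
```

In this sketch the two distances are on different scales (cosine distance is bounded, Euclidean distance is not), so in practice the vectors might be normalized before averaging; the abstract does not say how the paper handles this.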