计算机工程与应用 ›› 2014, Vol. 50 ›› Issue (6): 192-196.

• 信号处理 • 上一篇    下一篇

汉英机器翻译中格式转换研究

刘智颖1,2,郭艳波3,晋耀红1,2   

  1. 1.北京师范大学 中文信息处理研究所,北京 100875
    2.中国专利信息中心-北京师范大学机器翻译联合实验室,北京 100875
    3.盘古文化传播有限公司,北京 100162
  • 出版日期:2014-03-15 发布日期:2015-05-12

Format conversion in Chinese-English machine translation

LIU Zhiying1,2,GUO Yanbo3,JIN Yaohong1,2   

  1. 1.Institute of Chinese Information Processing, Beijing Normal University, Beijing 100875, China
    2.China Patent Information Center-Beijing Normal University Joint Laboratory of Machine Translation, Beijing 100875, China
    3.Pangu Culture Media Company, Beijing 100162, China
  • Online:2014-03-15 Published:2015-05-12

摘要: 格式在HNC理论中是指广义作用句各主语块位置的不同排列组合方式。由于主语块的排列方式在汉英两种语言中表达的差异,汉语句子翻译到英语时常常发生格式转换。格式转换是HNC机器翻译理论的一个重要内容,是机器翻译理论实践的基础和前提。以HNC机器翻译理论为指导,以真实文本的专利文献汉英句对为分析对象,研究专利机器翻译中汉英两种语言之间广义作用句的格式转换规律,制定了排除规则、识别规则和转换规则,对部分规则进行了人工评测,结果表明准确率能达到85%左右。

关键词: 格式转换, 广义作用句, 机器翻译

Abstract: The format in Hierarchical Network of Concepts(HNC) theory refers to the different arrangement methods of the main chunk in the general action sentence. The format conversion always occurs in the Chinese-English translation for the difference of the main chunk arrangements in two languages. Based on the HNC machine translation theory, this paper analyzes the patent documents Chinese-English sentence pairs, studies the format conversion laws in the general action sentences, makes the exclusion rules, recognition rules and conversion rules, evaluates the effects of part rules. The experiments show that translation precision of about 85% can be obtained.

Key words: format conversion, general action sentence, machine translation