基于短语的统计机器翻译中短语抽取算法改进

计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (13): 147-149.

• 数据库、信号与信息处理 • 上一篇下一篇

基于短语的统计机器翻译中短语抽取算法改进

强静^1,2,张建¹

1.中国科学院合肥智能机械研究所，合肥 230031
2.中国科学技术大学信息科学技术学院，合肥 230027

收稿日期:2007-08-21 修回日期:2007-11-15 出版日期:2008-05-01 发布日期:2008-05-01
通讯作者: 强静

Improving phrase-based statistical translation by modifying phrase extraction algorithm

QIANG Jing^1,2,ZHANG Jian¹

1.Institute of Intelligent Machines，Chinese Academy of Sciences，Hefei 230031，China
2.School of Information Science and Technology，University of Science and Technology of China，Hefei 230027，China

Received:2007-08-21 Revised:2007-11-15 Online:2008-05-01 Published:2008-05-01
Contact: QIANG Jing

摘要/Abstract

摘要： 针对基于短语统计机器翻译中目前常用的Och提出的短语抽取算法，提出了一种改进算法。该算法能够在原有算法的基础上抽取出更多的准确对齐信息，这对语料库较小的汉民统计机器来说意义重大，增加正确的对齐信息可以减少未登录词的产生，提高翻译正确率。经过对不同规模语料库的实验，抽取的短语对数目有明显增多。

关键词: 统计机器翻译, 翻译模型, 短语抽取

Abstract: The paper proposes an improved algorithm of Phrase Extract based on the Och’s phrase extraction algorithm in the phrase based statistical machine translation.The algorithm can take more accurate alignment information based on the original algorithm.It is of great significance for the smaller corpus statistical machinery.It can reduce the unknown words by increasing in correct alignment information，and increases the rate of correct translation.After the different scale corpus experiment.The extracted number of phrase is obviously increase.

Key words: machine translation, translation model, phrase extract

强静^1,2,张建¹. 基于短语的统计机器翻译中短语抽取算法改进[J]. 计算机工程与应用, 2008, 44(13): 147-149.

QIANG Jing^1,2,ZHANG Jian¹. Improving phrase-based statistical translation by modifying phrase extraction algorithm[J]. Computer Engineering and Applications, 2008, 44(13): 147-149.

[1]	帕丽旦·木合塔尔，吾守尔·斯拉木，买买提阿依甫，努尔麦麦提·尤鲁瓦斯. RNN编码器-解码器在维汉机器翻译中的应用[J]. 计算机工程与应用, 2018, 54(15): 235-240.
[2]	郭俊博1，张喜媛2，杜金华2. N-Best句法知识增强的统计机器翻译预调序模型[J]. 计算机工程与应用, 2016, 52(17): 160-165.
[3]	刘颖，姜巍. 统计机器翻译中翻译规则抽取[J]. 计算机工程与应用, 2012, 48(32): 98-101.
[4]	王丽，韩习武. 双语词典在统计机器翻译中的应用[J]. 计算机工程与应用, 2010, 46(16): 135-139.
[5]	王斯日古楞^1，2，斯琴图³，那顺乌日图². 基于短语的汉蒙统计机器翻译研究[J]. 计算机工程与应用, 2010, 46(14): 138-142.
[6]	孙广范，宋金平，肖健，袁琦. 句法调序的统计机器翻译方法研究[J]. 计算机工程与应用, 2009, 45(36): 142-144.
[7]	麻雪云¹，肖诗斌^1，2，王弘蔚^1，2，施水才^1，2. 基于关键名词短语聚类的中文搜索结果聚类[J]. 计算机工程与应用, 2009, 45(31): 118-121.
[8]	罗毅,李淼,朱鉴,胡冠龙. 基于短语统计机器翻译解码算法的研究与实现[J]. 计算机工程与应用, 2007, 43(30): 171-173.

基于短语的统计机器翻译中短语抽取算法改进

Improving phrase-based statistical translation by modifying phrase extraction algorithm

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 8

编辑推荐

Metrics