Comparison of clause alignment based on maximum entropy model and Back Propagation neural network model

Computer Engineering and Applications ›› 2015, Vol. 51 ›› Issue (7): 112-117.

Previous Articles Next Articles

Comparison of clause alignment based on maximum entropy model and Back Propagation neural network model

LIU Ying, WANG Nan

Department of Chinese Language and Literature, Tsinghua University, Beijing 100084, China

Online:2015-04-01 Published:2015-03-31

最大熵模型和BP神经网络的短句对齐比较

刘颖，王楠

清华大学中文系，北京 100084

Abstract

Abstract: Clauses are aligned for Shi Ji ancient and modern parallel corpora using maximum entropy model and Back Propagation neural network model. Maximum entropy model combines clause length, clause alignment mode with co-occurring Chinese word feature. Back Propagation neural network model combines clause length, clause position with co-occurring Chinese word feature. The precision and the recall rate of clause alignment are highest when it uses the three features for maximum entropy model. The precision and the recall rate of maximum entropy model are higher than those of Back Propagation neural network model.

Key words: clause alignment, maximum entropy model, Back Propagation neural network model, Records of the Grand Historian（Shi Ji）

摘要： 利用最大熵模型和BP神经网络对《史记》古文与现代文译文的平行语料进行短句对齐研究。最大熵模型将短句长度、短句对齐模式和共现汉字特征相结合来对平行语料进行短句对齐；BP神经网络则把短句长度、短句位置和共现汉字特征相结合来对平行语料进行短句对齐。实验结果表明：同时考虑短句长度、短句对齐模式和共现汉字3个特征的最大熵模型，短句对齐的准确率和召回率是最高的；并且最大熵模型的准确率和召回率高于BP神经网络。

关键词: 短句对齐, 最大熵模型, BP神经网络, 《史记》

LIU Ying, WANG Nan. Comparison of clause alignment based on maximum entropy model and Back Propagation neural network model[J]. Computer Engineering and Applications, 2015, 51(7): 112-117.

刘颖，王楠. 最大熵模型和BP神经网络的短句对齐比较[J]. 计算机工程与应用, 2015, 51(7): 112-117.

[1]	XIA Wuji1，2, HUAQUE Cairang1. Research of tibetan personal pronouns anaphora resolution based on mixed strategy [J]. Computer Engineering and Applications, 2018, 54(7): 66-69.
[2]	SANG Haiyan1，2, Gulia·Altenbek1，2, NIU Ningning1，2. Kazakh part-of-speech tagging method based on maximum entropy [J]. Computer Engineering and Applications, 2013, 49(11): 126-129.
[3]	Guljamal Mamateli1，Askar Rozi2，Askar Hamdulla1. Hybrid algorithm of polyphonic word disambiguation in Uyghur language [J]. Computer Engineering and Applications, 2011, 47(35): 158-160.
[4]	CAO Bo,SU Yi-dan,DENG Qi. Automatic recognition of Chinese name based on maximum entropy [J]. Computer Engineering and Applications, 2009, 45(4): 227-228.
[5]	LI Ru^1，2，SONG Xiao-xiang¹，WANG Wen-jing¹. Chinese question classification based on Chinese FrameNet [J]. Computer Engineering and Applications, 2009, 45(31): 111-114.
[6]	XIE Fa-kui^1,2，ZHANG Quan². Semantic chunks segmentation based on maximum entropy model [J]. Computer Engineering and Applications, 2009, 45(26): 118-120.
[7]	FANG Wei^1,2,HUANG Li^1,2,CUI Zhi-ming^1,2. Automatic identifying query interfaces of deep Web with maximum entropy classifier [J]. Computer Engineering and Applications, 2008, 44(21): 133-137.
[8]	LI Jun-hui,LI Pei-feng,ZHU Qiao-ming,QIAN Pei-de. Email categorization with maximum entropy model [J]. Computer Engineering and Applications, 2007, 43(35): 126-129.
[9]	,,,. Research on Mail Filtering System based on Maximum Entropy Model [J]. Computer Engineering and Applications, 2006, 42(32期): 0-.

Comparison of clause alignment based on maximum entropy model and Back Propagation neural network model

最大熵模型和BP神经网络的短句对齐比较

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 9

Recommended Articles

Metrics