Computer Engineering and Applications ›› 2014, Vol. 50 ›› Issue (23): 206-211.

Previous Articles     Next Articles

Research on conversion approach between traditional Mongolian and Cyrillic Mongolian

BAO Feilong, GAO Guanglai, YAN Xueliang, WEI Hongxi   

  1. College of Computer Science, Inner Mongolia University, Hohhot 010021, China
  • Online:2014-12-01 Published:2014-12-12

传统蒙古文与西里尔蒙古文相互转换方法的研究

飞  龙,高光来,闫学亮,魏宏喜   

  1. 内蒙古大学 计算机学院,呼和浩特 010021

Abstract: Traditional Mongolian and Cyrillic Mongolian are both Mongolian languages and are widely used in China and Mongolia respectively. With almost the same pronunciations, their written forms are totally different. According to the characteristic of the two languages, this paper proposes a joint sequence model based approach and depicts in detail the corresponding experiments performed. In the experiments, the word error rate and letter error rate for the traditional Mongolian to Cyrillic Mongolian conversion system are 18.38% and 6.75%, and that for Cyrillic Mongolian and traditional Mongolian conversion system are 18.77% and 7.14%. Experimental results show that the proposed approach can meet the basic requirements for practical use.

Key words: traditional Mongolian, Cyrillic Mongolian, joint sequence models, joint multigram

摘要: 传统蒙古文和西里尔蒙古文分别是在中国和蒙古国使用的蒙古文,它们的口语基本相同,但是书写形式完全不同。结合传统蒙古文和西里尔蒙古文的构词特点,提出了基于联合序列模型的传统蒙古文和西里尔蒙古文相互转换方法,并做了大量的相互转换实验。实验中,传统蒙古文到西里尔蒙古文转换系统的词误识率和字母误识率分别达到了18.38%和6.75%,西里尔蒙古文到传统蒙古文转换系统的词误识率字母误识率分别达到了18.77%和7.14%,基本达到了实用要求。

关键词: 传统蒙古文, 西里尔蒙古文, 联合序列模型, 联合多元