Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (9): 115-119.

Previous Articles     Next Articles

Research on large vocabulary continuous speech recognition for Uyghur

Nurmemet YOLWAS, Wushour SILAMU   

  1. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
  • Online:2013-05-01 Published:2016-03-28

面向大词汇量的维吾尔语连续语音识别研究

努尔麦麦提·尤鲁瓦斯,吾守尔·斯拉木   

  1. 新疆大学 信息科学与工程学院,乌鲁木齐 830046

Abstract: The technology of Large Vocabulary Continuous Speech Recognition(LVCSR) has developed quickly, and many scientific institutions have reinforced the speech recognition research on the Mandarin Chinese and English. However, the study of Uyghur speech recognition technology has started recently. This paper introduces the research on main aspect of Uyghur LVCSR system, such as construction of Uyghur speech corpus, acoustic and language modeling techniques, decoding techniques, and performed experiments for Uyghur LVCSR. At the end, the issues affecting Uyghur LVCSR system are discussed in detail.

Key words: Uyghur language, speech corpus, large vocabulary, recognition technology

摘要: 近年来大词汇量连续语音识别技术得到了迅速的发展,国内外研究机构加大了对汉语和英语语音识别技术的研究,然而,维吾尔语语音识别技术的研究工作最近才起步。建立了面向大词汇量的维吾尔语语音语料库,研究了维吾尔语声学模型和语言模型建模技术、解码技术,进行了面向大词汇量的维吾尔语连续语音识别实验。对维吾尔语大词汇量连续语音识别技术进一步发展中存在的问题进行了讨论。

关键词: 维吾尔语, 语音语料库, 大词汇, 识别技术