Noisy channel based Uyghur neutralized vowel identification model

doi:10.3778/j.issn.1002-8331.2010.15.035

Computer Engineering and Applications ›› 2010, Vol. 46 ›› Issue (15): 118-120.DOI: 10.3778/j.issn.1002-8331.2010.15.035

• 数据库、信号与信息处理 • Previous Articles Next Articles

Noisy channel based Uyghur neutralized vowel identification model

AISHAN Wumaier，TUERGEN Yibulayin，ZAOKERE Kadeer

School of Information Science and Engineering，Xinjiang University，Urumqi 830046，China

Received:2009-04-27 Revised:2009-06-18 Online:2010-05-21 Published:2010-05-21
Contact: AISHAN Wumaier

基于噪声信道的维吾尔语央音原音识别模型

艾山·吾买尔，吐尔根·依步拉音，早克热·卡德尔

新疆大学信息科学与工程学院，乌鲁木齐 830046

通讯作者: 艾山·吾买尔

Abstract

Abstract: In Uyghur，an inflectional suffix added to a word always produces vowel neutralization.When stemming an inflected word，the rule based neutralized vowel detecting has a low precision about 40%.For this problem，the noisy channel based Uyghur neutralized vowel identification model is proposed.The language model and likelihood build on the word ending two letters，three letters and last syllable.In the test，the model’s precision reached 82.45%，this model can improve stemming precision over 15%.

Key words: noisy channel, Uyghur, vowel harmony, stemming, neutralized vowel

摘要： 维吾尔语单词连接构形词缀时，经常发生元音弱化成央音的现象。但对已有形态变化的单词进行形态还原时，使用规则识别弱化央音的原音的效率一般在40%左右。提出基于噪声信道的维吾尔语央音原音识别模型。该模型以弱化词干词尾的二字符、三字符和最后音节作为上下文，建立语言模型和似然度计算公式。在开放测试中，模型的准确率达到82.45%，提高词干提取准确率15%。

关键词: 噪声信道, 维吾尔语, 元音弱化, 词干提取, 央音

CLC Number:

TP391

AISHAN Wumaier，TUERGEN Yibulayin，ZAOKERE Kadeer. Noisy channel based Uyghur neutralized vowel identification model[J]. Computer Engineering and Applications, 2010, 46(15): 118-120.

艾山·吾买尔，吐尔根·依步拉音，早克热·卡德尔. 基于噪声信道的维吾尔语央音原音识别模型[J]. 计算机工程与应用, 2010, 46(15): 118-120.

[1]	Hasan Wumaier, Sirajahmat Ruzmamat, Xireaili Hairela, LIU Wenqi, Tuergen Yibulayin, WANG Liejun, Wayit Abulizi. Bi-directional Uyghur-Chinese Neural Machine Translation with Marked Syllables [J]. Computer Engineering and Applications, 2021, 57(4): 161-168.
[2]	LIU Chang, Abudukelimu·Abulizi, YAO Dengfeng, Halidanmu·Abudukelimu. Survey for Uyghur Morphological Analysis [J]. Computer Engineering and Applications, 2021, 57(15): 42-61.
[3]	Ahmatjan Mattohti, Askar Hamdulla, Abdusalam Dawut. Uyghur Text Regions Localization Using Channel-Enhanced MSER and CNN [J]. Computer Engineering and Applications, 2020, 56(16): 132-138.
[4]	XU Xuebin, Hornisa Mamat, Alim Aysa, ZHU Yali, Kurban Ubul. Word Segmentation of Uyghur Image Based on Clustering and Conjoined Segment Identification [J]. Computer Engineering and Applications, 2020, 56(14): 148-155.
[5]	Yibulayin·Wusiman, GUO Wenqiang, YU Kai. Research on Filtering Algorithm for Senstive Information in Multi-form Uyghur [J]. Computer Engineering and Applications, 2020, 56(10): 127-133.
[6]	AYSADET·Abliz, HOJAHMAT·Ismayil, KAMIL·Muyidin, ASKAR·Hamdulla. Word extraction from Uyghur handwritten documents [J]. Computer Engineering and Applications, 2018, 54(9): 133-138.
[7]	XUE Pengqiang, XIAN Ying, Nurbol, Wushour Silamu. Sensitive information filtering algorithm based on Uyghur text information network research [J]. Computer Engineering and Applications, 2018, 54(5): 236-241.
[8]	Yibulayin·WUSIMAN1, ZHANG Shaowu2, YU Kai1. Research and implementation of converting mechanism of multiple characters Uyghur on the Internet [J]. Computer Engineering and Applications, 2018, 54(19): 114-121.
[9]	MUHETAER Palidan, SILAMU Wushouer, Maimaitayifu, YOULUWASI Nuermaimaiti. Application of RNN encoder-decoder in Uyghur-Chinese machine translation [J]. Computer Engineering and Applications, 2018, 54(15): 235-240.
[10]	Guljamal Mamateli1, Askar rozi2, Askar Hamdulla3. Uyghur prosodic boundary prediction based on hierarchical feature template selection [J]. Computer Engineering and Applications, 2017, 53(8): 250-253.
[11]	JIANG Wen，LIU Likang. Recognition of handwritten Uyghur character based on combination of two features [J]. Computer Engineering and Applications, 2017, 53(5): 192-196.
[12]	NIAN Mei1, FAN Zukui2, LIU Ruolan1. Study on construction of emotional dictionary of Uyghur language [J]. Computer Engineering and Applications, 2017, 53(4): 152-155.
[13]	XU Chun1，2，3, YANG Yong4, JIANG Tonghai1. Research on machine translation based Uyghur morphological analysis [J]. Computer Engineering and Applications, 2017, 53(14): 138-142.
[14]	Alimjan AYSA1，3, Kurban UBUL2，3, Turgun IBRAHIM2，3. Bigram feature extraction for Uyghur text [J]. Computer Engineering and Applications, 2015, 51(3): 216-221.
[15]	Mahpirat Wali1, ZHAO Mengyuan2, Askar Hamdulla1. Keyword based Uyghur single document summarization [J]. Computer Engineering and Applications, 2015, 51(16): 130-135.

Noisy channel based Uyghur neutralized vowel identification model

基于噪声信道的维吾尔语央音原音识别模型

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics