Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (35): 117-121.

• 数据库、信号与信息处理 • Previous Articles     Next Articles

Grapheme-to-phoneme conversion of Tibetan with SAMPA

LIU Bo1,YANG Hongwu1,GAN Zhenye1,2,GUO Weitong1   

  1. 1.College of Physics and Electronic Engineering,Northwest Normal University,Lanzhou 730070,China
    2.Key Lab of China National Linguistic Information Technology,Northwest University for Nationalities,Lanzhou 730030,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2011-12-11 Published:2011-12-11

利用SAMPA实现藏语的字音转换

刘 博1,杨鸿武1,甘振业1,2,郭威彤1   

  1. 1.西北师范大学 物理与电子工程学院,兰州 730070
    2.西北民族大学 中国民族语言文字信息技术重点实验室,兰州 730030

Abstract: Speech Assessment Methods Phonetic Alphabet(SAMPA) is a kind of computer readable phonetic alphabet,which adopts computer readable ASCII characters to represent the pronunciations of language.This paper proposes a set of SAMPA label(named SAMPA-T) for Tibetan.The SAMPA labels of consonants and vowels are listed alone with the International Pronunciation Alphabet(IPA) for Tibetan.The paper also realizes the grapheme-to-phoneme conversion of Tibetan by using SAMPA-T.The proposed SAMPA-T can be applied to the Tibetan speech synthesis and other Tibetan speech information processing.

Key words: SAMPA-Tibetan, Speech Assessment Methods Phonetic Alphabet(SAMPA), grapheme-to-phoneme conversion

摘要: 机读音标SAMPA(Speech Assessment Methods Phonetic Alphabet)即计算机可读的音标,用计算机可读的ASCII字符表示语言的发音。提出了一种藏语的SAMPA标注的设计方案SAMPA-T(Tibetan),以藏语拉萨话为例列出了它们的辅音和元音对应的国际音标与SAMPA-T标注,并实现了面向SAMPA-T的藏语字音转换,可应用于藏语语音合成等藏语语音信息处理中。

关键词: 藏语机读音标, 机读音标(SAMPA), 字音转换