计算机工程与应用 ›› 2013, Vol. 49 ›› Issue (10): 14-17.

• 博士论坛 • 上一篇    下一篇

基于BIC与PSO的简约语音识别系统创建

包希日莫1,高光来1,张  璟2   

  1. 1.内蒙古大学 计算机学院,呼和浩特 010021
    2.西安理工大学 计算机科学与工程学院,西安 710048
  • 出版日期:2013-05-15 发布日期:2013-05-14

Construction of concise speech recognition systems based on BIC and PSO

BAO Xirimo1, GAO Guanglai1, ZHANG Jing2   

  1. 1.School of Computer Science, Inner Mongolia University, Hohhot 010021, China
    2.Institute of Computer Science & Engineering, Xi’an University of Technology, Xi’an 710048, China
  • Online:2013-05-15 Published:2013-05-14

摘要: 针对当前尚无建立简约高效语音识别系统标准方法的情形,提出了通过贝叶斯信息准则(Bayesian Information Criterion,BIC)中的权衡系数折中选择系统识别率与复杂度,利用改进的粒子群优化(Particle Swarm Optimization,PSO)算法优化声学模型拓扑结构,进而创建高效简约语音识别系统的新方法。TIDigits上的实验表明,与传统方法创建的同复杂度的基线系统相比,用该方法建立的新系统句子正确率提升了7.85%,与同识别率的基线系统相比,系统复杂度降低了51.4%,说明新系统能够以较低的复杂度获得较高的识别率。

关键词: 隐马尔可夫模型, 语音识别, 高效简约系统, 声学模型拓扑结构, 贝叶斯信息准则, 粒群优化

Abstract: Aiming at the current situation of lacking standard methods to construct efficient and concise speech recognition systems, a new method to build this kind of systems is proposed, which involves determining the relative importance of the recognition performance and system complexity through the value of the regularization coefficient in Bayesian Information Criterion(BIC) and optimizing acoustic model topologies using improved Particle Swarm Optimization(PSO) algorithm. Experiments on TIDigits corpus show that the new system constructed by using this method obtains 7.85% absolute increase in sentence correct rate compared to the baseline with the same complexity and built in the conventional way, and reduces 51.4% of system complexity compared to the baseline with the same recognition rate, which indicates that the new system is capable of obtaining higher recognition rate with lower system complexity.

Key words: Hidden Markov Model(HMM), speech recognition, efficient and concise system, acoustic model topology, Bayesian Information Criterion(BIC), Particle Swarm Optimization(PSO)