计算机工程与应用 ›› 2013, Vol. 49 ›› Issue (16): 196-200.

• 信号处理 • 上一篇    下一篇

语音分离与HMM相结合的语音增强方法

刘凤增,李国辉,唐  敏   

  1. 国防科学技术大学 信息系统与管理学院,长沙 410073
  • 出版日期:2013-08-15 发布日期:2013-08-15

Speech enhancement approach based on speech separation using HMM

LIU Fengzeng, LI Guohui, TANG Min   

  1. Department of Information System and Management, National University of Defense Technology, Changsha 410073, China
  • Online:2013-08-15 Published:2013-08-15

摘要: 针对基于隐马尔科夫(HMM,Hidden Markov Model)的MAP和MMSE两种语音增强算法计算量大且前者不能处理非平稳噪声的问题,借鉴语音分离方法,提出了一种语音分离与HMM相结合的语音增强算法。该算法采用适合处理非平稳噪声的多状态多混合单元HMM,对带噪语音在语音模型和噪声模型下的混合状态进行解码,结合语音分离方法中的最大模型理论进行语音估计,避免了迭代过程和计算量特别大的公式计算,减少了计算复杂度。实验表明,该算法能够有效地去除平稳噪声和非平稳噪声,且感知评价指标PESQ 的得分有明显提高,算法时间也得到有效控制。

关键词: 语音增强, 语音分离, 非平稳噪声, 基于HMM的语音增强

Abstract: There are two typical speech enhancement algorithms based on HMM(Hidden Markov Model) which are MAP(Maximum A Posteriori) estimator and MMSE(Minimum Mean-Square Error) estimator. Both algorithms have high computational complexity, and the former can’t handle non-stationary noise. In response to these shortcomings, with the speech separation technology as reference, speech enhancement algorithm based on speech separation using HMM is designed. This algorithm uses the multi-state AR-HMM which is applied to non-stationary noise condition to decode the mixed state sequence of noisy speech under the speech model and noise model. Then, the decoded speech is estimated by speech separation method using maximization model theory which avoids iterative procedure and huge computation so that the complexity is reduced. The experiments also show that the proposed algorithm can effectively remove the stationary noise and non-stationary noise, improve the PESQ(Perceptual Evaluation of Speech Quality) score and the algorithm time is under control too.

Key words: speech enhancement, speech separation, non-stationary noise, speech enhancement based on HMM