Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (16): 192-195.

• Signal Processing •

Speech feature extraction method based on AWS_VFR

谈会星,陈福才,李邵梅   

  1. National Digital Switching System Engineering & Technological R&D Center, Zhengzhou 450002
  • Published: 2013-08-15 Online: 2013-08-15

Variable frame rate based on adaptive weighted-sum for speech feature extraction

TAN Huixing, CHEN Fucai, LI Shaomei   

  1. National Digital Switching System Engineering & Technological R&D Center, Zhengzhou 450002, China
  • Online: 2013-08-15 Published: 2013-08-15

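Abstract: Fixed frame rate feature extraction in speech recognition does not fully account for how the speech spectrum changes and has poor noise robustness. To address this, a variable frame rate method based on adaptive weighted-sum is proposed for feature extraction and evaluated in a specific audio retrieval system. At a signal-to-noise ratio of 20 dB, the system detection rate improves by nearly 4% compared with fixed frame rate feature extraction. The experiments show that the method is effective at reducing the influence of noise and improving specific audio retrieval performance.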

Key words: variable frame rate, adaptive weighted-sum, specific audio retrieval

Abstract: Fixed frame rate feature extraction does not fully account for changes in the characteristics of the speech spectrum and has poor noise robustness. To address these problems, this paper proposes a variable frame rate method based on adaptive weighted-sum for speech feature extraction, and experiments are conducted with a specific audio retrieval system. At a signal-to-noise ratio of 20 dB, the system detection rate increases by nearly 4% compared with the fixed frame rate feature extraction method. Experimental results show that the method is effective in reducing the influence of noise and in improving the performance of specific audio retrieval.

Key words: variable frame rate, adaptive weighted-sum, specific audio retrieval
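
The abstract does not give the formulas behind the adaptive weighted-sum criterion, so the following is only a generic sketch of how variable frame rate selection is commonly done, not the authors' method. It assumes features have already been computed at a fine fixed frame shift, scores each frame by a weighted sum of spectral change and energy change, and keeps a frame whenever the accumulated score exceeds a threshold derived from the utterance itself. The names `change_scores` and `select_frames`, the mixing weight `alpha`, and the factor `scale` are all hypothetical.

```python
import math


def change_scores(features, energies, alpha=0.7):
    """Per-frame change score: weighted sum of spectral and energy change.

    alpha is a hypothetical mixing weight, not a value from the paper.
    """
    scores = [0.0]  # the first frame has no predecessor
    for t in range(1, len(features)):
        spectral = math.dist(features[t], features[t - 1])
        energy = abs(energies[t] - energies[t - 1])
        scores.append(alpha * spectral + (1.0 - alpha) * energy)
    return scores


def select_frames(features, energies, alpha=0.7, scale=1.0):
    """Return indices of frames kept by an accumulate-and-threshold rule."""
    if not features:
        return []
    scores = change_scores(features, energies, alpha)
    # Assumed adaptation rule: threshold is a multiple of the mean change score.
    threshold = scale * sum(scores) / len(scores)
    kept = [0]  # always keep the first frame
    accumulated = 0.0
    for t in range(1, len(scores)):
        accumulated += scores[t]
        if accumulated > threshold:
            kept.append(t)
            accumulated = 0.0
    return kept


# Toy input: three steady frames, a transition, then three steady frames.
feats = [[0.0], [0.0], [0.0], [1.0], [2.0], [2.0], [2.0]]
energy = [0.0] * 7
print(select_frames(feats, energy))  # -> [0, 3, 4]
```

In this toy run the frames inside the transition are kept and the steady frames after the first are dropped, which is the intended effect of a variable frame rate: more frames where the spectrum changes quickly, fewer where it is stationary.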