短时TEO能量在带噪语音端点检测中的应用

计算机工程与应用 ›› 2013, Vol. 49 ›› Issue (12): 144-147.

短时TEO能量在带噪语音端点检测中的应用

李杰1，周萍2，杜志然1

1.桂林电子科技大学计算机科学与工程学院，广西桂林 541004
2.桂林电子科技大学电子工程与自动化学院，广西桂林 541004

出版日期:2013-06-14 发布日期:2013-06-14

Application of short-time TEO energy in noisy speech endpoint detection

LI Jie1, ZHOU Ping2, DU Zhiran1

1.School of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
2.School of Electric Engineering and Automation, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China

Online:2013-06-14 Published:2013-06-14

摘要/Abstract

摘要： 语音端点检测是语音识别系统的一个重要组成部分，特别是在噪声环境下，其准确性直接影响到语音识别系统的计算复杂度和识别性能。提出了一种在噪声环境下基于短时TEO能量的语音信号端点检测方法，采用了双门限-三态转换判决机制以保证算法在噪声环境下的端点检测准确性和对信号绝对幅度变化的稳健性。实验结果表明，与传统的短时能量法和谱熵法相比，该算法在低信噪比情况下具有更好的端点检测能力，显示了算法的优越性。

关键词: Teager能量算子, 端点检测, 语音识别, 噪声

Abstract: Speech endpoint detection is a crucial component in speech recognition system, especially in noisy environment. Its accuracy affects the computational complexity and the recognition performance of the speech recognition system. This paper proposes an endpoint detection of speech signals based on short-time TEO energy in noisy environment. It uses a three-state transition and judgment mechanism based on double thresholds, which ensures the accuracy in noisy environment and the robustness to changes in absolute levels. Compared with same traditional algorithms such as short-time energy and spectral entropy, experiment results show this algorithm has better detection capability in low signal to noise ratio environments and takes on more advantages.

Key words: Teager energy operator, endpoint detection, speech recognition, noise

李杰1，周萍2，杜志然1. 短时TEO能量在带噪语音端点检测中的应用[J]. 计算机工程与应用, 2013, 49(12): 144-147.

LI Jie1, ZHOU Ping2, DU Zhiran1. Application of short-time TEO energy in noisy speech endpoint detection[J]. Computer Engineering and Applications, 2013, 49(12): 144-147.

[1]	刘迪，贾金露，赵玉卿，钱育蓉. 基于深度学习的图像去噪方法研究综述[J]. 计算机工程与应用, 2021, 57(7): 1-13.
[2]	徐麒皓，李波. 基于NRU网络的肺结节检测方法[J]. 计算机工程与应用, 2021, 57(4): 83-90.
[3]	王洁，金正猛，冯灿. 自适应广义全变差的图像泊松去噪算法[J]. 计算机工程与应用, 2021, 57(20): 203-209.
[4]	陈晓文，刘光帅，刘望华，李旭瑞. 成对旋转不变的共生自适应完全局部三值模式[J]. 计算机工程与应用, 2021, 57(1): 219-226.
[5]	刘洪琛，刘朝霞，张龙. 融合[L2]和KL保真项的图像恢复算法[J]. 计算机工程与应用, 2020, 56(5): 214-221.
[6]	朱苗苗，潘伟杰，刘翔，吕健，赵慧亮. 基于BP神经网络代理模型的交互式遗传算法[J]. 计算机工程与应用, 2020, 56(2): 146-151.
[7]	袁小军，周涛，李琛. 基于稀疏先验的非局域聚类图像去噪算法研究[J]. 计算机工程与应用, 2020, 56(18): 177-185.
[8]	娄英丹，徐静林，黄丽霞，张雪英. MLLR和MAP在远场噪声混响下的语音识别研究[J]. 计算机工程与应用, 2020, 56(10): 122-126.
[9]	卢航，郝顺义，彭志颖，黄国荣. 基于MCC的鲁棒高阶CKF在组合导航中的应用[J]. 计算机工程与应用, 2020, 56(1): 257-264.
[10]	何丽，刘颖，韩克平. 噪声标注下的改进TSVM学习算法[J]. 计算机工程与应用, 2019, 55(17): 44-50.
[11]	魏春英，郭中华. 基于联合信源信道和迭代解码的LDPC编码方案[J]. 计算机工程与应用, 2019, 55(16): 94-98.
[12]	冯正英，王世东. 结合MNF变换和Canny算子的遥感影像变化检测[J]. 计算机工程与应用, 2019, 55(14): 266-270.
[13]	童麟1，2，韩越兴1，3，小长谷明彦3，4. DNA机器人在AFM图像中的分割和识别[J]. 计算机工程与应用, 2019, 55(11): 192-198.
[14]	陈泽伟，曾庆宁，谢先明，龙超. 基于自相关函数的语音端点检测方法[J]. 计算机工程与应用, 2018, 54(6): 216-221.
[15]	张志禹1，李向月1，李向阳2. 同步挤压小波变换对随机噪声抑制的研究[J]. 计算机工程与应用, 2018, 54(5): 57-60.