病态嗓音特征的小波变换提取及识别研究

doi:10.3778/j.issn.1002-8331.2009.22.062

计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (22): 194-196.DOI: 10.3778/j.issn.1002-8331.2009.22.062

病态嗓音特征的小波变换提取及识别研究

于燕平^1，2，胡维平¹

1.广西师范大学物理与电子工程学院，广西桂林 541004
2.柳州铁道职业技术学院电子工程系，广西桂林 545007

收稿日期:2008-04-23 修回日期:2008-07-24 出版日期:2009-08-01 发布日期:2009-08-01
通讯作者: 于燕平

Research of extracting of pathological voice’s characteristics and recognition based on wavelet transformation and Gaussian mixture model

YU Yan-ping ^1，2，HU Wei-ping¹

1.College of Physics and Electronic Engineering，Guangxi Normal University，Guilin，Guangxi 541004，China
2.Department of Electronic Engineering，Liuzhou Railway Vocational Technical College，Liuzhou，Guangxi 545007，China

Received:2008-04-23 Revised:2008-07-24 Online:2009-08-01 Published:2009-08-01
Contact: YU Yan-ping

摘要/Abstract

摘要： 通过分析嗓音的发音机理、病态嗓音与正常嗓音在频域的表现差异，利用小波变换对信号进行分解，突出病态嗓音的特点，提出了基于多尺度分析的小波降噪、分解的熵系数（Entropy Coefficient based on De-noise，Decomposition of Multi-scale Analysis，ECDDMA）作为识别的特征矢量集。并对比分析了语音识别中经典特征参数Mel倒谱系数（MFCC），分别运用这两种特征参数对242例正常嗓音和234例病态嗓音运用高斯混合模型（GMM）进行了识别。结果显示：ECDDMA系数较传统的模拟人耳听觉非线性特性的MFCC及其动态特征能更准确地表征正常与病态嗓音之间的差异，有利于同时提高病态和正常嗓音的识别率。

关键词: 高斯混合模型（GMM）, 病态嗓音, Mel倒谱系数（MFCC）, 小波变换

Abstract: Considering the voice pronunciation mechanism，the different performances of the abnormal voice and the normal voice in the field of frequency，the paper proposes a new method for extracting characteristics that is Entropy Coefficient based on De-noise，Decomposition of Multi-scale Analysis（ECDDMA） using the wavelet decomposition to find the pathological voice’s characteristics，and comparative analysis of the effective speech characteristics MFCC.242 normal voices samples and 234 abnormal samples are recognized with MFCC and the new extracted characteristics ECDDMA based on Gaussian Mixture Model （GMM）.The result indicates that，the parameters of ECDDMA are more advantageous to the normal and abnormal voice recognition than the traditional MFCC and the dynamic characteristic which mimic the human ears non-linear characteristic with frequency，and improves the abnormal and normal voice’s recognition result.

Key words: Gaussian Mixture Model（GMM）, pathological voice, Mel Frequency Cepstrum Coefficient（MFCC）, wavelet transformation

于燕平^1，2，胡维平¹. 病态嗓音特征的小波变换提取及识别研究[J]. 计算机工程与应用, 2009, 45(22): 194-196.

YU Yan-ping ^1，2，HU Wei-ping¹. Research of extracting of pathological voice’s characteristics and recognition based on wavelet transformation and Gaussian mixture model[J]. Computer Engineering and Applications, 2009, 45(22): 194-196.

[1]	杜秀丽，马振倩，邱少明，吕亚娜. 基于卷积注意力机制的运动想象脑电信号识别[J]. 计算机工程与应用, 2021, 57(18): 181-185.
[2]	范文兵，孙志远. 基于小波域广义高斯分布的SAR图像分割算法[J]. 计算机工程与应用, 2020, 56(5): 222-226.
[3]	曹军，陈鹤，张佳薇. 基于超分辨率的多聚焦图像融合算法研究[J]. 计算机工程与应用, 2020, 56(3): 180-186.
[4]	宫睿，王小春. 基于可协调经验小波变换的多聚焦图像融合[J]. 计算机工程与应用, 2020, 56(2): 201-210.
[5]	顾婷婷，刘新会，桑庆兵，李朝锋. 基于双树复小波的无参考立体图像质量评价[J]. 计算机工程与应用, 2019, 55(2): 154-161.
[6]	杨霞，朱晓冬，刘元宁，冯家凯，刘帅. 分块小波特征结合BP神经网络的虹膜识别方法[J]. 计算机工程与应用, 2019, 55(18): 132-139.
[7]	肖文卿1,3，汪鸿浩2，詹长安1. 基于小波系数特征融合的小鼠癫痫脑电分类[J]. 计算机工程与应用, 2019, 55(14): 155-161.
[8]	谢国波，吴震禹. 基于小波变换和重力模型的混沌图像加密算法[J]. 计算机工程与应用, 2019, 55(13): 100-105.
[9]	颜宏文，卢格宇. CEEMD-WT和CNN在短期风速预测中的应用研究[J]. 计算机工程与应用, 2018, 54(9): 224-230.
[10]	张志禹1，李向月1，李向阳2. 同步挤压小波变换对随机噪声抑制的研究[J]. 计算机工程与应用, 2018, 54(5): 57-60.
[11]	陈波1，刘厚泉1，赵志凯2. 时间序列多尺度异常检测方法[J]. 计算机工程与应用, 2018, 54(20): 122-127.
[12]	贾澎涛，贾伟. 煤矿井下视频多目标轨迹跟踪算法研究[J]. 计算机工程与应用, 2018, 54(2): 222-227.
[13]	崔金鸽1，陈炳权1，2，徐庆1. 基于Dual-Tree CWT和自适应双边滤波器的图像去噪算法[J]. 计算机工程与应用, 2018, 54(18): 223-228.
[14]	张艺超，袁贞明，孙晓燕. 基于心冲击信号的睡姿识别[J]. 计算机工程与应用, 2018, 54(17): 135-140.
[15]	黄亚飞，王国富，张法全，叶金才. 基于蜂群算法和带参阈值函数的图像去噪方法[J]. 计算机工程与应用, 2018, 54(17): 164-168.

病态嗓音特征的小波变换提取及识别研究

Research of extracting of pathological voice’s characteristics and recognition based on wavelet transformation and Gaussian mixture model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics