Speech emotion recognition based on improved MFCC with EMD

Abstract

Abstract: Non-stationary characteristics of speech signal under the different emotions are especially obvious. Traditional MFCC can only reflect speech static features, while EMD can describe non-stationary characteristics of speech signal precisely. In order to extract the non-stationary features of emotional speech, the improved MFCC steps are proposed including EMD decomposition into IMFs, Mel filtering, logarithm and DCT. The improved MFCC is adopted as the new feature with SVM to recognize four speech emotions consisting of happy, angry, bored and fear. Simulation results demonstrate that the recognition rate of the improved MFCC is 77.17%, and in different SNRs, the recognition rate can be increased by 3.26%.

Key words: speech emotion recognition, Mel-Frequency Cepstral Coefficients（MFCC）, Empirical Mode Decomposition（EMD）, Support Vector Machine（SVM）

摘要： 人在不同情感下的语音信号其非平稳性尤为明显，传统的MFCC只能反映语音信号的静态特征，经验模态分解能够精细地刻画语音信号的非平稳特性。为提取情感语音的非平稳特征，用经验模态分解将情感语音信号分解为一系列固有模态函数分量，通过Mel滤波器后取其对数能量，进行DCT反变换后得到改进的MFCC作为情感识别的新特征，采用支持向量机对高兴、生气、厌烦和恐惧等四种语音情感识别。仿真实验结果表明：改进的MFCC识别率达到77.17%，在不同的信噪比下，识别率最大可提高3.26%。

关键词: 语音情感识别, Mel频率倒谱系数, 经验模态分解, 支持向量机

TU Binbin, YU Fengqin. Speech emotion recognition based on improved MFCC with EMD[J]. Computer Engineering and Applications, 2012, 48(18): 119-122.

屠彬彬，于凤芹. 基于EMD的改进MFCC的语音情感识别[J]. 计算机工程与应用, 2012, 48(18): 119-122.

[1]	HAN Weiyu, CHENG Longsheng. Research on Roling Bearing Failure Mode Classification Based on MTS and SVM [J]. Computer Engineering and Applications, 2021, 57(6): 239-246.
[2]	WEN Jiebin, YANG Wenzhong, MA Guoxiang, ZHANG Zhihao, LI Hailei. Micro-expression Recognition Based on Apex Frame Optical Flow and Convolutional Autoencoder [J]. Computer Engineering and Applications, 2021, 57(4): 127-133.
[3]	XU Xianfeng, CAI Lulu, ZHANG Li. Photovoltaic Power Generation Prediction Algorithm Based on MLP and DBN [J]. Computer Engineering and Applications, 2021, 57(3): 266-272.
[4]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[5]	WANG Chuanyu, LI Weixiang, CHEN Zhenhuan. Reserch of Multi-modal Emotion Recognition Based on Voice and Video Images [J]. Computer Engineering and Applications, 2021, 57(23): 163-170.
[6]	CHEN Fujian, XIE Weixin, XIA Ting. Adaptive Anti-occlusion Target Tracking Algorithm Based on LCT+ [J]. Computer Engineering and Applications, 2021, 57(22): 190-198.
[7]	CHEN Feiyu, YUE Wenbin, RAO Yinglu, XING Jinhao, MA Xiaojing. Autonomous Precision Landing of Drone Based on Improved TLD Algorithm [J]. Computer Engineering and Applications, 2020, 56(7): 247-254.
[8]	MA Ling, LUO Xiaoshu, JIANG Pinqun. Research on Dot Matrix Character Recognition Based on Template Matching and Support Vector Machine [J]. Computer Engineering and Applications, 2020, 56(4): 134-139.
[9]	ZHANG Zhonglin, FENG Yibang, ZHAO Zhongkai. Oversampling Method for Unbalanced Data Sets Based on SVM [J]. Computer Engineering and Applications, 2020, 56(23): 220-228.
[10]	HUANG Guangjun, DENG Yuanlong. Polarizer Visual Defect Detection and Classification Based on Improved LBP and SVM Algorithm [J]. Computer Engineering and Applications, 2020, 56(22): 251-255.
[11]	SUI Xiuwu, NIU Jiabao, LI Haotian, QIAO Mingmin. Upper Limb sEMG Gesture Recognition Method Based on NMF-SVM Model [J]. Computer Engineering and Applications, 2020, 56(17): 161-166.
[12]	YANG Yu，ZENG Guohui，HUANG Bo. Fault Diagnosis Method of Bearings Based on Dual-Tree Complex Wavelet Packet Transform and Improved SVM [J]. Computer Engineering and Applications, 2020, 56(17): 231-235.
[13]	YANG Ying, WANG Jun, WANG Gang. Customer Complaints Classification Method Based on Improved Random Subspace [J]. Computer Engineering and Applications, 2020, 56(13): 230-235.
[14]	YANG Yanrong, SONG Rongjie, ZHOU Zhaoyong. Network Intrusion Detection Method Based on GAN-PSO-ELM [J]. Computer Engineering and Applications, 2020, 56(12): 66-72.
[15]	WANG Zhong, ZHANG Xueliang, LIU Yaqun, ZHOU Junyu, SHAN Dongsheng. Research on Face Gesture Recognition Based on EEG Sensor [J]. Computer Engineering and Applications, 2020, 56(12): 182-186.

Speech emotion recognition based on improved MFCC with EMD

基于EMD的改进MFCC的语音情感识别

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics