Multiple voice features types evolutionary selection algorithm

Abstract

Abstract: Speech feature extraction based on feature selection is a very effective method for speaker recognition. However, the optimal speech features have also changed. Therefore, this paper proposes a kind of four kinds of speech feature wrapper selection framework algorithm（FSF-WrGAF）. The algorithm extracts four kinds of speech features, and conducts dynamic wrapper feature selection by Chainlike Agent Genetic Algorithm（CAGA） and Gaussian Mixture Model-Universal Background Model（GMM-UBM）, thereby obtaining high recognition accuracy. Several algorithms are compared in the experiment part. Experimental results show that the FSF-WrGAF algorithm can obtain apparent improvement in terms of accuracy, equal error rate and detection cost compared with some other algorithms.

Key words: speaker recognition, multiple voice features types, chain-like agent genetic algorithm, Gammatone Frequency Cepstrum Coefficient（GFCC）, Mel Frequency Cepstrum Coefficient（MFCC）, Linear Prediction Cepstrum Coefficient（LPCC）

摘要： 基于特征选择的语音特征获取用于说话人识别是目前较为有效的方式。但是，最优语音特征随着具体应用环境的变化而不同。因此，提出了基于四类型语音特征封装式遗传特征选择算法（FSF-WrGAF），该算法提取了四种类型的语音特征参数，通过链式智能体遗传算法和GMM-UBM进行封装式动态特征选择，获取高精度的识别准确率。采用了多种指标完成该算法的性能测试。实验结果表明，该算法具体实现过程简便，改进效果明显，较同类算法在多项指标（识别率，EER，DET曲线）上都有显著提高。

关键词: 说话人识别, 多类型语音特征, 链式智能体遗传算法, 伽马通滤波器倒谱系数（GFCC）, 梅尔频率倒谱系数（MFCC）, 线性预测倒谱系数（LPCC）

ZHANG Xiaoheng1，2, XIE Wenbin2, LI Yongming2. Multiple voice features types evolutionary selection algorithm[J]. Computer Engineering and Applications, 2016, 52(14): 150-155.

张小恒1，2，谢文宾2，李勇明2. 多类型语音特征进化选择算法[J]. 计算机工程与应用, 2016, 52(14): 150-155.

[1]	ZENG Chunyan, MA Chaofeng, WANG Zhifeng, ZHU Dongliang, ZHAO Nan, WANG Juan, LIU Cong. Survey of Speaker Recognition in Deep Learning Framework [J]. Computer Engineering and Applications, 2020, 56(7): 8-16.
[2]	WANG Xin, ZHANG Hongran. Robust i-vector speaker recognition method based on DNN processing [J]. Computer Engineering and Applications, 2018, 54(22): 167-172.
[3]	XU Limin1, WEI Xiang2. Analysis and design of speaker authentication system based on Android platform of parallel computation [J]. Computer Engineering and Applications, 2017, 53(3): 231-236.
[4]	HUANG Lixia1, WANG Yanan1, ZHANG Xueying1, WANG Hongcui2. Research on noise robustness of speech recognition based on deep auto-encoder neural network [J]. Computer Engineering and Applications, 2017, 53(13): 49-54.
[5]	LUO Jian, YANG Yingen, LEI Zhenchun. Weighted pairwise constraint metric learning in speaker recognition [J]. Computer Engineering and Applications, 2016, 52(11): 158-163.
[6]	SU Peng, CHENG Jian. Application of DHMM to mechanical equipment audio recognition [J]. Computer Engineering and Applications, 2015, 51(1): 266-270.
[7]	HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao. Improvement of MFCC parameters extraction in speaker recognition [J]. Computer Engineering and Applications, 2014, 50(7): 217-220.
[8]	DU Xiaoqing, YU Fengqin. Speaker recognition algorithm based on HHT cepstrum coefficient [J]. Computer Engineering and Applications, 2014, 50(3): 198-202.
[9]	SUN Yan. Self-adaption fuzzy clustering LBG vector-quantization algorithm [J]. Computer Engineering and Applications, 2014, 50(23): 203-205.
[10]	XIONG Huaqiao, ZHENG Jianbin, ZHAN Enqi, WANG Yang, HUA Jian. Speaker recognition based on speaker model clustering [J]. Computer Engineering and Applications, 2014, 50(2): 133-136.
[11]	KONG Rong, WU Di, LIAO Qipeng, ZHU Junjie, ZHOU Qiang, TAO Zhi. Using complex cepstrum peak filter for reverberation recognition by GMM [J]. Computer Engineering and Applications, 2014, 50(15): 191-193.
[12]	ZHU Peng, WANG Chengru. Speaker recognition combining wavelet packet transform with Teager Energy Operator [J]. Computer Engineering and Applications, 2013, 49(9): 187-189.
[13]	LIANG Hui, ZENG Shuiping. Application of wavelet multiresolution theory to extract personality characteristics [J]. Computer Engineering and Applications, 2013, 49(9): 120-122.
[14]	SUN Quanling, WANG Lixin. Fast kernel clustering back propagation algorithm [J]. Computer Engineering and Applications, 2013, 49(10): 118-120.
[15]	LIU Hong1, LIU Liqun2. Research on speaker recognition with improved MFCC [J]. Computer Engineering and Applications, 2012, 48(8): 155-157.

Multiple voice features types evolutionary selection algorithm

多类型语音特征进化选择算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics