Robust i-vector speaker recognition method based on DNN processing

doi:10.3778/j.issn.1002-8331.1707-0207

Abstract

Abstract: This paper presents a method of combining Deep Neural Network（DNN） regression model with i-vector/Probabilistic Linear Discriminant Analysis（PLDA） speaker recognition system. By fitting the nonlinear function relationship between the noisy and the clean speech i-vector using DNN, the approximate representation of the clean speech i-vector can be obtained to achieve the purpose of reducing the influence of the noise on the system performance. The feasibility and effectiveness of the proposed method are verified by the experiments on TIMIT data set.

Key words: speaker recognition, Deep Neural Network（DNN）, i-vector

摘要： 提出了一种将基于深度神经网络（Deep Neural Network，DNN）特征映射的回归分析模型应用到身份认证矢量（identity vector，i-vector）/概率线性判别分析（Probabilistic Linear Discriminant Analysis，PLDA）说话人系统模型中的方法。DNN通过拟合含噪语音和纯净语音i-vector之间的非线性函数关系，得到纯净语音i-vector的近似表征，达到降低噪声对系统性能影响的目的。在TIMIT数据集上的实验验证了该方法的可行性和有效性。

关键词: 说话人识别, 深度神经网络, i-vector

WANG Xin, ZHANG Hongran. Robust i-vector speaker recognition method based on DNN processing[J]. Computer Engineering and Applications, 2018, 54(22): 167-172.

王昕，张洪冉. 基于DNN处理的鲁棒性I-Vector说话人识别算法[J]. 计算机工程与应用, 2018, 54(22): 167-172.

[1]	WANG Wentao, LI Shumei, TANG Jie, LYU Weilong. DDoS Attack Detection Method Based on Probability Graph Model and DNN [J]. Computer Engineering and Applications, 2021, 57(13): 108-115.
[2]	ZHANG Bohan, LING Jie. Improved Malware Detection Method Based on DNN [J]. Computer Engineering and Applications, 2021, 57(10): 81-87.
[3]	ZENG Chunyan, MA Chaofeng, WANG Zhifeng, ZHU Dongliang, ZHAO Nan, WANG Juan, LIU Cong. Survey of Speaker Recognition in Deep Learning Framework [J]. Computer Engineering and Applications, 2020, 56(7): 8-16.
[4]	ZENG Shulei, LI Xuehua, PAN Chunyu, WANG Yafei, ZHAO Zhongyuan. Resource Allocation Framework Based on Deep Neural Network in Fog Radio Access Network [J]. Computer Engineering and Applications, 2020, 56(24): 78-84.
[5]	LIN Pengfei, HE Xiuqing, CHEN Tiantian, WU Huajun, HE Juhou. Prediction of Loss and Teaching?Intervention for Learners in MOOC from Perspective of Deep Learning [J]. Computer Engineering and Applications, 2019, 55(22): 258-264.
[6]	XU Limin1, WEI Xiang2. Analysis and design of speaker authentication system based on Android platform of parallel computation [J]. Computer Engineering and Applications, 2017, 53(3): 231-236.
[7]	QIU Yuanhang1, GU Mingliang1, MA Yong1, JIN Yun1, HAN Jun1, ZHAO Dongmei1, ZHAO Chenghao2. Automatic identification of Chinese dialects based on global information fusion [J]. Computer Engineering and Applications, 2017, 53(17): 160-165.
[8]	SHU Yi1, XING Yujuan2. Speaker verification based on i-vector and sparse representation using PCA dictionary learning [J]. Computer Engineering and Applications, 2016, 52(18): 144-147.
[9]	ZHANG Xiaoheng1，2, XIE Wenbin2, LI Yongming2. Multiple voice features types evolutionary selection algorithm [J]. Computer Engineering and Applications, 2016, 52(14): 150-155.
[10]	XING Yujuan, CAO Xiaoli, TAN Ping, LI Hengjie. Speaker verification based on WLDA and i-sparse representation classification [J]. Computer Engineering and Applications, 2016, 52(13): 173-176.
[11]	LUO Jian, YANG Yingen, LEI Zhenchun. Weighted pairwise constraint metric learning in speaker recognition [J]. Computer Engineering and Applications, 2016, 52(11): 158-163.
[12]	HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao. Improvement of MFCC parameters extraction in speaker recognition [J]. Computer Engineering and Applications, 2014, 50(7): 217-220.
[13]	DU Xiaoqing, YU Fengqin. Speaker recognition algorithm based on HHT cepstrum coefficient [J]. Computer Engineering and Applications, 2014, 50(3): 198-202.
[14]	XIONG Huaqiao, ZHENG Jianbin, ZHAN Enqi, WANG Yang, HUA Jian. Speaker recognition based on speaker model clustering [J]. Computer Engineering and Applications, 2014, 50(2): 133-136.
[15]	LIANG Hui, ZENG Shuiping. Application of wavelet multiresolution theory to extract personality characteristics [J]. Computer Engineering and Applications, 2013, 49(9): 120-122.

Robust i-vector speaker recognition method based on DNN processing

基于DNN处理的鲁棒性I-Vector说话人识别算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics