Practical speaker feature extraction method

Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (10): 51-53.

• 理论研究 • Previous Articles Next Articles

Practical speaker feature extraction method

LI Ming,ZHANG Yong,LI Jun-quan,ZHANG Ya-fen

School of Computer and Communication，Lanzhou University of Technology，Lanzhou 730050，China

Received:2007-07-17 Revised:2007-10-09 Online:2008-04-01 Published:2008-04-01
Contact: LI Ming

一种实用的说话人特征提取方法

李明,张勇,李军权,张亚芬

兰州理工大学计算机与通信学院，兰州 730050

通讯作者: 李明

Abstract

Abstract: Aimming at the shortage of Sparse Kernel Principal Component Analysis（SKPCA） in feature extraction，a novel feature extraction method based on the kernel K-means clustering and the SKPCA for speaker recognition is proposed.Here kernel K-means clustering is to divide all the frames of each sample into a given amount of clusters，since the resulted clustering centers can represent better the clusters they belong to，the clustering is replaced by the clustering center，and the dimensions of kernel matrix are decreased accordingly.This method reduces storage and computational complexity，it guarantees to reduce data can represent the original data well and information loss is minimum.The experimental results show the proposed approach do not affect veracity，improve recognition rate，and meet the requirement of speaker recognition in terms of practicability.

Key words: Kernel Principal Component Analysis（KPCA）, sparse KPCA, kernel K-means clustering, speaker recognition

摘要： 针对稀疏核主成分分析方法在特征提取中的不足，提出了一种基于核K-均值聚类的稀疏核主成分分析（Sparse KPCA）的特征提取方法用于说话人识别。通过核K-均值聚类的方法对语音帧进行聚类，由于聚类的中心能够很好地代表类内的特征，用中心样本帧取代该类，减少了核矩阵的维数，然后再采用稀疏KPCA方法对核矩阵进行特征提取。该方法能够减少存储空间和计算的复杂度，它保证约简后的数据能够很好地代表原始数据并且在约简过程中信息损失最小。实验结果验证了提出的方法在不影响识别率的前提下提高了识别速度，满足了说话人识别的实用性要求。

关键词: 核主成分分析（KPCA）, 稀疏KPCA, 核K-均值聚类, 说话人识别

LI Ming,ZHANG Yong,LI Jun-quan,ZHANG Ya-fen. Practical speaker feature extraction method[J]. Computer Engineering and Applications, 2008, 44(10): 51-53.

李明,张勇,李军权,张亚芬. 一种实用的说话人特征提取方法[J]. 计算机工程与应用, 2008, 44(10): 51-53.

[1]	ZENG Chunyan, MA Chaofeng, WANG Zhifeng, ZHU Dongliang, ZHAO Nan, WANG Juan, LIU Cong. Survey of Speaker Recognition in Deep Learning Framework [J]. Computer Engineering and Applications, 2020, 56(7): 8-16.
[2]	WANG Xin, ZHANG Hongran. Robust i-vector speaker recognition method based on DNN processing [J]. Computer Engineering and Applications, 2018, 54(22): 167-172.
[3]	JIANG Yan1, SHUAI Renjun1, ZHANG Shu2, ZHA Daifeng3. Prediction for fasting blood glucose level of health records based on KPCA-LSSVM [J]. Computer Engineering and Applications, 2018, 54(13): 241-245.
[4]	CHEN Feiyu1, RUAN Kun2, HU Youbin1, CAO Lei3. Waterline extraction algorithm based on KPCA and spectral features constrained [J]. Computer Engineering and Applications, 2018, 54(11): 171-177.
[5]	XU Limin1, WEI Xiang2. Analysis and design of speaker authentication system based on Android platform of parallel computation [J]. Computer Engineering and Applications, 2017, 53(3): 231-236.
[6]	WANG Chunfang, GAO Yuyu. KPCA face recognition based on KL divergence [J]. Computer Engineering and Applications, 2016, 52(9): 130-134.
[7]	ZHANG Xiaoheng1，2, XIE Wenbin2, LI Yongming2. Multiple voice features types evolutionary selection algorithm [J]. Computer Engineering and Applications, 2016, 52(14): 150-155.
[8]	LUO Jian, YANG Yingen, LEI Zhenchun. Weighted pairwise constraint metric learning in speaker recognition [J]. Computer Engineering and Applications, 2016, 52(11): 158-163.
[9]	WANG Yibing1, HU Bangjun2. Robust face recognition based on low frequency DCT coefficients retransforming optimized by CLAHE [J]. Computer Engineering and Applications, 2014, 50(9): 135-140.
[10]	HU Zhengquan, ZENG Yuming, ZONG Yuan, LI Mengchao. Improvement of MFCC parameters extraction in speaker recognition [J]. Computer Engineering and Applications, 2014, 50(7): 217-220.
[11]	DU Xiaoqing, YU Fengqin. Speaker recognition algorithm based on HHT cepstrum coefficient [J]. Computer Engineering and Applications, 2014, 50(3): 198-202.
[12]	TANG Yongbo. Kernel Principal Component Analysis model for transformer fault detection based on modified feature sample [J]. Computer Engineering and Applications, 2014, 50(21): 4-7.
[13]	XIONG Huaqiao, ZHENG Jianbin, ZHAN Enqi, WANG Yang, HUA Jian. Speaker recognition based on speaker model clustering [J]. Computer Engineering and Applications, 2014, 50(2): 133-136.
[14]	LIANG Hui, ZENG Shuiping. Application of wavelet multiresolution theory to extract personality characteristics [J]. Computer Engineering and Applications, 2013, 49(9): 120-122.
[15]	ZHU Peng, WANG Chengru. Speaker recognition combining wavelet packet transform with Teager Energy Operator [J]. Computer Engineering and Applications, 2013, 49(9): 187-189.

Practical speaker feature extraction method

一种实用的说话人特征提取方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics