Speaker recognition based on speaker model clustering

Abstract

This paper proposes a speaker recognition method based on Speaker Model Clustering (SMC) to improve the efficiency of the recognition system. Similar speaker models are clustered using an approximated Kullback-Leibler divergence, and a cluster centroid and a cluster representative are determined for each cluster. Together, the cluster centroids and cluster representatives form a hierarchical speaker recognition model. In the recognition stage, a cluster is first selected by computing the distance between the test vectors and the cluster centroids or cluster representatives. The speaker is then determined by computing the log-likelihood of the test vectors under the speaker models in the selected cluster only, which sharply reduces the amount of computation. Experimental results show that, under the same conditions, the proposed method increases recognition speed about fourfold compared with the traditional Gaussian Mixture Model (GMM), while recognition accuracy drops by only about 0.95%. The SMC method can therefore greatly improve recognition speed while keeping almost the same accuracy.

Keywords: speaker recognition, Gaussian mixture model (GMM), speaker model clustering (SMC)
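
The two-stage procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: speaker models are diagonal-covariance GMMs stored as (weights, means, variances) tuples, the approximated Kullback-Leibler divergence is a matched-component approximation (reasonable only when component i of one model corresponds to component i of the other), clustering is a simple k-medoids pass that yields one representative per cluster, and the first stage scores the test vectors by log-likelihood under each representative rather than by a separate distance measure. The function names are hypothetical.

```python
import numpy as np

def gmm_loglik(X, w, mu, var):
    """Average per-frame log-likelihood of frames X (T, D) under a diagonal GMM."""
    diff = X[:, None, :] - mu[None, :, :]                      # (T, K, D)
    log_comp = -0.5 * (np.sum(diff ** 2 / var, axis=2)
                       + np.sum(np.log(2 * np.pi * var), axis=1))
    a = log_comp + np.log(w)                                   # (T, K)
    m = a.max(axis=1, keepdims=True)                           # log-sum-exp trick
    return float(np.mean(m[:, 0] + np.log(np.exp(a - m).sum(axis=1))))

def approx_kl(a, b):
    """Matched-component approximation of KL(a || b) (an assumed choice)."""
    w1, mu1, var1 = a
    w2, mu2, var2 = b
    # Closed-form KL between paired diagonal Gaussians, one value per component.
    kl_comp = 0.5 * np.sum(np.log(var2 / var1)
                           + (var1 + (mu1 - mu2) ** 2) / var2 - 1.0, axis=1)
    return float(np.sum(w1 * (kl_comp + np.log(w1 / w2))))

def cluster_models(models, n_clusters, n_iter=10):
    """Group speaker models by symmetrised approximate KL using k-medoids."""
    n = len(models)
    dist = np.array([[approx_kl(models[i], models[j]) + approx_kl(models[j], models[i])
                      for j in range(n)] for i in range(n)])
    reps = [0]                                                 # farthest-point init
    while len(reps) < n_clusters:
        reps.append(int(np.argmax(dist[:, reps].min(axis=1))))
    for _ in range(n_iter):
        labels = np.argmin(dist[:, reps], axis=1)
        new_reps = []
        for c in range(n_clusters):
            members = np.flatnonzero(labels == c)
            within = dist[np.ix_(members, members)].sum(axis=1)
            new_reps.append(int(members[np.argmin(within)]))   # medoid of cluster c
        if new_reps == reps:
            break
        reps = new_reps
    labels = np.argmin(dist[:, reps], axis=1)
    return reps, labels

def identify(X, models, reps, labels):
    """Stage 1: pick a cluster via its representative. Stage 2: score its members."""
    best_c = max(range(len(reps)), key=lambda c: gmm_loglik(X, *models[reps[c]]))
    candidates = np.flatnonzero(labels == best_c)
    return int(max(candidates, key=lambda s: gmm_loglik(X, *models[s])))
```

With N enrolled speakers split into C clusters of roughly equal size, this sketch evaluates about C + N/C models per test utterance instead of N, which is where the speed-up comes from. The price is that an utterance routed to the wrong cluster in the first stage cannot be recovered in the second, consistent with the small accuracy loss the abstract reports.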

XIONG Huaqiao, ZHENG Jianbin, ZHAN Enqi, WANG Yang, HUA Jian. Speaker recognition based on speaker model clustering[J]. Computer Engineering and Applications, 2014, 50(2): 133-136.

