基于GMM的说话人识别技术研究

计算机工程与应用 ›› 2011, Vol. 47 ›› Issue (11): 114-117.

• 数据库、信号与信息处理 • 上一篇下一篇

基于GMM的说话人识别技术研究

曹洁，潘鹏

兰州理工大学计算机与通信学院，兰州 730050

收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2011-04-11 发布日期:2011-04-11

Research on GMM based speaker recognition technology

CAO Jie，PAN Peng

College of Computer and Communication，Lanzhou University of Technology，Lanzhou 730050，China

Received:1900-01-01 Revised:1900-01-01 Online:2011-04-11 Published:2011-04-11

摘要/Abstract

摘要： 为了探讨高斯混合模型在说话人识别中的作用，设计了一个基于GMM的说话人识别系统。整个系统由音频信号预处理，语音活动检测，说话人模型建立以及音频信号识别4个模块组成。前三个模块构成了系统的模型训练部分，最后一个模块构成了系统的语音识别部分。包含在第二个模块中的由GMM模型搭建的语音活动检测器是研究的创新之处。利用增强的多方互动会议语料库中的视听会议对系统中的部分可调参数以及系统的识别错误率进行了测试。仿真结果表明，在语音活动检测器和若干滤波算法的帮助下，系统对包含重叠语音的音频信号的识别准确率可以达到83.02%。

关键词: 高斯混合模型, 语音活动检测, 识别错误率

Abstract: In order to investigate the function of Gaussian Mixture Model（GMM） in speaker recognition，a GMM based speaker recognition system is designed.The system consists of four modules that are audio signal pre-processing，speech activity detection，speaker modeling as well as audio signal recognition.The first three modules constitute the model training segment of the system and the last module constitutes the speech recognition segment of the system.A speech activity detector which is built by GMM in the second module is the innovation of the research.Some tunable parameters and recognition error rate of the system are tested using audio-visual meetings in the Augmented Multi-party Interaction（AMI） corpus.Simulations show that with the help of the speech activity detector and several filter algorithms，recognition accuracy rate of the system for audio signal with overlap speech can reach 83.02%.

Key words: Gaussian Mixture Model（GMM）, speech activity detection, recognition error rate

曹洁，潘鹏. 基于GMM的说话人识别技术研究[J]. 计算机工程与应用, 2011, 47(11): 114-117.

CAO Jie，PAN Peng. Research on GMM based speaker recognition technology[J]. Computer Engineering and Applications, 2011, 47(11): 114-117.

[1]	潘沛鑫，潘中良. 结合显著性的主动轮廓图像分割[J]. 计算机工程与应用, 2021, 57(8): 225-230.
[2]	雷恒林，古兰拜尔·吐尔洪，买日旦·吾守尔，张东梅. 新奇检测综述[J]. 计算机工程与应用, 2021, 57(5): 47-55.
[3]	王师琦，曾庆宁，龙超，熊松龄，祁潇潇. 语音增强与检测的多任务学习方法研究[J]. 计算机工程与应用, 2021, 57(20): 197-202.
[4]	贾兵兵，曹辉，秦驰杰. 基于SGMM和DNN结合提高音素识别率的研究[J]. 计算机工程与应用, 2019, 55(24): 117-121.
[5]	陈超. 高斯混合模型结合加权似然的目标跟踪算法[J]. 计算机工程与应用, 2019, 55(12): 124-131.
[6]	仇功达1，何明1，祝朝政1，杨杰2，刘勇1. 基于稀疏交界最大密度连通的模糊聚类方法[J]. 计算机工程与应用, 2018, 54(14): 82-88.
[7]	梁恺彬，管一弘. 基于隐高斯混合模型的人脑MRI分割方法[J]. 计算机工程与应用, 2018, 54(10): 196-203.
[8]	陈卉，胡立坤，黄钰雯. 采用高斯混合模型及树结构的立体匹配算法[J]. 计算机工程与应用, 2017, 53(20): 195-200.
[9]	牛艺蓉，王士同. 基于噪音受益的快速图像分割算法[J]. 计算机工程与应用, 2016, 52(21): 195-201.
[10]	胡志立，郭敏. 基于SLIC的改进GrabCut彩色图像快速分割[J]. 计算机工程与应用, 2016, 52(2): 186-190.
[11]	杜楠楠，赵晖. 维吾尔语情感语音韵律转换研究[J]. 计算机工程与应用, 2016, 52(19): 154-160.
[12]	张明光，张钰. 基于ANN伪量测建模的配电网状态估计[J]. 计算机工程与应用, 2016, 52(17): 253-256.
[13]	刘玉超. 一种自适应的多粒度概念提取方法——高斯云变换[J]. 计算机工程与应用, 2015, 51(9): 1-8.
[14]	党小超1，2，毛鹏鑫1，郝占军1，2. 基于快速求解高斯混合模型的流量聚类算法[J]. 计算机工程与应用, 2015, 51(8): 96-101.
[15]	赵英，陈骏君. 基于流相关性的网络流量分类[J]. 计算机工程与应用, 2015, 51(21): 25-29.

基于GMM的说话人识别技术研究

Research on GMM based speaker recognition technology

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics