一种基于K-SVD的说话人识别方法

计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (34): 112-115.

• 数据库、信号与信息处理 • 上一篇下一篇

一种基于K-SVD的说话人识别方法

马振1，张雄伟2，杨吉斌2

1.解放军理工大学通信工程学院，南京 210007
2.解放军理工大学指挥自动化学院，南京 210007

出版日期:2012-12-01 发布日期:2012-11-30

Speaker recognition method based on K-SVD

MA Zhen1, ZHANG Xiongwei2, YANG Jibin2

1.Institute of Communication Engineering, PLA University of Science & Technology, Nanjing 210007, China
2.Institute of Command Automation, PLA University of Science & Technology, Nanjing 210007, China

Online:2012-12-01 Published:2012-11-30

摘要/Abstract

摘要： 为了充分提取语音中的个人特征信息，类比矢量量化，提出了一种基于K-均值奇异值分解（K-SVD）的说话人识别方法。利用K-SVD训练得到的字典可较好地保存语音信号中的个人特征信息。利用这一特性，通过K-SVD从训练数据中提取包含说话人个人特征信息的字典，利用该字典实现说话人识别。相对于传统方法，该方法能够更好地利用语音的稀疏性保存语音中的个人特征信息并减小重构误差。实验仿真结果表明，与基于矢量量化的说话人识别方法相比，该方法在多说话人的情况下具有更好的识别率，具有更高的实用价值。

关键词: 说话人识别, K-均值奇异值分解（K-SVD）, 字典, 稀疏性

Abstract: In order to extract the personal characteristics, a speaker recognition method based on K-means Singular Value Decomposition（K-SVD） is proposed. The personal characteristics in voice can be well preserved in the dictionary trained from the K-SVD. With this feature, the dictionary which contains the personal characteristics is extracted from training data through the K-SVD algorithm. Then the trained dictionary is used for the speaker recognition. Compared to traditional methods, the personal characteristics in voice can be better preserved based on the proposed method through the sparse nature of voice and can reduce the reconstruction error. Experimental results show that the proposed method outperforms the VQ based methods for too many speakers in the view of recognition rate, so the proposed method has more practical value.

Key words: speaker recognition, K-mean Singular Value Decomposition（K-SVD）, dictionary, sparse

马振1，张雄伟2，杨吉斌2. 一种基于K-SVD的说话人识别方法[J]. 计算机工程与应用, 2012, 48(34): 112-115.

MA Zhen1, ZHANG Xiongwei2, YANG Jibin2. Speaker recognition method based on K-SVD[J]. Computer Engineering and Applications, 2012, 48(34): 112-115.

[1]	王子儒，李振民. 融合数据增强的迁移字典学习[J]. 计算机工程与应用, 2021, 57(23): 193-199.
[2]	丁玉祥，卞维新，接标，赵俊. 融合邻域回归和稀疏表示的图像超分辨率重构[J]. 计算机工程与应用, 2021, 57(2): 230-236.
[3]	王钰，刘凡，王菲. 基于自编码器和稀疏表示的单样本人脸识别[J]. 计算机工程与应用, 2021, 57(1): 168-172.
[4]	董艳花，张树美，赵俊莉. 有遮挡人脸识别方法综述[J]. 计算机工程与应用, 2020, 56(9): 1-12.
[5]	曾春艳，马超峰，王志锋，朱栋梁，赵楠，王娟，刘聪. 深度学习框架下说话人识别研究综述[J]. 计算机工程与应用, 2020, 56(7): 8-16.
[6]	李巧，陈花竹，杨春雨，李丹. 基于判别性解析字典与分类器学习的模式分类[J]. 计算机工程与应用, 2020, 56(6): 165-171.
[7]	陈子兆，矫文成. 改进的稀疏深度置信网络[J]. 计算机工程与应用, 2020, 56(2): 62-67.
[8]	代乾龙，孙伟. 基于改进稀疏栈式编码的车型识别[J]. 计算机工程与应用, 2020, 56(1): 136-141.
[9]	闫丽萍1，马家军1，陈文兴2. 稀疏结构化最小二乘双支持向量回归机[J]. 计算机工程与应用, 2019, 55(3): 10-14.
[10]	张凯兵，郑冬冬，景军锋. 低分辨人脸识别综述[J]. 计算机工程与应用, 2019, 55(22): 14-24.
[11]	张凯兵，王珍，闫亚娣，朱丹妮. 优化的AdaBoost回归图像超分辨方法[J]. 计算机工程与应用, 2019, 55(20): 159-163.
[12]	马丽红，谭学仕. 共有结构假设下流形正则图的零样本分类方法[J]. 计算机工程与应用, 2019, 55(15): 153-160.
[13]	张哲源，张灵，陈云华. 结合分块LBP与投影字典对学习的表情识别[J]. 计算机工程与应用, 2019, 55(12): 149-154.
[14]	聂栋栋，贺悦悦. 基于字典扩展的快速人脸识别算法[J]. 计算机工程与应用, 2018, 54(8): 201-206.
[15]	王雪1，隋立春1，2，杨振胤3，康军梅1. 最优方向耦合字典学习的遥感影像超分辨率重建[J]. 计算机工程与应用, 2018, 54(7): 201-205.

一种基于K-SVD的说话人识别方法

Speaker recognition method based on K-SVD

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics