Speech recognition based on k-means clustering and neural network ensemble

Computer Engineering and Applications ›› 2012, Vol. 48 ›› Issue (12): 144-147.

Previous Articles Next Articles

Speech recognition based on k-means clustering and neural network ensemble

YAO Minfeng1, LI Xinguang1, HUANG Wentao2

1.School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China
2.School of Automation, Guangdong University of Technology, Guangzhou 510006, China

Online:2012-04-21 Published:2012-04-20

一种k均值和神经网络集成的语音识别方法

姚敏锋1，李心广1，黄文涛2

1.广东外语外贸大学信息学院，广州 510006
2.广东工业大学自动化学院，广州 510006

Abstract

Abstract: In this paper, a method of speech recognition based on k-means clustering and neural network ensemble is proposed. The method is based on neural network model. After a number of individual neural networks are trained, the k-means clustering algorithm is used to select a part of the trained individual networks’ weights and thresholds with small similarity. Many neural networks with the selected weights and thresholds are combined. The method not only overcomes the shortcomings that single BP neural network model is easy to local convergence and lack of stability, but also solves the problems that the traditional method in training lasts for a long time and the differences of individual network are not obvious. The experimental results prove the effectiveness of this method.

Key words: k-means clustering, neural network ensemble, speech recognition

摘要： 提出了一种基于k均值聚类和BP神经网络集成的语音识别方法，该方法以神经网络集成模型为基础，利用k均值聚类算法选择部分有差异性的个体神经网络再进行集成学习，既克服了单个BP网络模型容易局部收敛和不稳定性的缺点，又解决了传统集成方法训练时间长和个体网络差异性不明显的问题。通过对非特定人孤立词的语音识别的实验，证实了该方法的有效性。

关键词: k均值聚类, 神经网络集成, 语音识别

YAO Minfeng1, LI Xinguang1, HUANG Wentao2. Speech recognition based on k-means clustering and neural network ensemble[J]. Computer Engineering and Applications, 2012, 48(12): 144-147.

姚敏锋1，李心广1，黄文涛2. 一种k均值和神经网络集成的语音识别方法[J]. 计算机工程与应用, 2012, 48(12): 144-147.

[1]	WANG Changlong, ZHANG Yuandong, MIAO Hong, YANG Yuheng. Application of Double Channel Convolutional Neural Network in Pumpkin Diseases Identification [J]. Computer Engineering and Applications, 2021, 57(5): 183-189.
[2]	ZHANG Ziran, HUANG Weihua, CHEN Yang, ZHANG Zheng, LI Ziyuan. Improved Ant Colony Path Planning Algorithm Based on Bidirectional Search [J]. Computer Engineering and Applications, 2021, 57(21): 270-277.
[3]	LU Junjie, HUANG Jinquan, LU Feng. Likelihood K-means Clustering for Gas Path Failure Diagnostics of Turbofan Engine [J]. Computer Engineering and Applications, 2020, 56(9): 136-141.
[4]	WANG Weihong, ZENG Yingjie. Collaborative Filtering Recommendation Algorithm Based on Clustering and User Preference [J]. Computer Engineering and Applications, 2020, 56(3): 68-73.
[5]	CAO Lin, WANG Zhiteng, CHEN Liang, LI Hongshun, GAO Shen, ZHANG Zili. Neural Network Ensemble Based on Improved Quantum Immune Algorithm [J]. Computer Engineering and Applications, 2020, 56(22): 142-147.
[6]	MA Jinghui, PAN Wei, WANG Ru. 3D Point Cloud Classification Based on K-means Clustering [J]. Computer Engineering and Applications, 2020, 56(17): 181-186.
[7]	GUO Yongkun, ZHANG Xinyou, LIU Liping, DING Liang, NIU Xiaolu. K-means Clustering Algorithm of Optimizing Initial Clustering Center [J]. Computer Engineering and Applications, 2020, 56(15): 172-178.
[8]	LOU Yingdan, XU Jinglin, HUANG Lixia, ZHANG Xueying. Speech Recognition Based on MLLR and MAP Under Distant Noise Reverberation Environment [J]. Computer Engineering and Applications, 2020, 56(10): 122-126.
[9]	LIU Qiang1, SHI Hong1, WANG Pingxin2，3, YANG Xibei1. Three-Way Clustering Analysis Based on [ε] Neighborhood [J]. Computer Engineering and Applications, 2019, 55(6): 140-144.
[10]	XI Runping1，2, JIA Gaoyun1，2, ZHANG Yanning1，2, ZHANG Fujun1，2. Objects Detection in Different Source Images Based on Evaluation Vector [J]. Computer Engineering and Applications, 2019, 55(1): 180-185.
[11]	DONG Benzhi, NIE Lili, JING Weipeng, CUI Hang. Identification method of ambrostoma quadriimpressum motschlsky based on Faster R-CNN [J]. Computer Engineering and Applications, 2018, 54(23): 89-93.
[12]	ZHAO Yue, LI Yaoqiang, XU Xiaona, WU Licheng. Near-optimal active learning for Tibetan speech recognition [J]. Computer Engineering and Applications, 2018, 54(22): 156-159.
[13]	HUANG Xiaohui1，2, LI Jing1, MA Rui2，3. Design and research of Tibetan spoken speech corpus [J]. Computer Engineering and Applications, 2018, 54(13): 231-235.
[14]	HAN Chong1, YUAN Yingshan2, MEI Tao2, GENG Huiling2. Data stream outlier detection algorithm based on K-means [J]. Computer Engineering and Applications, 2017, 53(3): 58-63.
[15]	SONG Chunxiao, SUN Ying. Nonlinear geometric feature extraction algorithm for emotional speech recognition [J]. Computer Engineering and Applications, 2017, 53(20): 128-133.

Speech recognition based on k-means clustering and neural network ensemble

一种k均值和神经网络集成的语音识别方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics