Hunan dialects identification based on GMM and difference speech feature

doi:10.3778/j.issn.1002-8331.2009.35.039

Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (35): 129-131.DOI: 10.3778/j.issn.1002-8331.2009.35.039

• 数据库、信号与信息处理 • Previous Articles Next Articles

Hunan dialects identification based on GMM and difference speech feature

WANG Qi-xue，QIAN Sheng-you，ZHAO Xin-min

College of Physics and Information Science，Hunan Normal University，Changsha 410081，China

Received:2009-07-28 Revised:2009-10-19 Online:2009-12-11 Published:2009-12-11
Contact: WANG Qi-xue

基于差分特征和高斯混合模型的湖南方言识别

王岐学，钱盛友，赵新民

湖南师范大学物理与信息科学学院，长沙 410081

通讯作者: 王岐学

Abstract

Abstract: Rhythm of speech is an important acoustic distinction between different Chinese dialects，and the difference feature is an important reflection of rhythm.While difference features ΔMFCC & ΔΔMFCC are used as characteristic parameters and Gaussian Mixture Model（GMM） is used as a trained model，the dialect can be identified through calculating the likelihood probability of the test samples.Changsha dialect，Shaoyang dialect，Hengyang dialect and Mandarin have been investigated with this method，and its effect has been compared with the effect using MFFC as characteristic parameters.Experiment results show that a more high recognition rate and better anti-noise performance can be obtained by GMM trained with difference feature.

摘要： 语音的韵律是区分汉语方言的重要语音声学特征，而语音的差分特征是语音韵律的重要体现。采用差分特征ΔMFCC和ΔΔMFCC作为特征参数，用高斯混合模型（GMM）作为训练模型，通过计算测试样本的似然概率来识别方言的类型。用该方法对长沙方言、邵阳方言、衡阳方言和普通话进行了识别研究，并与采用MFCC作为特征参数的识别效果进行了比较。实验结果表明差分特征具有识别率高、抗噪声性能更好等优点。

CLC Number:

TP391

WANG Qi-xue，QIAN Sheng-you，ZHAO Xin-min. Hunan dialects identification based on GMM and difference speech feature[J]. Computer Engineering and Applications, 2009, 45(35): 129-131.

王岐学，钱盛友，赵新民. 基于差分特征和高斯混合模型的湖南方言识别[J]. 计算机工程与应用, 2009, 45(35): 129-131.

[1]	HE Bing^1，2，WANG Xuan¹. Robust digital watermarking image method based on Radon invariant moments [J]. Computer Engineering and Applications, 2010, 46(9): 98-101.
[2]	HOU Hong-hua，GUI Zhi-guo. Application of wavelet entropy in denoising processing and R wave detection of ECG signal [J]. Computer Engineering and Applications, 2010, 46(9): 116-119.
[3]	FAN Cong-xian，LIU Qiu-ju，XU Ting-rong. Research and improved algorithm of PageRank based on Web structure mining [J]. Computer Engineering and Applications, 2010, 46(9): 127-129.
[4]	CHI Dong-xiang¹，CHENG Wei-zhong². Liver MR image segmentation with iterative quadtree decomposition [J]. Computer Engineering and Applications, 2010, 46(9): 142-145.
[5]	YANG Hui-yun，ZHANG You-hui，HUO Li-ling，ZHAO Jin. Application of Bayes decision and neighborhood averaging method on image denoising [J]. Computer Engineering and Applications, 2010, 46(9): 149-151.
[6]	XU Cun-lu¹，GAO Jia¹，WU Guo-de². Brain tumor image segmentation method based on Chan-Vese model [J]. Computer Engineering and Applications, 2010, 46(9): 155-158.
[7]	QIU Xuan¹，ZHOU Ze-ming²，HU You-bin². Appling region variance and weighted mean to image fusion of curvelet transform [J]. Computer Engineering and Applications, 2010, 46(9): 166-168.
[8]	LIANG Ying-hong. Human detection method based on motion projection periodicity feature [J]. Computer Engineering and Applications, 2010, 46(9): 169-172.
[9]	YU Ming，ZHANG Yan-yun，XUE Cui-hong，SUN Lin-juan. Image segmentation algorithm of single handwritten Chinese characters [J]. Computer Engineering and Applications, 2010, 46(9): 180-182.
[10]	YIN Yong，HOU Hai-zhen. Adaptive shot segmentation method based on histogram frame difference [J]. Computer Engineering and Applications, 2010, 46(9): 186-189.
[11]	TAN Ye-hao，JIANG Zhi-fang，DU Xiao-liang，MENG Xiang-xu. Visualization of multi-dimensional data with interpolation based on compactly supported radial basis functions [J]. Computer Engineering and Applications, 2010, 46(9): 220-223.
[12]	SONG Liang，GENG Guo-hua. Research on clear redundancy in brain CT images [J]. Computer Engineering and Applications, 2010, 46(9): 208-211.
[13]	LU Shuang，ZHU Jian-hong，PENG Li. Improved ART2 neural network algorithm for fault diagnosis [J]. Computer Engineering and Applications, 2010, 46(9): 212-214.
[14]	ZHANG Xin-lin，CHEN Yuan，ZENG De-sheng. Method to track movable targets by mobile sources [J]. Computer Engineering and Applications, 2010, 46(9): 217-219.
[15]	YUE Yu-fang，AN Jian-zhu，ZHANG Yu-shuang. Study of target tracking system combined with 3D scene [J]. Computer Engineering and Applications, 2010, 46(9): 224-226.

Hunan dialects identification based on GMM and difference speech feature

基于差分特征和高斯混合模型的湖南方言识别

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics