Automatic identification of Chinese dialects based on global information fusion

doi:10.3778/j.issn.1002-8331.1610-0315

Computer Engineering and Applications ›› 2017, Vol. 53 ›› Issue (17): 160-165.DOI: 10.3778/j.issn.1002-8331.1610-0315

Previous Articles Next Articles

Automatic identification of Chinese dialects based on global information fusion

QIU Yuanhang1, GU Mingliang1, MA Yong1, JIN Yun1, HAN Jun1, ZHAO Dongmei1, ZHAO Chenghao2

1. School of Physics and Electronic Engineering, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China
2. School of Electrical Engineering & Automation, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China

Online:2017-09-01 Published:2017-09-12

全局信息融合的汉语方言自动辨识

邱远航1，顾明亮1，马勇1，金赟1，韩军1，赵冬梅1，赵呈昊2

1.江苏师范大学物理与电子工程学院，江苏徐州 221116
2.江苏师范大学电气工程及自动化学院，江苏徐州 221116

Abstract

Abstract: A new method of Chinese dialects identification based on Identity vector（I-vector） combined with prosodic information is proposed. The high-dimensional super-vector is mapped to a low-dimensional I-vector representation by Total Variability（TV） model. Channel compensation and feature dimension reduction are also performed. Chinese is a typical language with a tone and Chinese dialects have obvious differences among rhythm, stress and other rhythmic structure. The serial fusion of I-vectors with global prosodic information can improve the distinguishability of Chinese dialects effectively. The Equal Error Rate（EER） using fusion strategy of five Chinese dialects and Mandarin is 8.01%, which is 56.2% lower than the Gaussian Mixture Model-Universal Background Model（GMM-UBM） method. The experimental results show that the I-vector method fusing global prosodic information can improve the Chinese dialects identification accuracy effectively.

Key words: Chinese dialects identification, prosodic features, I-vector, features fusion

摘要： 提出身份认证矢量（Identity vector，I-vector）结合韵律信息的汉语方言辨识方法。全差异空间替代本征音与本征信道空间，将高维超矢量映射为低维I-vector表示，并进行信道补偿与特征降维处理。汉语是有调语言，各方言在其韵律结构上具有明显差异，I-vector特征融合全局韵律信息，可有效增补各方言鉴别性。利用融合信息对闽、粤、吴等五种方言以及普通话进行辨识实验，等错率（Equal Error Rate，EER）达到8.01%，比高斯混合模型-通用背景模型（Gaussian Mixture Model-Universal Background Model，GMM-UBM）降低56.2%，表明融合全局韵律信息的I-vector方法可有效提高汉语方言辨识正确率。

关键词: 汉语方言辨识, 韵律特征, I-vector, 特征融合

QIU Yuanhang1, GU Mingliang1, MA Yong1, JIN Yun1, HAN Jun1, ZHAO Dongmei1, ZHAO Chenghao2. Automatic identification of Chinese dialects based on global information fusion[J]. Computer Engineering and Applications, 2017, 53(17): 160-165.

邱远航1，顾明亮1，马勇1，金赟1，韩军1，赵冬梅1，赵呈昊2. 全局信息融合的汉语方言自动辨识[J]. 计算机工程与应用, 2017, 53(17): 160-165.

[1]	SONG Zhonghao, GU Yu, CHEN Xu, NIE Shengdong. Target Detection in High-Resolution Remote Sensing Image Based on Weighted Strategy [J]. Computer Engineering and Applications, 2021, 57(13): 199-206.
[2]	ZHANG Yonghong, YAN Bin, TIAN Wei, WANG Jiangeng. Multi-feature High-Resolution Remote Sensing Road Extraction Based on PPMU-net [J]. Computer Engineering and Applications, 2021, 57(1): 200-206.
[3]	WANG Xin, ZHANG Hongran. Robust i-vector speaker recognition method based on DNN processing [J]. Computer Engineering and Applications, 2018, 54(22): 167-172.
[4]	HUANG Dongmei, XU Qiongqiong, HE Qi, DU Yanling. Multi-features fusion for image auto-annotation based on DBN model [J]. Computer Engineering and Applications, 2018, 54(1): 224-228.
[5]	SHU Yi1, XING Yujuan2. Speaker verification based on i-vector and sparse representation using PCA dictionary learning [J]. Computer Engineering and Applications, 2016, 52(18): 144-147.
[6]	XING Yujuan, CAO Xiaoli, TAN Ping, LI Hengjie. Speaker verification based on WLDA and i-sparse representation classification [J]. Computer Engineering and Applications, 2016, 52(13): 173-176.
[7]	LIU Fumin, ZHANG Zhibin, SHEN Jiquan. Emotion recognition based on multi-features fused by kernel canonical correlation analysis [J]. Computer Engineering and Applications, 2014, 50(9): 193-196.
[8]	YU Jinxia, XU Jingmin. Adaptive particle filter tracking algorithm by fusing multi-features [J]. Computer Engineering and Applications, 2014, 50(18): 178-181.
[9]	ZHANG Renshang. Features extraction algorithm of CT image based on GNSCT-LCM [J]. Computer Engineering and Applications, 2014, 50(11): 159-162.
[10]	LI Ke, XU Kehu, ZHANG Bo. Military camouflage target tracking based on features fusion adaptively [J]. Computer Engineering and Applications, 2012, 48(34): 171-174.
[11]	YE Jihua, WANG Shimin, GUO Fan, YANG Qinhong, YU Min. Research of Gabor features fusion in embedded face recognition system [J]. Computer Engineering and Applications, 2012, 48(11): 148-151.
[12]	HUANG Xiaozhong，LI Hui，XU Dongxing，GUO Wei. SVM speaker verification based on prosodic feature [J]. Computer Engineering and Applications, 2011, 47(15): 148-151.
[13]	WANG Qiang¹,ZHANG Yong-kui². Research on Chinese story link detection based on SVM [J]. Computer Engineering and Applications, 2008, 44(33): 141-143.
[14]	CAI Dong-feng,WANG Zhi-chao,JI Duo,ZHANG Gui-ping. Border distance based multi-vector document clustering method [J]. Computer Engineering and Applications, 2008, 44(3): 198-201.
[15]	WANG Huan，REN Ming-wu，YANG Jing-yu. New particle filter tracking method based on multi-features fusion [J]. Computer Engineering and Applications, 2007, 43(25): 21-24.

Automatic identification of Chinese dialects based on global information fusion

全局信息融合的汉语方言自动辨识

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics