Chinese dialect active identification method fusing diversity measure

doi:10.3778/j.issn.1002-8331.1611-0129

Computer Engineering and Applications ›› 2017, Vol. 53 ›› Issue (15): 149-154.DOI: 10.3778/j.issn.1002-8331.1611-0129

Previous Articles Next Articles

Chinese dialect active identification method fusing diversity measure

XIA Yuguo1, DAI Hongxia1, GU Mingliang2

1.School of Electronic Information Engineering, Jiangsu Vocational College of Information Technology, Wuxi, Jiangsu 214153, China
2.School of Linguistic Science, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China

Online:2017-08-01 Published:2017-08-14

融合多样性测度的汉语方言主动辨识方法

夏玉果1，戴红霞1，顾明亮2

1.江苏信息职业技术学院电子信息工程学院，江苏无锡 214153
2.江苏师范大学语言科学学院，江苏徐州 221116

Abstract

Abstract: In order to solve the problem of the redundant training samples in dialect identification system, an approach for Chinese dialect identification fusing diversity measure is proposed. Firstly, the uncertain samples are chosen by SVM classifier, then according to the distribution of these samples, the uncertain samples with diversity are selected and the new training set including these distinctive samples is constructed after several iterations. Finally, SVM is reused to make the decision. Experimental results indicate that, compared with the traditional active method, the proposed approach effectively overcomes the redundancy of the samples and the number of manually annotated samples reduces 50% under the same condition of recognition accuracy.

Key words: Chinese dialect identification, active learning, Support Vector Machine（SVM）, diversity measure

摘要： 为了解决方言辨识系统中训练样本冗余的问题，提出了一种融合多样性测度的汉语方言主动辨识方法。利用SVM分类器选取不确定性的样本。根据样本间分布情况的测度算法，选取出兼具多样性的训练样本，经过多次迭代将这些最具区别性的样本组成训练集。将此训练集重新输入到SVM进行分类辨识。实验结果表明，该方法能有效克服选取样本的冗余，与传统的主动学习方法相比，在同等识别率的情况下，人工标注样本的数量减少了50%。

关键词: 汉语方言辨识, 主动学习, 支持矢量机, 多样性测度

XIA Yuguo1, DAI Hongxia1, GU Mingliang2. Chinese dialect active identification method fusing diversity measure[J]. Computer Engineering and Applications, 2017, 53(15): 149-154.

夏玉果1，戴红霞1，顾明亮2. 融合多样性测度的汉语方言主动辨识方法[J]. 计算机工程与应用, 2017, 53(15): 149-154.

[1]	HAN Weiyu, CHENG Longsheng. Research on Roling Bearing Failure Mode Classification Based on MTS and SVM [J]. Computer Engineering and Applications, 2021, 57(6): 239-246.
[2]	WEN Jiebin, YANG Wenzhong, MA Guoxiang, ZHANG Zhihao, LI Hailei. Micro-expression Recognition Based on Apex Frame Optical Flow and Convolutional Autoencoder [J]. Computer Engineering and Applications, 2021, 57(4): 127-133.
[3]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[4]	XU Xianfeng, CAI Lulu, ZHANG Li. Photovoltaic Power Generation Prediction Algorithm Based on MLP and DBN [J]. Computer Engineering and Applications, 2021, 57(3): 266-272.
[5]	CHEN Fujian, XIE Weixin, XIA Ting. Adaptive Anti-occlusion Target Tracking Algorithm Based on LCT+ [J]. Computer Engineering and Applications, 2021, 57(22): 190-198.
[6]	ZHANG Hainan, YOU Xiaoming, LIU Sheng, LIU Zhongqiang. Interactive Learning Cuckoo Search Algorithm [J]. Computer Engineering and Applications, 2020, 56(7): 147-154.
[7]	CHEN Feiyu, YUE Wenbin, RAO Yinglu, XING Jinhao, MA Xiaojing. Autonomous Precision Landing of Drone Based on Improved TLD Algorithm [J]. Computer Engineering and Applications, 2020, 56(7): 247-254.
[8]	MA Ling, LUO Xiaoshu, JIANG Pinqun. Research on Dot Matrix Character Recognition Based on Template Matching and Support Vector Machine [J]. Computer Engineering and Applications, 2020, 56(4): 134-139.
[9]	ZHANG Zhonglin, FENG Yibang, ZHAO Zhongkai. Oversampling Method for Unbalanced Data Sets Based on SVM [J]. Computer Engineering and Applications, 2020, 56(23): 220-228.
[10]	HUANG Guangjun, DENG Yuanlong. Polarizer Visual Defect Detection and Classification Based on Improved LBP and SVM Algorithm [J]. Computer Engineering and Applications, 2020, 56(22): 251-255.
[11]	SUI Xiuwu, NIU Jiabao, LI Haotian, QIAO Mingmin. Upper Limb sEMG Gesture Recognition Method Based on NMF-SVM Model [J]. Computer Engineering and Applications, 2020, 56(17): 161-166.
[12]	YANG Yu，ZENG Guohui，HUANG Bo. Fault Diagnosis Method of Bearings Based on Dual-Tree Complex Wavelet Packet Transform and Improved SVM [J]. Computer Engineering and Applications, 2020, 56(17): 231-235.
[13]	YANG Ying, WANG Jun, WANG Gang. Customer Complaints Classification Method Based on Improved Random Subspace [J]. Computer Engineering and Applications, 2020, 56(13): 230-235.
[14]	YANG Yanrong, SONG Rongjie, ZHOU Zhaoyong. Network Intrusion Detection Method Based on GAN-PSO-ELM [J]. Computer Engineering and Applications, 2020, 56(12): 66-72.
[15]	ZHAO Xiaoyong, WANG Ningning, WANG Lei. Research of Outlier Ensemble Mining Based on Active Learning [J]. Computer Engineering and Applications, 2020, 56(12): 112-117.

Chinese dialect active identification method fusing diversity measure

融合多样性测度的汉语方言主动辨识方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics