Vocal effort related robust speech recognition based on adaptation method

Abstract

Abstract: Adaptation of acoustic models is presented to cope with the acoustic variability caused vocal effort variability in Mandarin speech recognition. Acoustic models trained on normal speech are applied to recognize sentences under the remaining four vocal effort modes. The maximum likelihood linear regression adaptation method is extended to the stochastic segment model, and the acoustic models after adaptation are used to recognize speech of corresponding vocal effort mode. Experiments conducted on “863-test” show that there is significant?decrease in recognition accuracy in case of mismatched speech models, and the recognition performance can be improved considerably by adaptation. This proves that adaptation of acoustic models is effective in solving the acoustic variability caused vocal effort.

Key words: speech recognition, vocal effort, adaptation, maximum likelihood linear regression

摘要： 针对声音效果变化引起的语音声学特性的改变，提出基于声学模型自适应的方法。分析了正常模式下训练的声学模型在识别其他声效模式下语音的表现;根据随机段模型的模型特性，将最大似然线性回归方法引入到随机段模型系统中，并利用自适应后的声学模型来识别对应的声效模式下的语音。在“863-test”测试集上进行的汉语连续语音识别实验显示，正常模式下训练的声学模型识别其他四种声效模式下的语音时，识别精度均有较大程度的下降；而自适应后的系统在识别对应的声效模式的语音时，识别精度有了明显的改观。表明了基于声学模型自适应的方法在解决语音识别中声音效果变化问题上的有效性。

关键词: 语音识别, 声音效果, 自适应, 最大似然线性回归

CHAO Hao, SONG Cheng, XUE Xiao, LIU Zhizhong. Vocal effort related robust speech recognition based on adaptation method[J]. Computer Engineering and Applications, 2016, 52(2): 156-160.

晁浩，宋成，薛霄，刘志中. 基于模型自适应的声效鲁棒性语音识别算法[J]. 计算机工程与应用, 2016, 52(2): 156-160.

[1]	ZOU Jie, LI Jun. Multi-strategy Covariance Matrix Learning Differential Evolution Algorithm [J]. Computer Engineering and Applications, 2021, 57(7): 78-87.
[2]	ZHAO Pengfei, LI Yanling, LIN Min. Intent Detection of Domain Adaptation Combined with Capsule Network [J]. Computer Engineering and Applications, 2021, 57(21): 188-194.
[3]	QIAN Zhengyuan, ZENG Guosun. Differential Evolution Algorithm Guided by Elite Island Population [J]. Computer Engineering and Applications, 2021, 57(20): 73-81.
[4]	WANG Bei, CHEN Jinguang, WANG Mingming. Improved Target Tracking Algorithm Based on Kernelized Correlation Filter in Complex Scenarios [J]. Computer Engineering and Applications, 2021, 57(2): 198-208.
[5]	HU Yuelin, CAI Xiaodong, LIU Yuzhu. Cross-Domain Person Re-identification Algorithm Combining Inter-Domain and Intra-Domain Changes [J]. Computer Engineering and Applications, 2021, 57(13): 212-217.
[6]	SHAN Yugang, HU Weiguo. Review of Visual Object Tracking Algorithms of Adaptive Direction and Scale [J]. Computer Engineering and Applications, 2020, 56(9): 13-23.
[7]	WANG Guoyi, SUN Yongrong, WU Lei, ZENG Qinghua. Scale Adaptive Correlation Tracking Method for Circular Target Based on Block Detection [J]. Computer Engineering and Applications, 2020, 56(8): 177-184.
[8]	JIN Bingchu, WEN Hui, SHI Zhiqiang, ZHANG Zhiyuan, CHEN Junjie. Malware Classification Method Based on Path Tree of Behavior [J]. Computer Engineering and Applications, 2020, 56(11): 98-104.
[9]	LOU Yingdan, XU Jinglin, HUANG Lixia, ZHANG Xueying. Speech Recognition Based on MLLR and MAP Under Distant Noise Reverberation Environment [J]. Computer Engineering and Applications, 2020, 56(10): 122-126.
[10]	CHI Zongzheng, DONG Shaozheng, GUO Tong, REN Zhilei, ZHOU Kuanjiu, GUO He. Research on Wind Farm Layout Based on Hyper-Heuristic [J]. Computer Engineering and Applications, 2019, 55(7): 220-225.
[11]	ZHAO Chanjuan, ZHOU Shaoguang, LIU Lili, DING Qian. Hyperspectral Image Classification Based on Homogenous Region and Transfer Component Analysis [J]. Computer Engineering and Applications, 2019, 55(19): 198-206.
[12]	WU Yanwen1, LI Bin1, SUN Chenhui1, DU Jiawei1, WANG Xinyue2. Research on Domain Adaptive Recommendation Methods Based on Transfer Learning [J]. Computer Engineering and Applications, 2019, 55(13): 59-65.
[13]	ZHAO Yue, LI Yaoqiang, XU Xiaona, WU Licheng. Near-optimal active learning for Tibetan speech recognition [J]. Computer Engineering and Applications, 2018, 54(22): 156-159.
[14]	WANG Yanbo, YIN Hong, PENG Zhenrui, JIANG Zhaoyuan. Elite cuckoo algorithm with gravitational search and Gaussian perturbation [J]. Computer Engineering and Applications, 2018, 54(21): 48-55.
[15]	LI Chang, XU Qi, LI Guanglei, ZHOU Huachun. On-demand adaptation method for multi-domain security services based on service function chaining [J]. Computer Engineering and Applications, 2018, 54(21): 56-64.

Vocal effort related robust speech recognition based on adaptation method

基于模型自适应的声效鲁棒性语音识别算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics