Study on suitability of face mask speech quality evaluation algorithm

doi:10.3778/j.issn.1002-8331.1604-0320

Computer Engineering and Applications ›› 2017, Vol. 53 ›› Issue (19): 114-117.DOI: 10.3778/j.issn.1002-8331.1604-0320

Previous Articles Next Articles

Study on suitability of face mask speech quality evaluation algorithm

WANG Xia1, MA Junhui1, WANG Guangyan2, ZHANG Yan1

1.School of Electronic and Information Engineering, Heibei University of Technology, Tianjin 300401, China
2. School of Information Engineering, Tianjin University of Commerce, Tianjin 300134, China

Online:2017-10-01 Published:2017-10-13

面罩语音质量评价算法适用性研究

王霞1，马俊晖1，王光艳2，张艳1

1.河北工业大学电子信息工程学院，天津 300401
2.天津商业大学信息工程学院，天津 300134

Abstract

Abstract: The performance of speech quality evaluation for speech coding has been very clear, but it may not be applied to face mask speech. The suitability of speech quality evaluation measures under various noise environments in the application of air speech and face mask speech is discussed. Mean opinion score and three kinds of objective speech quality evaluation measures are taken to evaluate noisy speech and enhanced speech with various signal-to-noise, which include segmental signal-to-noise ratio, modified bark spectral distortion and perceptual evaluation of speech quality, and the suitability is judged by the accordance with subjective evaluation. Wiener filtering and LSA-MMSE algorithms are taken to enhance the noisy speech. Pink noise and wave noise are used. The simulation results show that the suitability of speech quality evaluation algorithms is limited to the kind of speech, SNR of noisy speech, background noise environment and the kind of speech enhancement algorithm. With regard to pink noise, PESQ is not suitable for evaluating air speech enhanced by wiener filtering, and MBSD is only suitable for evaluating face mask speech enhanced by LSA-MMSE. Under the environment of wave noise, PESQ can evaluate face mask speech, and MBSD is not suitable for evaluating face mask speech.

Key words: face mask speech, Wiener filtering, Modified Bark Spectral Distortion（MBSD）, Perceptual Evaluation of Speech Quality（PESQ）

摘要： 针对语音编码的音质评价算法性能已十分明确，但对于面罩语音不一定适用。讨论了语音质量评价算法对空气语音与面罩语音在不同噪声环境下的适用性。采用主观意见得分和三种客观评价测度对多种信噪比的带噪语音和增强语音进行评价，包括分段信噪比、改进的巴克谱失真（MBSD）和语音感知质量评价（PESQ），根据与主观评价的一致性判断客观评价方法的适用性。增强算法采用维纳滤波法和对数谱最小均方误差法（LSA-MMSE），噪声采用粉红噪声、海浪噪声。仿真结果表明，语音质量评价算法的适用性与语音类型、信噪比、背景噪声、增强算法种类有关。粉红噪声环境下，PESQ不适合评价经维纳滤波增强的空气语音；MBSD算法只适用于评价经LSA-MMSE增强的面罩语音。海浪噪声环境下，PESQ适用于评价面罩语音，MBSD不适合评价面罩语音。

关键词: 面罩语音, 维纳滤波, 改进巴克谱失真（MBSD）, 语音感知质量评价（PESQ）

WANG Xia1, MA Junhui1, WANG Guangyan2, ZHANG Yan1. Study on suitability of face mask speech quality evaluation algorithm[J]. Computer Engineering and Applications, 2017, 53(19): 114-117.

王霞1，马俊晖1，王光艳2，张艳1. 面罩语音质量评价算法适用性研究[J]. 计算机工程与应用, 2017, 53(19): 114-117.

[1]	NING Kuangfeng, WANG Jingfang. DCT domain Wiener filter speech enhancement [J]. Computer Engineering and Applications, 2015, 51(8): 226-230.
[2]	XI Ji1, LIANG Ruiyu2, WANG Guowei2, QIU Xiaomei2, MA Anjun2. Speech noise reduction algorithm research of multi-channel hearing aid [J]. Computer Engineering and Applications, 2014, 50(11): 237-240.
[3]	ZHU Yingjun1, YANG Yong2, ZHENG Xinghua1, ZHANG Wen1. Denoising method for OCT image based on combination of wavelet tranform with wiener filtering [J]. Computer Engineering and Applications, 2012, 48(34): 195-198.
[4]	LI Yunhong, YI Xin. Image denoising using wavelet packet transform based on correctional Wiener filtering [J]. Computer Engineering and Applications, 2012, 48(21): 182-185.
[5]	WANG Jingfang. Real-time voice activity robust detection [J]. Computer Engineering and Applications, 2011, 47(20): 147-150.
[6]	WANG Jingfang. Iterative Wiener filtering-based real-time voice in noises [J]. Computer Engineering and Applications, 2011, 47(19): 132-135.
[7]	ZHANG Liang，GONG Wei-guo. Improved Wiener filtering speech enhancement algorithm [J]. Computer Engineering and Applications, 2010, 46(26): 129-131.
[8]	FAN Xiao-chun，QIU Zheng-quan. Speaker recognition based on Wiener filter and MMCE [J]. Computer Engineering and Applications, 2010, 46(10): 113-114.
[9]	WU Shu-hong^1,2,ZHANG Gang¹,ZHAO Zhe-feng¹. 8 Kbit/s LD-aCELP speech coding with backward pitch detection [J]. Computer Engineering and Applications, 2009, 45(17): 119-121.
[10]	GUO Shui-xia¹,TANG Yong-jun². Extension and application of Wiener filtering in image processing [J]. Computer Engineering and Applications, 2008, 44(14): 178-180.
[11]	LI Ning,SHUI Peng-lang. Image denoising algorithm via doubly local Wiener filtering with windows based on SWT and DTCWT [J]. Computer Engineering and Applications, 2007, 43(28): 44-46.
[12]	. The Extension of Wiener Filtering When Noise Could Be Non-Additive In Image Processing [J]. Computer Engineering and Applications, 2007, 43(12): 184-185.

Study on suitability of face mask speech quality evaluation algorithm

面罩语音质量评价算法适用性研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 12

Recommended Articles

Metrics