Speech enhancement algorithm based on noise estimation of binary masking

Abstract

Abstract: In order to solve the residual background noise and the musical noise resulted by the existing speech enhancement algorithm for hearing aids, a speech enhancement algorithm based on noise estimation of binary masking is proposed in this paper. The estimated background noise and initial enhanced speech are obtained by using the minima-controlled recursive averaging algorithm. The estimated noise and the initial enhanced speech are processed by gammatone filter and inner cells model and time-frequency representation is obtained. Binary masking of noisy is calculated. The binary masking is used to synthesize enhanced speech by utilizing human auditory masking in time-frequency domain. Experimental results show that the proposed algorithm is compared with the MCRA algorithm, Speech Intelligibility Index（SII）, Perceptual Evaluation of Speech Quality（PESQ） and SNR are improved.

Key words: speech enhancement, hearing aids, noise estimate, binary masking

摘要： 针对现有的助听器语音增强算法在非平稳噪声环境下，残留大量背景噪声的同时还引入了“音乐噪声”，致使增强语音可懂度和信噪比不理想等问题。提出了一种基于噪声估计的二值掩蔽语音增强算法，该算法利用人耳听觉感知理论，结合人耳的听觉特性和耳蜗的工作机理。采用最小值控制递归平均（Minima-Controlled Recursive Averaging，MCRA）算法获得估计噪声和初步增强语音；将估计噪声和初步增强语音分别通过可以模拟人工耳蜗模型的gammatone滤波器组进行滤波处理，得到各自的时频表示形式；利用人耳的听觉掩蔽特性，计算含噪语音在时频域的二值掩蔽；利用二值掩蔽得到增强语音。实验结果表明：该算法很大程度上去除了谱减法引入的“音乐噪声”，与基于MCRA谱减法相比，增强语音的语言可懂度指数（Speech Intelligibility Index，SII）、主观语音质量评估（Perceptual Evaluation of Speech Quality，PESQ）和信噪比（Signal to Noise Ratio，SNR）都得到了提高。

关键词: 语音增强, 助听器, 噪声估计, 二值掩蔽

CAO Longtao1, LI Ruwei1, BAO Changchun1, WU Shuicai2. Speech enhancement algorithm based on noise estimation of binary masking[J]. Computer Engineering and Applications, 2015, 51(17): 222-227.

曹龙涛1，李如玮1，鲍长春1，吴水才2. 基于噪声估计的二值掩蔽语音增强算法[J]. 计算机工程与应用, 2015, 51(17): 222-227.

[1]	WANG Shiqi, ZENG Qingning, LONG Chao, XIONG Songling, QI Xiaoxiao. Multi-task Learning for Speech Enhancement and Detection [J]. Computer Engineering and Applications, 2021, 57(20): 197-202.
[2]	WANG Yan, JIA Hairong, JI Huifang, WANG Weimei. Feature Joint Optimization of Deep Belief Network for Speech Enhancement [J]. Computer Engineering and Applications, 2019, 55(9): 38-42.
[3]	JI Huifang, JIA Hairong, WANG Yan. Speech Enhancement Method for Improving Phase Spectrum Compensation [J]. Computer Engineering and Applications, 2019, 55(8): 48-52.
[4]	WANG Jie1, YANG Chengcheng1, MO Jiayong2, WANG Dunze1, WANG Xiexie1. A priori SNR estimator based on harmonic regeneration [J]. Computer Engineering and Applications, 2018, 54(7): 44-48.
[5]	WANG Hu, LI Jing, ZHAO Hengmiao, ZANG Yan, LI Chuntang. Speech enhancement algorithm of sparse low rank model and phase spectral compensation [J]. Computer Engineering and Applications, 2018, 54(5): 150-155.
[6]	WANG Bo，YU Fengqin，CHEN Ying. Speech enhancement based on nonsmooth nonnegative matrix factorization [J]. Computer Engineering and Applications, 2017, 53(7): 160-164.
[7]	WANG Xia1, WANG Dan1, WANG Guangyan2, ZHANG Yan1. Noisy face mask speech enhancement combining compressed sensing with EMD [J]. Computer Engineering and Applications, 2017, 53(18): 137-140.
[8]	ZHANG Jianwei, TAO Liang, ZHOU Jian, WANG Huabin. Improved minima controlled recursive averaging algorithm based on improved spectrum smoothing strategy and speech enhancement [J]. Computer Engineering and Applications, 2017, 53(1): 153-157.
[9]	HU Yonggang1, ZHANG Xiongwei1, ZOU Xia1, MIN Gang1，2, ZHANG Liwei1, WANG Jian3. Speech enhancement algorithm using ADMM sparse nonnegative matrix factorization [J]. Computer Engineering and Applications, 2016, 52(3): 108-112.
[10]	NING Kuangfeng, WANG Jingfang. DCT domain Wiener filter speech enhancement [J]. Computer Engineering and Applications, 2015, 51(8): 226-230.
[11]	GU Peng1, 2, ZHU Junhua1，2, DING Fei1，2. Method in noisy signal compressed sensing for speech de-noising [J]. Computer Engineering and Applications, 2015, 51(15): 216-220.
[12]	WANG Yulin1, TIAN Xuelong1，2, GAO Xueli1. DSP realization of modified speech enhancement algorithm based on adaptive filters [J]. Computer Engineering and Applications, 2015, 51(1): 208-212.
[13]	XU Wenchao, WANG Guangyan, GENG Yanxiang, BAI Fang, FEI Teng. Speech enhancement algorithm based on spectral subtraction and variable-step LMS algorithm [J]. Computer Engineering and Applications, 2015, 51(1): 213-217.
[14]	NING Kuangfeng, WANG Jingfang. Speech enhancement based on group-separable compressed sensing [J]. Computer Engineering and Applications, 2014, 50(24): 204-208.
[15]	XIA Lele1, SUN Yongrong1, WANG Yong2. Speech enhancement technology based on adaptive noise estimation [J]. Computer Engineering and Applications, 2014, 50(23): 225-228.

Speech enhancement algorithm based on noise estimation of binary masking

基于噪声估计的二值掩蔽语音增强算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics