Natural sounds recognition using GMM distribution

Computer Engineering and Applications ›› 2011, Vol. 47 ›› Issue (25): 152-155.

• 数据库、信号与信息处理 • Previous Articles Next Articles

Natural sounds recognition using GMM distribution

YU Qingqing，LI Ying，LI Yong

College of Mathematics and Computer Science，Fuzhou University，Fuzhou 350108，China

Received:1900-01-01 Revised:1900-01-01 Online:2011-09-01 Published:2011-09-01

基于高斯混合模型的自然环境声音的识别

余清清，李应，李勇

福州大学数学与计算机科学学院，福州 350108

Abstract

Abstract: A recognition method for natural sounds based on Gaussian Mixture Model（GMM） distribution is proposed.Mel-Frequency Cepstral Coefficients（MFCCs） are used to analyze natural sounds for their feature extraction.The expectation maximization algorithm is used to learn a Gaussian mixture model distribution of MFCCs for the set of audio feature vectors that describe each sound.Minimum classification error criterion and vote rule are used to yield higher recognition accuracy for natural sounds.Experimentally，compared with K-Nearest Neighbor（KNN） method，GMM is able to achieve a higher accuracy rate for discriminating 36 classes of natural sounds.The classified accuracy rate of GMM reaches to 95.83%.

Key words: Mel-frequency cepstral coefficients, Gaussian mixture model, natural sounds recognition, vote rule

摘要： 提出了一种基于高斯混合模型（GMM）的自然环境声音的识别方法。提取Mel频率倒谱系数（MFCCs）来分析声音信号;对于每种声音使用期望最大化算法基于MFCC特征集建立高斯混合模型;使用最小错误率判决规则和投票裁决的方法进行识别。使用GMM对36种自然环境的声音进行识别的正确率可达95.83%，且识别效果优于K最近邻（KNN）。

关键词: Mel频率倒谱系数, 高斯混合模型, 自然环境声音的识别, 投票裁决

YU Qingqing，LI Ying，LI Yong. Natural sounds recognition using GMM distribution[J]. Computer Engineering and Applications, 2011, 47(25): 152-155.

余清清，李应，李勇. 基于高斯混合模型的自然环境声音的识别[J]. 计算机工程与应用, 2011, 47(25): 152-155.

[1]	PAN Peixin, PAN Zhongliang. Active Contour Image Segmentation Combined with Saliency [J]. Computer Engineering and Applications, 2021, 57(8): 225-230.
[2]	LEI Henglin, Gulanbaier Tuerhong, Mairidan Wushouer, ZHANG Dongmei. Review of Novelty Detection [J]. Computer Engineering and Applications, 2021, 57(5): 47-55.
[3]	JIA Bingbing, CAO Hui, QIN Chijie. Research on Improving Phoneme Recognition Rate Based on Subspace Gaussian Mixture Model and Deep Neural Network Combination [J]. Computer Engineering and Applications, 2019, 55(24): 117-121.
[4]	CHEN Chao. Target Tracking Algorithm Involving Gaussian Mixture Model and Weighted Likelihood [J]. Computer Engineering and Applications, 2019, 55(12): 124-131.
[5]	LI Chao, SUN Jun. Effective method of weld defect detection and classification based on machine vision [J]. Computer Engineering and Applications, 2018, 54(6): 264-270.
[6]	QIU Gongda1, HE Ming1, ZHU Chaozheng1, YANG Jie2, LIU Yong1. Fuzzy clustering based on connected point with max density in sparse border [J]. Computer Engineering and Applications, 2018, 54(14): 82-88.
[7]	LIANG Kaibin, GUAN Yihong. Brain MR Images segmentation method based on hidden Gaussian mixture model [J]. Computer Engineering and Applications, 2018, 54(10): 196-203.
[8]	SUN Kai，XIE Linbo. Moving objects detection method based on combination of improved local binary pattern and W4 algorithm [J]. Computer Engineering and Applications, 2017, 53(5): 187-191.
[9]	SUN Peng1，2, XIA Fei1，2，3, ZHANG Hao1，2，3, PENG Daogang1，2, MA Xi1，2, LUO Zhijiang1，2. Research of human fall detection algorithm based on improved Gaussian mixture model [J]. Computer Engineering and Applications, 2017, 53(20): 173-179.
[10]	CHEN Hui, HU Likun, HUANG Yuwen. Stereo matching algorithm based on Gaussian mixture model and tree structure [J]. Computer Engineering and Applications, 2017, 53(20): 195-200.
[11]	NIU Yirong, WANG Shitong. Fast image segmentation algorithm based on noise benefit [J]. Computer Engineering and Applications, 2016, 52(21): 195-201.
[12]	HU Zhili, GUO Min. Fast segmentation in color image based on SLIC and GrabCut [J]. Computer Engineering and Applications, 2016, 52(2): 186-190.
[13]	DU Nannan, ZHAO Hui. Rearch on prosodic hierarchy conversion for Uyghur emotional speech [J]. Computer Engineering and Applications, 2016, 52(19): 154-160.
[14]	SHU Yi1, XING Yujuan2. Speaker verification based on i-vector and sparse representation using PCA dictionary learning [J]. Computer Engineering and Applications, 2016, 52(18): 144-147.
[15]	ZHANG Mingguang, ZHANG Yu. Distribution network state estimation based on artificial neural network for pseudo measurement modeling [J]. Computer Engineering and Applications, 2016, 52(17): 253-256.

Natural sounds recognition using GMM distribution

基于高斯混合模型的自然环境声音的识别

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics