计算机工程与应用 ›› 2015, Vol. 51 ›› Issue (3): 197-201.

• 信号处理 • 上一篇    下一篇

结合节拍语义和MFCC声学特征的音乐流派分类

庄  严,于凤芹   

  1. 江南大学 物联网工程学院,江苏 无锡 214122
  • 出版日期:2015-02-01 发布日期:2015-01-28

Combining beat semantic features and MFCC acoustic features for music genre classification

ZHUANG Yan, YU Fengqin   

  1. School of Internet of Things Engeneering, Jiangnan University, Wuxi, Jiangsu 214122, China
  • Online:2015-02-01 Published:2015-01-28

摘要: 由于音乐节拍的强度、快慢、持续时间等是反映音乐不同流派风格的重要语义特征,而音乐节拍多属于由打击乐器所产生的低频部分,为此利用小波变换对音乐信号进行6层分解来提取低频节拍特征;针对节拍特征差异不明显的音乐流派,提出用描述频域能量包络的MFCC声学特征与节拍特征结合,并用基于音乐流派机理分析的8阶MFCC代替常用的12阶MFCC。对8类音乐流派实验仿真结果表明,基于语义特征和声学特征结合的方法,总体分类准确率可达68.37%,同时特征维数增加对分类时间影响很小。

关键词: 音乐流派分类, 节拍特征, Mel频率倒谱系数(MFCC), 小波分解, 支持向量机

Abstract: As the music beat features such as intensity, speed and duration which belong to semantic features characterize the different music genres, while the music beats are mostly generated by percussion component of the low-frequency part which can be extracted by using 6 layers wavelet decomposition of music signals. For genres which are not easy to be differentiated by beats features only, a method by combining the acoustic characteristics describing the frequency-domain energy envelope MFCC and beat features is proposed, and 8-order MFCC is used instead of the usual 12-order MFCC based on the music genres mechanism analysis. Music genres experiments simulation results evaluated on 8 genre classes show that the method based on combination of semantic and acoustic characteristics, the overall classification accuracy rate reaches 68.37%, while the increase of feature dimensions has little increase in the time-consuming.

Key words: music genre classification, beat features, Mel-Frequency Cepstral Coefficients(MFCC), wavelet decomposition, Support Vector Machine(SVM)