计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (25): 126-128.DOI: 10.3778/j.issn.1002-8331.2008.25.038

• 数据库、信号与信息处理 • 上一篇    下一篇

基于SVM的流行音乐中人声的识别

石自强,李海峰,孙佳音   

  1. 哈尔滨工业大学 计算机科学与技术学院,哈尔滨 150001
  • 收稿日期:2008-03-19 修回日期:2008-05-19 出版日期:2008-09-01 发布日期:2008-09-01
  • 通讯作者: 石自强

Vocal discrimination in pop music based on SVM

SHI Zi-qiang,LI Hai-feng,SUN Jia-yin   

  1. Department of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China
  • Received:2008-03-19 Revised:2008-05-19 Online:2008-09-01 Published:2008-09-01
  • Contact: SHI Zi-qiang

摘要: 针对流行音乐中人声的发现问题,使用SVM分类器针对MFCC特征进行训练和分类。依据音频特征的连续性,后期对分类结果进行低通滤波。实验结果表明,该方法在帧层面上的识别率可以达到85.76%。实验中也发现不同语种的演唱者在发音上,特别是在MFCC特征上存在很大的统计差异性。实验中对歌曲分类的结果可以作为近一步实现音乐相似性度量的依据之一。

关键词: 流行音乐结构分析, 人声发现, 支持向量机

Abstract: Facing the problem of vocal discrimination in pop music,the authors propose applying MFCC parameters as features,and Support Vector Machine(SVM) as classifier.Due to the continuity of audio signal features,the authors consider low-pass filtering to the classification results as post-processing.Experiment results show that at frame level,a quite promising classification accuracy of 85.76% can be obtained.It is also revealed that singers with different languages have large vocal differences in pronunciation, especially in MFCC feature statistics.The classification results may be used as a similarity measure for music structure analysis in the future work.

Key words: pop music structure analysis, Vocal/Non-vocal discrimination, Support Vector Machine(SVM)