计算机工程与应用 ›› 2015, Vol. 51 ›› Issue (7): 136-140.

• 数据库、数据挖掘、机器学习 • 上一篇    下一篇

基于改进模糊C均值的软件缺陷预测研究

张  焯1,徐  玲2,杨  丹2   

  1. 1.重庆大学 数学与统计学院,重庆 400044
    2.重庆大学 软件学院,重庆 400044
  • 出版日期:2015-04-01 发布日期:2015-03-31

Research on software defect prediction based on improved fuzzy C-means

ZHANG Zhuo1, XU Ling2, YANG Dan2   

  1. 1.College of Mathematics and Statistics, Chongqing University, Chongqing 400044, China
    2.College of Software Engineering, Chongqing University, Chongqing 400044, China
  • Online:2015-04-01 Published:2015-03-31

摘要: 软件缺陷预测用来预测软件系统各个模块中是否存在BUG。传统的软件缺陷预测技术研究主要局限在有监督方法上,这类方法需要大量的已标注数据进行训练,但在工程实际中,这类标签数据不易获取。提出了一种结合模拟退火和遗传算法的改进模糊C均值算法,以解决模糊C均值容易受初始聚类中心影响而收敛到局部最优的缺陷。实验结果表明提出的方法在软件缺陷预测中具备高鲁棒性和较高预测精度。

关键词: 软件缺陷预测, 模糊C均值, 模拟退火, 遗传算法, 无监督

Abstract: Software defect prediction is to predict whether there is a bug in a software system module. Traditional researches on software defect prediction mainly focus on supervised method. This type of method needs a lot of instances with labels as the training set. However, in engineering practice, the instances with labels are difficult to obtain. This paper proposes an improved fuzzy C-means algorithm, combining simulated annealing and genetic algorithm, to solve the defect that fuzzy C-means is easily affected by the initial cluster centers. The experimental results show that the proposed method has high robustness and accuracy on software defect prediction.

Key words: software defect prediction, fuzzy C-means, simulated annealing, genetic algorithm, unsupervised