计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (11): 111-114.

• 数据库、信号与信息处理 • 上一篇    下一篇

数据缺失条件下的贝叶斯优化算法

张亚萍1,胡学钢2,方振国1,姜恩华1   

  1. 1.淮北师范大学 物理与电子信息学院,安徽 淮北 235000
    2.合肥工业大学 计算机与信息学院,合肥 230009
  • 出版日期:2012-04-11 发布日期:2012-04-16

Bayesian optimization algorithm under conditions of incomplete data

ZHANG Yaping1, HU Xuegang2, FANG Zhenguo1, JIANG Enhua1   

  1. 1.School of Physics and Electronic Information, Huaibei Normal University, Huaibei, Anhui 235000, China
    2.School of Computer & Information, Hefei University of Technology, Hefei 230009, China
  • Online:2012-04-11 Published:2012-04-16

摘要: 针对朴素贝叶斯算法存在的三方面约束和限制,提出一种数据缺失条件下的贝叶斯优化算法。该算法计算任两个属性的灰色相关度,根据灰色相关度完成相关属性的联合、冗余属性的删除和属性加权;根据灰色相关度执行改进EM算法完成缺失数据的填补,对经过处理的数据集用朴素贝叶斯算法进行分类。实验结果验证了该优化算法的有效性。

关键词: 灰色相关度, 条件属性, 类别属性, 属性联合, 属性加权

Abstract: An improved naive classification algorithm is presented to solve the three problems that affect the accuracy of naive Bayes algorithm. The gray related degree about condition attributes and classes is calculated, according to which the attribute joining and attribute weighted are completed. The absent attributes are filled with an improved EM algorithm. The samples are classified by Bayesian classification algorithm. The results of experiments indicate that the optimization algorithm has the higher efficiency for clustering.

Key words: grey relational degree, condition attribute, class attribute, attribute joining, attribute weighted