Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (7): 175-177. DOI: 10.3778/j.issn.1002-8331.2009.07.052

• Database, Signal and Information Processing •

Novel algorithm for discretization of mixed decision table based on rough sets and probability distribution

HUANG Jing-tao, LIU Feng-nian, MA Jian-wei, CAO Zhe

  1. College of Electronic and Information Engineering, Henan University of Science and Technology, Luoyang, Henan 471003, China
  • Received: 2008-01-21 Revised: 2008-04-25 Online: 2009-03-01 Published: 2009-03-01
  • Contact: HUANG Jing-tao

基于粗集和概率分布的混合决策表离散化算法

黄景涛,刘丰年,马建伟,曹 哲   

  1. 河南科技大学 电子信息工程学院,河南 洛阳 471003
  • Corresponding author: 黄景涛

Abstract: Classical rough set theory, when applied to the discretization of mixed decision tables (tables containing both continuous and discrete attributes), tends to produce too many rules with low accuracy. To address this problem, a discretization method is used that combines a greedy algorithm with the probabilities of attribute-value intervals; unlike the traditional approach, it does not restrict discretization to the continuous attributes of the mixed decision table. First, the continuous attributes are discretized with an improved greedy algorithm. Then the probability of each value interval of the continuous attributes is computed, and the interval with the maximum probability is subdivided. Finally, further discretization of the originally discrete attributes is considered. Simulation results indicate that the method strengthens the resolving power of the system and reduces the number of rules, and that the decision table is always consistent after discretization. Compared with many existing discretization methods, which do not take decision consistency into account, it retains the useful information of the system. The results demonstrate the validity and accuracy of the method.

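Abstract (translated from the Chinese): To address the large number of rules and the low accuracy that result when classical rough set theory is used to discretize decision tables with mixed continuous and discrete attributes, a discretization method combining a greedy algorithm with attribute-value interval probabilities is adopted. The method targets the shortcoming of traditional approaches, which consider only the discretization of continuous attributes in a mixed decision table. First, an improved greedy algorithm performs a preliminary discretization of the continuous attributes in the mixed decision table. Next, the probability of each attribute-value interval of the continuous attributes is computed, and intervals with high probability are refined. Finally, further discretization of the originally discrete attributes is considered, which strengthens the resolving power of the system. The discretized decision table is always consistent; compared with many current discretization methods that ignore decision consistency, this method retains the useful information of the system to the greatest possible extent. Simulation analysis verifies the effectiveness of the method.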
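
The interval-probability step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how an interval is subdivided or how the improved greedy algorithm chooses its initial cut points, so the sketch assumes the cut points are already given and, as a purely illustrative choice, splits the most probable interval at the midpoint between its two middle distinct values. All function names (`interval_index`, `interval_probabilities`, `split_most_probable_interval`, `is_consistent`) are hypothetical.

```python
from bisect import bisect_right
from collections import Counter


def interval_index(value, cuts):
    """Index of the interval holding `value`; `cuts` must be sorted.

    Interval i covers cuts[i-1] <= value < cuts[i] (open-ended at both extremes).
    """
    return bisect_right(cuts, value)


def interval_probabilities(values, cuts):
    """Empirical probability that an attribute value falls in each interval."""
    counts = Counter(interval_index(v, cuts) for v in values)
    n = len(values)
    return [counts.get(i, 0) / n for i in range(len(cuts) + 1)]


def split_most_probable_interval(values, cuts):
    """Add one cut inside the interval with the highest probability.

    ASSUMPTION: the new cut is the midpoint between the two middle distinct
    values in that interval; the abstract does not specify the split rule.
    """
    probs = interval_probabilities(values, cuts)
    target = max(range(len(probs)), key=probs.__getitem__)
    members = sorted({v for v in values if interval_index(v, cuts) == target})
    if len(members) < 2:
        # A single distinct value cannot be separated any further.
        return sorted(cuts)
    mid = len(members) // 2
    new_cut = (members[mid - 1] + members[mid]) / 2
    return sorted(list(cuts) + [new_cut])


def is_consistent(rows, decisions):
    """True if no two objects share condition values but differ in decision."""
    seen = {}
    for row, decision in zip(rows, decisions):
        if seen.setdefault(tuple(row), decision) != decision:
            return False
    return True
```

For example, with values `[1.0, 2.0, 2.5, 3.0, 3.5, 8.0]` and a single initial cut at `5.0`, five of the six values fall in the first interval, so that interval is the one refined. `is_consistent` can then be run on the discretized rows to check the consistency property the abstract emphasizes.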