计算机工程与应用 ›› 2010, Vol. 46 ›› Issue (21): 232-234.DOI: 10.3778/j.issn.1002-8331.2010.21.067

• 工程与应用 • 上一篇    下一篇

基于粗糙集的医疗数据挖掘研究与应用

叶明全1,伍长荣2,胡学钢3   

  1. 1.皖南医学院 计算机教研室,安徽 芜湖 241002
    2.安徽师范大学 数学计算机学院,安徽 芜湖 241002
    3.合肥工业大学 计算机与信息学院,合肥 230009
  • 收稿日期:2009-01-06 修回日期:2009-03-13 出版日期:2010-07-21 发布日期:2010-07-21
  • 通讯作者: 叶明全

Research and application on medical data mining based on rough sets

YE Ming-quan1,WU Chang-rong2,HU Xue-gang3   

  1. 1.Computer Staff Room,Wannan Medical College,Wuhu,Anhui 241002,China
    2.Institute of Mathematics and Computer,Anhui Normal University,Wuhu,Anhui 241002,China
    3.Institute of Computer and Information,Hefei University of Technology,Hefei 230009,China
  • Received:2009-01-06 Revised:2009-03-13 Online:2010-07-21 Published:2010-07-21
  • Contact: YE Ming-quan

摘要: 医疗数据挖掘能够对现有病历数据库中数据进行自动分析并且提供有价值的医学知识。针对临床病历数据库中存在大量重复样本和冗余属性,从而影响医疗诊断的精度和速度这一问题,建立了基于信息论的粗糙集理论模型和SQL语言之间的关系,提出了基于SQL语言的条件信息熵属性约简算法,利用数据库查询语言实现了数据清洗、求核和属性约简等过程。实验结果表明该算法实现简单,运行效率高,为粗糙集理论更广泛地应用于具体的医疗数据挖掘提供了一种方法。

Abstract: Medical data mining can generate effective knowledge rules from medical record database.It is found that the precision and speed of medical diagnosis are unsatisfied due to large-scale repeating data and redundant attributes in medical data during practical applications.To solve the problem,this paper establishes relation of rough set theory based on information view and SQL language,and conditional information entropy attributes reduction algorithm based on SQL language is put forward.Experiments show that the algorithm can be easily realized taking advantage of database query language and is more efficient.It establishes a method by which rough set theory is widely used in material medical data mining.

中图分类号: