计算机工程与应用 ›› 2007, Vol. 43 ›› Issue (18): 234-238.

• 工程与应用 • 上一篇    下一篇

一种新聚类算法在基因表达数据分析中的应用

曹 晖,席 斌,米 红   

  1. 厦门大学 信息科学与技术学院 模式识别与智能系统研究所,福建 厦门 361005
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-06-21 发布日期:2007-06-21
  • 通讯作者: 曹 晖

Application of new clustering algorithms in gene expression data

CAO Hui,XI Bin,MI Hong   

  1. College of Science and Technology,Xiamen University,Xiamen,Fujian 361005,China
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-06-21 Published:2007-06-21
  • Contact: CAO Hui

摘要: 自组织特征映射神经网络与层次聚类算法是两种较经典的分析基因表达数据的聚类算法,但由于基因表达数据的复杂性与不稳定性,这两种算法都存在着自身的优劣。因此,在比较两种算法差异性的基础上,创造性地提出了一种新算法,即通过SOM算法对基因表达数据进行聚类,再用层次聚类将每个类对应的神经元权值二次聚类,并将此算法应用在酵母菌基因表达数据中,用实验证明改进算法克服了自组织算法的一些缺陷,提高了基因聚类的效能。

Abstract: Self-Organizing Maps(SOM) and the hierarchical clustering are two of the most classical clustering technologies for analyzing gene expression data,which exist own advantages and disadvantages on account of the complexity and the instability of gene expression data.Therefore,on base of comparing difference of the two clustering technologies,this article creatively proposes one new algorithm,that is first clustering gene expression data with SOM and second clustering the weight of nerve cells corresponding the clustering from the first step.In succession,the new algorithm is applied to the published data of yeast gene expression to prove that it conquers some bug of SOM and improves the efficiency of gene clustering through emulation mode.