Computer Engineering and Applications ›› 2009, Vol. 45 ›› Issue (36): 216-218.DOI: 10.3778/j.issn.1002-8331.2009.36.061

• 工程与应用 • Previous Articles     Next Articles

Application of new algorithm in gene expression profile clustering

ZHANG Guo-yin1,CHENG Hui-jie1,2,LIU Yong-mei1,YAO Ai-hong1   

  1. 1.College of Computer Science and Technology,Harbin Engineering University,Harbin 150001,China
    2.Department of Computer,Harbin Medical University,Harbin 150081,China
  • Received:2008-07-10 Revised:2008-11-17 Online:2009-12-21 Published:2009-12-21
  • Contact: ZHANG Guo-yin

一种新算法在基因表达谱聚类中的应用

张国印1,程慧杰1,2,刘咏梅1,姚爱红1   

  1. 1.哈尔滨工程大学 计算机科学与技术学院,哈尔滨 150001
    2.哈尔滨医科大学 计算机教研室,哈尔滨 150081
  • 通讯作者: 张国印

Abstract: Clustering techniques can extract biomedical information from gene expression data.The basic particle swarm optimization is liable to be trapped in local convergence and sensitive to adjustable parameters.A new algorithm is proposed to solve gene clustering problem.Four particles are initialized and two of them are chosen randomly as a pair.The result of K-means is assigned to one particle in the pair.At each iteration,every particle updates itself only according to its velocity and the best position of the pair it belongs to.One elitist particle is generated from each pair respectively and they are combined into a new pair.The new particle pair continues to search and its best position is just the global optima of the algorithm.Experiment results show that the algorithm has perfect homogeneity and separation in yeast gene expression data and has an advantage in computation time and convergence rate.

Key words: gene expression profile, particle swarm optimization, particle pair, clustering analysis

摘要: 聚类分析是从基因表达数据中提取生物医学信息的主要方法。针对基本粒子群优化算法易陷入局部极值和对可调参数敏感的问题,提出了一种新型粒子对算法来解决基因聚类问题。算法初始化四个粒子,随机分成两对,将K-均值快速聚类的结果作为每个粒子对中一个粒子的初始位置。在每次迭代中,粒子仅依靠自身速度和粒子对的最佳位置来完成自身更新。每个粒子对产生的精英粒子,组成一个新的粒子对,继续搜索,新粒子对的最优位置即为聚类算法的最优解。实验结果表明算法具有良好的同质性和差异性,且在计算时间和收敛速度方面具有相当的优势。

关键词: 基因表达谱, 粒子群优化, 粒子对, 聚类分析

CLC Number: