Computer Engineering and Applications ›› 2008, Vol. 44 ›› Issue (14): 228-230.
• Engineering and Applications •
LIU Quan-jin1,LI Ying-xin2
刘全金1,李颖新2
Abstract: This paper proposes an approach to classifying samples of gene expression profiles that is based on the structure of the profiles. First, genes with a small Bhattacharyya distance are removed as "noise genes". Second, the multi-edit nearest-neighbor algorithm is modified to eliminate "noise samples". A combined classifier of support vector machines is then constructed with the Boosting algorithm and used to classify the samples. Finally, the method is applied to samples of colon cancer gene expression profiles. The results show that the method is feasible and effective.
Key words: Bhattacharyya distance, multi-edit-nearest-neighbor algorithm, Boosting algorithm
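Abstract (translated from the Chinese): A sample classification method for gene expression profiles is proposed, based on the structure of the profiles. First, the Bhattacharyya distance of each gene is used to measure the sample-class information that the gene carries, and noise genes with a small Bhattacharyya distance are filtered out. Next, the multi-edit nearest-neighbor algorithm is modified to remove noise samples. A combined support vector machine classifier is then built with the Boosting algorithm. Finally, classification experiments are carried out on colon cancer gene expression profile samples. The experimental results show that the method is simple and effective, and highly practical for classifying gene expression profile samples.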
Keywords (translated from the Chinese): Bhattacharyya distance, multi-edit nearest-neighbor method, Boosting algorithm
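The gene-filtering step named in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula or cutoff used, so this sketch assumes that each gene's expression values are univariate Gaussian within each of two classes (the standard closed form of the Bhattacharyya distance for that case) and that a fixed number of top-ranked genes is kept. The names `bhattacharyya_distance`, `filter_genes`, `n_keep`, and `eps` are hypothetical and do not come from the paper.

```python
import math
from statistics import fmean, pvariance


def bhattacharyya_distance(a, b, eps=1e-12):
    """Bhattacharyya distance between two samples of one gene.

    Assumption: each class is modeled as a univariate Gaussian.
    eps is a hypothetical guard against zero variance.
    """
    mu_a, mu_b = fmean(a), fmean(b)
    var_a = pvariance(a) + eps
    var_b = pvariance(b) + eps
    var_sum = var_a + var_b
    # Term driven by the separation of the class means.
    mean_term = 0.25 * (mu_a - mu_b) ** 2 / var_sum
    # Term driven by the difference between the class variances.
    var_term = 0.5 * math.log(var_sum / (2.0 * math.sqrt(var_a * var_b)))
    return mean_term + var_term


def filter_genes(samples, labels, n_keep):
    """Return the sorted indices of the n_keep genes with the largest distance.

    samples: list of rows, one row of expression values per sample.
    labels:  list of class labels, 0 or 1, one per sample.
    Keeping a fixed count is an assumption; a distance threshold would
    work the same way.
    """
    n_genes = len(samples[0])
    scores = []
    for g in range(n_genes):
        class0 = [row[g] for row, lab in zip(samples, labels) if lab == 0]
        class1 = [row[g] for row, lab in zip(samples, labels) if lab == 1]
        scores.append(bhattacharyya_distance(class0, class1))
    ranked = sorted(range(n_genes), key=lambda g: scores[g], reverse=True)
    return sorted(ranked[:n_keep])
```

Genes whose class-conditional distributions overlap heavily score close to zero and are dropped as "noise genes". The remaining genes would then go to the sample-editing and Boosting stages, which are not sketched here.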
LIU Quan-jin1,LI Ying-xin2. Application of Boosting algorithm to sample categorization of gene expression profiles[J]. Computer Engineering and Applications, 2008, 44(14): 228-230.
刘全金1,李颖新2. Boosting算法在基因表达谱样本分类中的应用[J]. 计算机工程与应用, 2008, 44(14): 228-230.
URL: http://cea.ceaj.org/EN/
http://cea.ceaj.org/EN/Y2008/V44/I14/228