一种融合变异系数的k-mean聚类分析方法

计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (35): 114-117.

• 数据库、信号与信息处理 • 上一篇下一篇

一种融合变异系数的k-mean聚类分析方法

范阿琳，任树华

大连工业大学信息科学与工程学院，辽宁大连 116034

出版日期:2012-12-11 发布日期:2012-12-21

K-means clustering algorithm based on coefficient of variation

FAN Alin, REN Shuhua

School of Information Science and Engineering, Dalian Polytechnic University, Dalian, Liaoning 116034, China

Online:2012-12-11 Published:2012-12-21

摘要/Abstract

摘要： K-means聚类算法的性能依赖于距离度量的选择，k-means算法将欧几里德距离作为最常用的距离度量方法。欧氏距离认为所有属性在聚类中作用是相同的，但是这种距离度量方法并不能准确反映样本间的相异性。针对这种不足，提出了融合变异系数的k-means聚类分析方法（CV-k-means），利用变异系数权重向量来减少不相关属性的影响。实验结果表明，该方法的聚类结果优于k-means算法。

关键词: k-means 算法, 相异性度量, 权, 变异系数

Abstract: The performance of k-means clustering algorithm depends on the selection of distance metrics. The Euclid distance is commonly chosen as the similarity measure in k-means clustering algorithm, which treats all features equally and does not accurately reflect the dissimilarity among samples. K-means clustering algorithm based on Coefficient of Variation（CV-k-means） is proposed in this paper to solve this problem. The CV-k-means clustering algorithm uses variation coefficient weight vector to decrease the affects of irrelevant features. The experimental results show that the proposed algorithm can generate better clustering results than k-means algorithm.

Key words: k-means clustering, dissimilarity measure, weighting, coefficient of variation

范阿琳，任树华. 一种融合变异系数的k-mean聚类分析方法[J]. 计算机工程与应用, 2012, 48(35): 114-117.

FAN Alin, REN Shuhua. K-means clustering algorithm based on coefficient of variation[J]. Computer Engineering and Applications, 2012, 48(35): 114-117.

[1]	伍京华，耿翠阳，韩佳丽. 基于Agent的多属性决策模型及其在高校实验教学中的应用[J]. 计算机工程与应用, 2021, 57(8): 238-243.
[2]	李莉，纪欣沅，宋嵩. 回环软件缺陷数量预测模型[J]. 计算机工程与应用, 2021, 57(7): 158-163.
[3]	韩晓微，韩震，岳高峰，崔建江. 救灾无人机的优化A*航迹规划算法[J]. 计算机工程与应用, 2021, 57(6): 232-238.
[4]	李玲，王峥，李娜. 雾计算中支持计算外包的访问控制方案[J]. 计算机工程与应用, 2021, 57(6): 81-87.
[5]	谭乐平，宋平，杨琦峰. 绿色供应链的投贷联动融资决策均衡[J]. 计算机工程与应用, 2021, 57(5): 229-238.
[6]	王鹏，叶学义，王涛，钱丁炜. 双偏差双空间局部方向模式的人脸识别[J]. 计算机工程与应用, 2021, 57(4): 91-99.
[7]	张俊杰，张聪，赵涵捷. 重复利用状态值的竞争深度Q网络算法[J]. 计算机工程与应用, 2021, 57(4): 134-140.
[8]	陈世明，林子朋，高彦丽，裴惠琴. 自适应耦合权重下的异质群体一致性研究[J]. 计算机工程与应用, 2021, 57(4): 231-235.
[9]	张公凯，陈才学，郑拓. 改进鲸鱼算法在电动汽车有序充电中的应用[J]. 计算机工程与应用, 2021, 57(4): 272-278.
[10]	熊健，覃仁超，何梦乙，刘建兰，唐风扬. 改进随机森林在Android恶意软件检测中的应用[J]. 计算机工程与应用, 2021, 57(3): 130-136.
[11]	周舟，韩芳，王直杰. 改进SSD算法在中国手语识别上的应用[J]. 计算机工程与应用, 2021, 57(3): 156-161.
[12]	朱惠娟，宗平，丛玉华. 基于权重池的多尺度图像质量评估方法[J]. 计算机工程与应用, 2021, 57(3): 215-221.
[13]	蒋斌，梁小安，张亮，高杨军. 基于改进修正权重的证据组合方法[J]. 计算机工程与应用, 2021, 57(24): 100-106.
[14]	杨坤桥，王煜翔，郭兵，李强. 委托股权证明共识机制的改进研究[J]. 计算机工程与应用, 2021, 57(24): 107-114.
[15]	刘万军，张正寰，曲海成. 融合DenseNet的多尺度图像去模糊模型[J]. 计算机工程与应用, 2021, 57(24): 219-226.

一种融合变异系数的k-mean聚类分析方法

K-means clustering algorithm based on coefficient of variation

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics