一种基于交集的聚类组合算法

计算机工程与应用 ›› 2007, Vol. 43 ›› Issue (2): 177-177.

一种基于交集的聚类组合算法

江永全,杨燕,许翔燕

西南交通大学信息学院

收稿日期:2006-01-10 修回日期:1900-01-01 出版日期:2007-01-11 发布日期:2007-01-11
通讯作者: 江永全 river0001

Clustering Combination Algorithm Based on Intersection

YongQuan Jiang,Yan Yang,Xiangyan Xu

西南交通大学信息学院

Received:2006-01-10 Revised:1900-01-01 Online:2007-01-11 Published:2007-01-11
Contact: YongQuan Jiang

摘要/Abstract

摘要： 聚类作为一种无监督的学习，能根据数据间的相似程度自动地进行分类。本文提出的基于交集的聚类组合新方法，借鉴了选举投票的思想。给定同一数据集的不同聚类结果，此算法先求出不同聚类结果中每个簇的对应关系，然后计算这几个聚类结果对应簇的交集，对剩余的有争议对象进行投票，最后把投票之后仍未确定归属的对象分配给最近对象所在的簇，或者不经过投票直接将有争议的对象分配给最近对象所在的簇。实验表明，两种方法都能明显改善聚类质量，投票后得到的结果要略优于不投票的结果。

关键词: 交集, 投票, 聚类, 聚类组合

Abstract: Being an unsupervised learning, clustering is a division of data into groups of similar objects. This paper presents a new intersection-based clustering combination algorithm, which imitates the ways of voting. Assigns some different clustering results of a same data set, this algorithm extracts the corresponding relations of each cluster in these different clustering results first, and then compute the intersection of corresponding clusters of these results, put the remaining disputable objects to vote, finally distribute the objects in abeyance after voting to the nearest object’s cluster, or distribute the remaining disputable objects to the nearest object’s cluster without voting. The experiment indicates both methods can obviously improve the clustering performance; the result with voting is better than the result without voting.

Key words: intersection, vote, clustering, clustering combination

江永全,杨燕,许翔燕. 一种基于交集的聚类组合算法[J]. 计算机工程与应用, 2007, 43(2): 177-177.

YongQuan Jiang,Yan Yang,Xiangyan Xu. Clustering Combination Algorithm Based on Intersection[J]. Computer Engineering and Applications, 2007, 43(2): 177-177.

[1]	方美东, 王辉, 张爱华. 双层非负矩阵分解的分形图像压缩算法[J]. 计算机工程与应用, 2022, 58(8): 204-213.
[2]	韩海韵, 杨有龙, 孙丽芹. 结合模糊聚类的多示例集成算法[J]. 计算机工程与应用, 2022, 58(7): 87-96.
[3]	王永贵, 李昕. 融合狼群算法和模糊聚类的混合推荐算法[J]. 计算机工程与应用, 2022, 58(5): 104-111.
[4]	闫军, 常乐, 封丽华. 在线订单分批及拣选路径规划模型及算法[J]. 计算机工程与应用, 2022, 58(4): 283-289.
[5]	王英博, 韩国淼, 王铭泽. 基于子空间聚类的协同过滤推荐算法[J]. 计算机工程与应用, 2022, 58(3): 127-134.
[6]	朱良奇, 黄勃, 黄季涛, 马莉媛, 史志才. 融合BERT和自编码网络的短文本聚类研究[J]. 计算机工程与应用, 2022, 58(2): 145-152.
[7]	梁鸿翔, 张步烨, 李炜卓, 程茜雅. 结合网络表示学习和文本卷积网络的类案发现[J]. 计算机工程与应用, 2022, 58(2): 153-160.
[8]	孙璐, 梁永全. 融合网格划分和DBSCAN的改进聚类算法[J]. 计算机工程与应用, 2022, 58(14): 73-79.
[9]	谢习华, 王刚, 辛涛, 赵喻明. 基于SLIC和改进区域生长的非结构化道路识别[J]. 计算机工程与应用, 2022, 58(14): 210-218.
[10]	余瑶, 杜世强, 宋金梅. 面向多视图聚类的低秩张量表示学习[J]. 计算机工程与应用, 2022, 58(13): 154-163.
[11]	罗琪, 焦明海. 双端可共享网络的多模态行人重识别方法[J]. 计算机工程与应用, 2022, 58(13): 235-240.
[12]	张亚玲, 屈玲玉. 应用BWP指标的差分隐私保护k-means算法[J]. 计算机工程与应用, 2022, 58(10): 108-115.
[13]	李云红, 张轩, 李传真, 苏雪平, 聂梦瑄, 毕远东, 谢蓉蓉. 融合DBSCAN的改进YOLOv3目标检测算法[J]. 计算机工程与应用, 2022, 58(10): 208-215.
[14]	张欣环, 刘宏杰, 吴金洪, 施俊庆, 毛程远, 孟国连. 基于占空比的聚类算法评价指标研究[J]. 计算机工程与应用, 2022, 58(1): 175-181.
[15]	付燕, 韩泽, 叶鸥. 针对近重复视频的FD-means聚类清洗算法[J]. 计算机工程与应用, 2022, 58(1): 197-203.