Community discovery based on improved clustering algorithm with central constraints

doi:10.3778/j.issn.1002-8331.1611-0035

Abstract

Abstract: In the process of community discovery, it firstly starts a random walk from a node, calculates the symmetrical social distance between two nodes, and uses this distance to analyze the correlation between two user nodes. In the social network, there is a phenomenon of non-uniformity. Some individuals are very dense, while others are very sparse. Therefore, the virtual community needs to be excavated with specific community discovery technology. However, through the accuracy index of virtual community algorithm evaluation, it is found that for the data with large data volume and strong data stickiness, the clustering algorithm of poly-clustering algorithm（PCM） class effect is not ideal. The PCM algorithm is improved with central constraints, the new clustering algorithm is more suitable for the existence of some data missing or there is a large number of noise, the exception point of the real network data set. Experiments are carried out to verify the accuracy of the real data set.

Key words: symmertrical social distance, random walk, Possibilistic C-Means（PCM） algorithm, accuracy of indicators

摘要： 进行社区发现时，首先从某一节点开始进行随机行走，计算两个节点之间的对称社会距离，并用此距离来分析两个用户节点之间的相关性。社交网络中存在着关系不均匀的现象，有些个体之间关系非常稠密，而有些却异常稀疏，由此构成的虚拟社区需要用特定的社区发现技术进行挖掘。前人提出过利用可能性C均值聚类算法（PCM）和处理好的社会距离进行社区发现，但通过虚拟社区算法评价的准确度指标发现，对于数据量大，数据粘性强的数据，其聚类效果并不理想。而聚类中心的好坏直接决定着聚类性能的好与坏，因此利用类中心约束方法对PCM算法进行改进，得到的新型聚类算法更加适用于真实网络数据集。实验针对真实数据集，利用准确度指标进行了验证。

关键词: 对称社会距离, 随机行走, 可能性C均值算法, 准确度指标

XIA Yangyang, LIU Yuan, HUANG Yadong. Community discovery based on improved clustering algorithm with central constraints[J]. Computer Engineering and Applications, 2018, 54(8): 265-270.

夏洋洋，刘渊，黄亚东. 结合中心约束改进聚类算法的社区发现技术[J]. 计算机工程与应用, 2018, 54(8): 265-270.

[1]	WEI Dingfeng, LI Liang, CHAI Jing. Social Recommendation Algorithm by Fusing Item Information [J]. Computer Engineering and Applications, 2021, 57(19): 198-204.
[2]	GU Junhua, CHEN Bo, WANG Rui, ZHANG Suqi. Social Recommendation Combined with Important Nodes Trust Propagation [J]. Computer Engineering and Applications, 2021, 57(17): 190-195.
[3]	LI Weiyong, KONG Feng, ZHANG Wei, CHEN Yunfang. Node-Diffusing Capability of Biased Random Walks of Information Diffusion Method [J]. Computer Engineering and Applications, 2020, 56(24): 123-129.
[4]	XU Yong, ZHA Qianming, KE Mengya, LIU Fen. Phantom routing design for source location protection [J]. Computer Engineering and Applications, 2018, 54(19): 88-93.
[5]	LIU Ling, MA Yi, WEN Junhao, WANG Xibin. Social recommendation algorithm based on user influence walk model [J]. Computer Engineering and Applications, 2017, 53(10): 61-67.
[6]	CHEN Xi1, FAN Min1, XIONG Qingyu2. Saliency detection algorithm based on Markov chain of superpixels [J]. Computer Engineering and Applications, 2016, 52(7): 171-175.
[7]	LIU Fangxin1, HE Ming1，2, LIU Guangyun1, KANG Kai1. Fault-tolerance mechanism of random walk for underwater acoustic sensor networks based on dynamic behaviors [J]. Computer Engineering and Applications, 2015, 51(24): 86-89.
[8]	LIN Jiajia1, LIU Yanheng1，2, WANG Yazhou1, TIAN Xueying1. Social network topology model based on random walk and common points [J]. Computer Engineering and Applications, 2015, 51(12): 74-77.
[9]	ZHU Qiang. Random walking image segmentation based on improved adaptive snowfall model [J]. Computer Engineering and Applications, 2013, 49(23): 127-131.
[10]	LIN Xiao, XIAO Guoqiang, WU Song, QIU Kaijin. Random walk model based object recognition [J]. Computer Engineering and Applications, 2013, 49(21): 145-151.
[11]	HAN Qilong, PAN Haiwei, CAI Shaobin, YAO Nianmin, YIN Guisheng. Nodes similarity measure method based on sturcture-attribute balance graph [J]. Computer Engineering and Applications, 2013, 49(1): 15-18.
[12]	ZHANG Xuezhi1, QI Ji2, LIN Ping2. GPU programming and acceleration of Laplace growth model [J]. Computer Engineering and Applications, 2012, 48(22): 84-87.
[13]	WANG Hong1，2，WANG Xicheng1，3. Random walk soft clustering method for identifying overlapping functional modules in protein interaction networks [J]. Computer Engineering and Applications, 2011, 47(9): 4-7.
[14]	HE Jianjun，LI Renfa. Modified nodes ranking method using random walk model [J]. Computer Engineering and Applications, 2011, 47(12): 87-89.
[15]	ZENG Xiao-ping，LI Jin-zhi，LIU Guo-jin. Natural image matting based on RWR [J]. Computer Engineering and Applications, 2010, 46(25): 160-163.

Community discovery based on improved clustering algorithm with central constraints

结合中心约束改进聚类算法的社区发现技术

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics