Research on parallel clustering of power load based on improved K-Means algorithm

doi:10.3778/j.issn.1002-8331.1603-0110

Computer Engineering and Applications ›› 2017, Vol. 53 ›› Issue (17): 260-265.DOI: 10.3778/j.issn.1002-8331.1603-0110

Previous Articles Next Articles

Research on parallel clustering of power load based on improved K-Means algorithm

XU Yuanbin1, LI Guohui2，3, GUO Kun2，3, GUO Songrong2，3, LIN Wei2，3

1.State Grid Electic Power Company, Fuzhou 350001, China
2.College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China
3.Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou 350116, China

Online:2017-09-01 Published:2017-09-12

基于改进的并行K-Means算法的电力负荷聚类研究

许元斌1，李国辉2，3，郭昆2，3，郭松荣2，3，林炜2，3

1.国网信通亿力科技有限责任公司，福州 350001
2.福州大学数学与计算机科学学院，福州 350116
3.福建省网络计算与智能信息处理重点实验室，福州 350116

Abstract

Abstract: The electrical power enterprise usually based on power load data, uses the traditional K-Means algorithm to classify the customers, but the biggest drawback of this method must be specified by the user manual clustering number of clusters. It proposes a method combining Canopy algorithm and K-Means algorithm based on load clustering, without the need to manually specify the number of clusters, the automatic division of the customer. First of all, it collects users’ electricity data, uses the parallel computing framework MapReduce to preprocess the original data. Then, it uses Canopy and K-Means algorithm to establish the clustering model of automatic load. Finally, in the real consumption data on the empirical analysis, by using the Silhouette index to evaluate, it shows that the proposed method is more stable and convenient, and has wider applicability.

Key words: load clustering, parallel computing, Canopy, K-Means

摘要： 电力企业通常根据电力负荷数据，采用传统的K-Means算法对客户进行划分，而这种方法最大的缺陷就是必须由用户手动指定聚类簇数。提出了一种将Canopy算法和K-Means算法结合应用于负荷聚类的方法，无需手动指定聚类簇数。收集到的用户历史用电数据，使用并行计算框架MapReduce对原始数据进行预处理。应用Canopy和K-Means算法建立自动负荷聚类模型。在真实用电数据上进行实证分析，通过使用Silhouette指标对结果进行评估，证明提出的方法更加稳定和具有广泛的适用性。

关键词: 负荷聚类, 并行计算, Canopy, K-Means

XU Yuanbin1, LI Guohui2，3, GUO Kun2，3, GUO Songrong2，3, LIN Wei2，3. Research on parallel clustering of power load based on improved K-Means algorithm[J]. Computer Engineering and Applications, 2017, 53(17): 260-265.

许元斌1，李国辉2，3，郭昆2，3，郭松荣2，3，林炜2，3. 基于改进的并行K-Means算法的电力负荷聚类研究[J]. 计算机工程与应用, 2017, 53(17): 260-265.

[1]	WANG Changlong, ZHANG Yuandong, MIAO Hong, YANG Yuheng. Application of Double Channel Convolutional Neural Network in Pumpkin Diseases Identification [J]. Computer Engineering and Applications, 2021, 57(5): 183-189.
[2]	ZHANG Ziran, HUANG Weihua, CHEN Yang, ZHANG Zheng, LI Ziyuan. Improved Ant Colony Path Planning Algorithm Based on Bidirectional Search [J]. Computer Engineering and Applications, 2021, 57(21): 270-277.
[3]	CHENG Jingyi, DUAN Xianhua, ZHU Wei. Research on Metal Surface Defect Detection by Improved YOLOv3 [J]. Computer Engineering and Applications, 2021, 57(19): 252-258.
[4]	PAN Chengsheng, ZHANG Bin, LYU Yana, DU Xiuli, QIU Shaoming. K-Means Text Clustering Based on Improved Gray Wolf Optimization Algorithm [J]. Computer Engineering and Applications, 2021, 57(1): 188-193.
[5]	GAO Weijun, SHI Yang, YANG Jie, ZHANG Chunxia. An Improved Lightweight Head Detection Method [J]. Computer Engineering and Applications, 2021, 57(1): 207-212.
[6]	LU Junjie, HUANG Jinquan, LU Feng. Likelihood K-means Clustering for Gas Path Failure Diagnostics of Turbofan Engine [J]. Computer Engineering and Applications, 2020, 56(9): 136-141.
[7]	DU Wei, FU You. GPU-Based Least Squares Monte Carlo Algorithm Option Pricing [J]. Computer Engineering and Applications, 2020, 56(4): 225-229.
[8]	ZONG Xiaoping, TIAN Weiqian. Segmentation and Feature Extraction of Brain Tumor Based on Magnetic Resonance Image Using K-means [J]. Computer Engineering and Applications, 2020, 56(3): 187-193.
[9]	WANG Weihong, ZENG Yingjie. Collaborative Filtering Recommendation Algorithm Based on Clustering and User Preference [J]. Computer Engineering and Applications, 2020, 56(3): 68-73.
[10]	JIN Zhiyan, YANG Lei, LIN Junmin, WANG Zhe. Communication Avoiding Algorithm of Generalized Conjugate Residual Method [J]. Computer Engineering and Applications, 2020, 56(3): 74-79.
[11]	WANG Zilong, LI Jin, SONG Yafei. Improved K-means Algorithm Based on Distance and Weight [J]. Computer Engineering and Applications, 2020, 56(23): 87-94.
[12]	LIU Jiahua, CHEN Jingyu. Design of Multi-core Parallel Spiking Neural Network Simulator [J]. Computer Engineering and Applications, 2020, 56(22): 244-250.
[13]	ZHANG Zhen, LI Haofang, LI Mengzhou. Research on YOLO Algorithm in Abnormal Security Images [J]. Computer Engineering and Applications, 2020, 56(21): 187-193.
[14]	MA Jinghui, PAN Wei, WANG Ru. 3D Point Cloud Classification Based on K-means Clustering [J]. Computer Engineering and Applications, 2020, 56(17): 181-186.
[15]	MA Keqin, YANG Yanjiao, QIN Hongwu, GENG Lin, WANG Pidong. K-means Clustering Algorithm Combining Max-Min Distance and Weighted Density [J]. Computer Engineering and Applications, 2020, 56(16): 50-54.

Research on parallel clustering of power load based on improved K-Means algorithm

基于改进的并行K-Means算法的电力负荷聚类研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics