Research on parallel clustering of power load based on improved K-Means algorithm

doi:10.3778/j.issn.1002-8331.1603-0110

Computer Engineering and Applications, 2017, Vol. 53, Issue (17): 260-265.

XU Yuanbin1, LI Guohui2,3, GUO Kun2,3, GUO Songrong2,3, LIN Wei2,3

1. State Grid Electric Power Company (国网信通亿力科技有限责任公司), Fuzhou 350001, China
2. College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China
3. Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou 350116, China

Online: 2017-09-01 Published: 2017-09-12

Abstract

Electric power companies typically segment customers by applying the traditional K-Means algorithm to power load data. The main drawback of this approach is that the user must specify the number of clusters manually. This paper proposes a load clustering method that combines the Canopy algorithm with the K-Means algorithm, so that customers are divided automatically without a manually specified cluster count. First, users' historical electricity consumption data are collected and the raw data are preprocessed with the parallel computing framework MapReduce. Then, the Canopy and K-Means algorithms are used to build an automatic load clustering model. Finally, an empirical analysis on real consumption data, evaluated with the Silhouette index, shows that the proposed method is more stable and convenient and has wider applicability.

Key words: load clustering, parallel computing, Canopy, K-Means
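The pipeline described in the abstract can be illustrated with a minimal single-process sketch. It is not the authors' implementation: the paper's MapReduce parallelization is omitted, and the function names, the thresholds `t1` and `t2`, and the toy load profiles are all assumptions made for illustration. A Canopy pass chooses the number of clusters and the initial centroids, K-Means refines them, and the mean Silhouette coefficient scores the result.

```python
import math

def dist(a, b):
    """Euclidean distance between two equal-length load profiles."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def canopy_centers(points, t1, t2):
    """Canopy pass (assumes t1 > t2): return one initial centroid per canopy."""
    remaining = list(points)
    centers = []
    while remaining:
        seed = remaining.pop(0)
        # Every point within the loose threshold t1 joins this canopy.
        members = [seed] + [p for p in remaining if dist(seed, p) < t1]
        centers.append([sum(col) / len(members) for col in zip(*members)])
        # Points within the tight threshold t2 can no longer seed a canopy.
        remaining = [p for p in remaining if dist(seed, p) >= t2]
    return centers

def kmeans(points, init_centers, max_iter=100):
    """Lloyd's K-Means started from the given centroids."""
    centers = [list(c) for c in init_centers]
    labels = []
    for _ in range(max_iter):
        new_labels = [min(range(len(centers)), key=lambda k: dist(p, centers[k]))
                      for p in points]
        if new_labels == labels:  # assignments are stable: converged
            break
        labels = new_labels
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centers[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers

def silhouette(points, labels):
    """Mean Silhouette coefficient; points in singleton clusters score 0."""
    clusters = sorted(set(labels))
    if len(clusters) < 2:
        return 0.0
    scores = []
    for i, p in enumerate(points):
        def mean_dist(k):
            ds = [dist(p, q) for j, q in enumerate(points)
                  if labels[j] == k and j != i]
            return sum(ds) / len(ds) if ds else None
        a = mean_dist(labels[i])  # cohesion within the point's own cluster
        if a is None:
            scores.append(0.0)
            continue
        b = min(mean_dist(k) for k in clusters if k != labels[i])  # separation
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)

# Toy load profiles (hypothetical values): two obvious consumption levels.
profiles = [
    [1.0, 1.1, 0.9, 1.0], [1.2, 1.0, 1.1, 0.9], [0.9, 1.0, 1.0, 1.1],
    [5.0, 5.2, 4.9, 5.1], [5.1, 4.8, 5.0, 5.0], [4.9, 5.0, 5.2, 4.8],
]
centers = canopy_centers(profiles, t1=3.0, t2=1.5)
labels, centers = kmeans(profiles, centers)
score = silhouette(profiles, labels)
print(len(centers), labels, round(score, 3))
```

In this sketch the cluster count is never passed in: it falls out of the Canopy thresholds, which is the property the abstract highlights. The thresholds themselves still have to be chosen, and the values used here were picked by hand for the toy data.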

XU Yuanbin, LI Guohui, GUO Kun, GUO Songrong, LIN Wei. Research on parallel clustering of power load based on improved K-Means algorithm[J]. Computer Engineering and Applications, 2017, 53(17): 260-265.

