Training BP neural networks with MapReduce based on sample data slice disruptions

Computer Engineering and Applications, 2018, Vol. 54, Issue (2): 137-143. DOI: 10.3778/j.issn.1002-8331.1608-0069

Training BP neural networks with MapReduce based on sample data slice disruptions

CHEN Wanghu, YU Maoyi, MA Shengjun

College of Computer Science and Engineering, Northwest Normal University, Lanzhou 730070, China

Online:2018-01-15 Published:2018-01-31

Abstract

When a BP neural network is trained with MapReduce, the intermediate weight matrix produced by each map training task converges only on the input slice held by that training node, so convergence over the whole training sample set is hard to reach. To improve training efficiency and ensure the global convergence of MapReduce training, a MapReduce training method based on input slice disruption is proposed. Systematic sampling over the whole training sample set disrupts the existing input slices and produces a new input slice for each map training task. Each map task then continues iterative training on its new slice, starting from its original weight matrix, which accelerates the convergence of MapReduce training. Moreover, to speed up the local convergence of the map training tasks, after each round of training the weight matrix with the minimum global error among those produced by the map tasks is taken as the initial weight matrix of every map training task in the next round. Experimental results on Hadoop clusters show that the approach considerably improves the efficiency of training BP neural networks with MapReduce.

Key words: neural network, MapReduce, sample slices, convergence
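
The two steps named in the abstract, re-slicing the sample set by systematic sampling and carrying the lowest-error weight matrix into the next round, can be illustrated with a minimal Python sketch. This is not the authors' Hadoop implementation: the function names `systematic_slices`, `select_best_weights`, and `mean_squared_error` are hypothetical, the sampling interval is assumed to equal the number of map tasks, and a one-weight toy model stands in for a BP network.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

S = TypeVar("S")  # a training sample
W = TypeVar("W")  # a weight matrix, in any representation


def systematic_slices(samples: Sequence[S], num_slices: int) -> List[List[S]]:
    """Re-slice the whole sample set by systematic sampling (assumed scheme).

    Slice i receives samples i, i + num_slices, i + 2 * num_slices, ...
    so each new slice draws from across the full set instead of from
    one contiguous block.
    """
    if num_slices <= 0:
        raise ValueError("num_slices must be positive")
    return [list(samples[i::num_slices]) for i in range(num_slices)]


def select_best_weights(
    candidates: Sequence[W], global_error: Callable[[W], float]
) -> W:
    """Return the candidate weight matrix with the smallest global error."""
    if not candidates:
        raise ValueError("no candidate weight matrices")
    return min(candidates, key=global_error)


def mean_squared_error(w: float, samples: Sequence[Tuple[float, float]]) -> float:
    """Toy global error for a one-weight model y = w * x (illustration only)."""
    return sum((w * x - y) ** 2 for x, y in samples) / len(samples)


if __name__ == "__main__":
    # Toy data generated with a true weight of 2.0.
    data = [(float(x), 2.0 * x) for x in range(1, 11)]

    # New input slices for three hypothetical map tasks.
    for task_id, new_slice in enumerate(systematic_slices(data, 3)):
        print(task_id, new_slice)

    # Pretend each map task returned one weight; keep the one with the
    # lowest error over the whole sample set as the next round's start.
    best = select_best_weights(
        [1.5, 2.1, 3.0], lambda w: mean_squared_error(w, data)
    )
    print("initial weight for next round:", best)
```

In this sketch the global error is evaluated over the full sample set, so the selected weight reflects all slices rather than only the slice a single map task trained on.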

CHEN Wanghu, YU Maoyi, MA Shengjun. Training BP neural networks with MapReduce based on sample data slice disruptions[J]. Computer Engineering and Applications, 2018, 54(2): 137-143.
