Hadoop平台下新型图像并行处理模型设计

doi:10.3778/j.issn.1002-8331.1712-0198

计算机工程与应用 ›› 2019, Vol. 55 ›› Issue (6): 186-190.DOI: 10.3778/j.issn.1002-8331.1712-0198

Hadoop平台下新型图像并行处理模型设计

刘军，李威，吴梦婷，陈起凤

武汉工程大学计算机科学与工程学院，武汉 430205

出版日期:2019-03-15 发布日期:2019-03-14

New Design of Image Parallel Processing Model Based on Hadoop Platform

LIU Jun, LI Wei, WU Mengting, CHEN Qifeng

School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China

Online:2019-03-15 Published:2019-03-14

摘要/Abstract

摘要： Hadoop在处理海量小图像数据时，存在输入分片过多以及海量小图像存储问题。针对这些问题，不同于采用HIPI、SequenceFile等方法，提出了一个新型图像并行处理模型。利用Hadoop适合处理纯文本数据的特性，本模型使用存储了图像路径的文本文件替换图像数据作为输入，不需要设计图像数据类型。在Map阶段直接完成图像的读取、处理、存储过程。为了简化图像处理算法，将OpenCV和Map函数结合并设计了对应的存储方法，实现小图像文件的存储。实验表明，在Hadoop分布式系统平台下，模型不论在小数据量还是在大数据量的测试数据环境中，都具有良好的吞吐性能和稳定性。

关键词: Hadoop, 并行计算框架（MapReduce）, 图像处理, OpenCV

Abstract: While dealing with huge amount of small image data, Hadoop has the problems of managing the excessive fragmentation of the inputs and saving the rapid growth of small image files. In view of solving these problems, the solution of a new mass small image parallel processing model is proposed and implemented, and is different from the methods such as HIPI and SequenceFile. For Hadoop is suitable for the text-only data processing, the image data is replaced by the text file that stores the image path as input, and the model does not need to design image data types. The functions such as image reading, image processing, image storage are completed in the Map stage of Hadoop. And to simplify the image processing algorithms, the OpenCV functions are combined with the Map function and the corresponding storage method is designed to accommodate the storage of small image files. Experimental results show that, the model has good performance on throughput test and good stability wherever the test data is the small amount of data or large amount of data in Apache Hadoop system.

Key words: Hadoop, MapReduce, image processing, OpenCV

刘军，李威，吴梦婷，陈起凤. Hadoop平台下新型图像并行处理模型设计[J]. 计算机工程与应用, 2019, 55(6): 186-190.

LIU Jun, LI Wei, WU Mengting, CHEN Qifeng. New Design of Image Parallel Processing Model Based on Hadoop Platform[J]. Computer Engineering and Applications, 2019, 55(6): 186-190.

[1]	冉蓉，徐兴华，邱少华，崔小鹏，欧阳斌. 基于深度卷积神经网络的裂纹检测方法综述[J]. 计算机工程与应用, 2021, 57(9): 23-35.
[2]	张波，徐黎明，黄志伟，要小鹏. 梯度策略的多目标GANs帕累托最优解算法[J]. 计算机工程与应用, 2021, 57(9): 89-95.
[3]	杨培伟，周余红，邢岗，田智强，许夏瑜. 卷积神经网络在生物医学图像上的应用进展[J]. 计算机工程与应用, 2021, 57(7): 44-58.
[4]	胡文涛，陈秀宏. 基于邻域图的低秩投影学习[J]. 计算机工程与应用, 2021, 57(7): 209-214.
[5]	云旭，宋焕生，梁浩翔，侯景严，戴喆. 基于深度学习的关键岗位人员行为分析系统[J]. 计算机工程与应用, 2021, 57(6): 225-231.
[6]	卢苇，刘丹，邵敏，吴扬东. 改进Mask R-CNN网络在医学图像识别与分割中的应用[J]. 计算机工程与应用, 2021, 57(24): 234-241.
[7]	张德，林青宇，郭茂祖. 单幅图像超分辨重建的深度学习方法综述[J]. 计算机工程与应用, 2021, 57(22): 28-41.
[8]	李祥霞，谢娴，李彬，尹华，许波，郑心炜. 生成对抗网络在医学图像处理中的应用[J]. 计算机工程与应用, 2021, 57(18): 24-37.
[9]	林本丰，王呈，孙悦程. 融合LSD算法与深度学习的开关状态检测方法[J]. 计算机工程与应用, 2021, 57(17): 181-189.
[10]	许洋洋，李伟，王杰. 利用事件和期限驱动对机器人延时的优化[J]. 计算机工程与应用, 2021, 57(17): 260-268.
[11]	吴东阳，窦建平，李俊. 四旋翼飞行器的数字孪生系统设计[J]. 计算机工程与应用, 2021, 57(16): 237-244.
[12]	李雷孝，邓丹，李杰，王永生. 基于粒子群优化的全比较计算数据分发策略[J]. 计算机工程与应用, 2021, 57(15): 109-117.
[13]	王滢暄，宋焕生，梁浩翔，余宵雨，云旭. 基于改进的YOLOv4高速公路车辆目标检测研究[J]. 计算机工程与应用, 2021, 57(13): 218-226.
[14]	陈少洁，赵淦森，林成创，彭璟，黄凯信，李壮伟，黄润桦，杜嘉华，樊小毛. 深度学习在染色体分割中的应用综述[J]. 计算机工程与应用, 2021, 57(11): 46-56.
[15]	胡文涛，陈秀宏. 基于局部保持投影的鲁棒稀疏子空间学习[J]. 计算机工程与应用, 2021, 57(10): 194-199.

Hadoop平台下新型图像并行处理模型设计

New Design of Image Parallel Processing Model Based on Hadoop Platform

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics