Exploring Message Passing Method Between Spark Tasks

doi:10.3778/j.issn.1002-8331.2112-0182

Abstract

Engineering and scientific research face the dual challenge of big data processing and high-performance computing. Spark, a distributed processing framework built on in-memory computing, is widely used in academia and industry. However, its MapReduce-like programming model provides no communication between tasks, so numerical algorithms in scientific computing cannot be implemented efficiently. To address this problem, this paper proposes a computing system that combines Spark's in-memory computing model with MPI message passing, taking full advantage of fast memory access and MPI's multiple high-performance communication mechanisms. The system both compensates for the limited expressiveness of the Spark programming model and gives MPI a data-oriented DAG computation method. Spark's internal runtime environment and scheduling strategy are modified so that MPI integrates seamlessly into Spark, yielding a unified in-memory computing system for high-performance computing and big data processing tasks. Tests show that the system improves the performance of numerical computation and iterative algorithms by at least 50% compared with Spark.

Key words: Spark, MPI, scientific computing, in-memory computing, iterative algorithm

XIA Libin, LIU Xiaoyu, SUN Wei, JIANG Xiaowei, SUN Gongxing. Exploring Message Passing Method Between Spark Tasks[J]. Computer Engineering and Applications, 2022, 58(21): 91-97.

夏立斌, 刘晓宇, 孙玮, 姜晓巍, 孙功星. Spark任务间消息传递方法研究[J]. 计算机工程与应用, 2022, 58(21): 91-97.
