Three dimensional CPML-FDTD parallel algorithm based on CUDA

Computer Engineering and Applications, 2011, Vol. 47, Issue 25: 220-223.

• Engineering and Applications •


HU Yuan, LI Kang, KONG Fanmin, DU Liuge

Department of Radio Physics, School of Information Science and Engineering, Shandong University, Jinan 250100, China

Received:1900-01-01 Revised:1900-01-01 Online:2011-09-01 Published:2011-09-01


Abstract

The Finite-Difference Time-Domain (FDTD) algorithm is very time-consuming when simulating electrically large objects. To overcome this drawback, a three-dimensional parallel FDTD algorithm based on the Compute Unified Device Architecture (CUDA) is implemented. It exploits the inherent parallelism of FDTD together with General-Purpose Graphics Processing Unit (GPGPU) techniques, and adopts the Convolutional Perfectly Matched Layer (CPML) absorbing boundary. The algorithm is then further optimized by combining the characteristics of FDTD and CUDA. Compared with a CPU of the same period, the proposed algorithm is accurate and fast: when the simulation domain contains more than 100 thousand Yee cells, the speedup reaches 10 before optimization and more than 25 after optimization, which shows that the algorithm is well suited to simulating electrically large objects.

Key words: Finite-Difference Time-Domain (FDTD), parallel computing, Convolutional Perfectly Matched Layer (CPML), Compute Unified Device Architecture (CUDA), General-Purpose Graphics Processing Unit (GPGPU), acceleration
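The parallel property of FDTD mentioned in the abstract can be illustrated with a toy emulation. The sketch below is not the authors' implementation: it uses plain Python rather than CUDA, normalized units, a hypothetical 8×8×8 grid, and it omits the CPML boundary entirely (edge cells are simply skipped by a guard). The names `h_thread`, `e_thread`, `launch`, and `run` are illustrative assumptions. Each "thread" function does the work a single GPU thread might do for one Yee cell.

```python
import random

N = 8      # cells per axis (hypothetical toy size)
S = 0.5    # Courant number c*dt/dx; must stay below 1/sqrt(3) in 3D

def new_field():
    """Flat array for one field component, as device memory would be laid out."""
    return [0.0] * (N * N * N)

def at(i, j, k):
    """Linear offset of cell (i, j, k)."""
    return (i * N + j) * N + k

def decode(t):
    """Map a linear 'thread' index back to (i, j, k)."""
    return t // (N * N), (t // N) % N, t % N

def h_thread(t, E, H):
    """One thread: update Hx, Hy, Hz of a single cell from neighbouring E."""
    i, j, k = decode(t)
    if i >= N - 1 or j >= N - 1 or k >= N - 1:
        return  # bounds guard; a real code would apply CPML near the edges
    Ex, Ey, Ez = E
    Hx, Hy, Hz = H
    c = at(i, j, k)
    Hx[c] -= S * ((Ez[at(i, j + 1, k)] - Ez[c]) - (Ey[at(i, j, k + 1)] - Ey[c]))
    Hy[c] -= S * ((Ex[at(i, j, k + 1)] - Ex[c]) - (Ez[at(i + 1, j, k)] - Ez[c]))
    Hz[c] -= S * ((Ey[at(i + 1, j, k)] - Ey[c]) - (Ex[at(i, j + 1, k)] - Ex[c]))

def e_thread(t, E, H):
    """One thread: update Ex, Ey, Ez of a single cell from neighbouring H."""
    i, j, k = decode(t)
    if i < 1 or j < 1 or k < 1:
        return  # bounds guard
    Ex, Ey, Ez = E
    Hx, Hy, Hz = H
    c = at(i, j, k)
    Ex[c] += S * ((Hz[c] - Hz[at(i, j - 1, k)]) - (Hy[c] - Hy[at(i, j, k - 1)]))
    Ey[c] += S * ((Hx[c] - Hx[at(i, j, k - 1)]) - (Hz[c] - Hz[at(i - 1, j, k)]))
    Ez[c] += S * ((Hy[c] - Hy[at(i - 1, j, k)]) - (Hx[c] - Hx[at(i, j - 1, k)]))

def launch(kernel, E, H, order):
    """Emulate a kernel launch: one call per cell, in the given order."""
    for t in order:
        kernel(t, E, H)

def run(steps, shuffle=False, seed=0):
    """Leapfrog time stepping: all H cells, then all E cells, per step."""
    E = (new_field(), new_field(), new_field())
    H = (new_field(), new_field(), new_field())
    rng = random.Random(seed)
    for n in range(steps):
        if n == 0:
            E[2][at(N // 2, N // 2, N // 2)] = 1.0  # impulse on Ez at the centre cell
        order = list(range(N ** 3))
        if shuffle:
            rng.shuffle(order)  # arbitrary execution order, like GPU scheduling
        launch(h_thread, E, H, order)
        launch(e_thread, E, H, order)
    return E, H
```

Within one launch, every cell reads only the other field and writes only its own entry, so `run(5)` and `run(5, shuffle=True)` produce identical fields. That order independence is what allows one GPU thread per Yee cell; the CUDA-specific optimizations the abstract refers to are not modelled here.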

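Abstract (translated from the Chinese): To address the heavy time cost of applying the Finite-Difference Time-Domain (FDTD) algorithm to the simulation of electrically large targets, a three-dimensional parallel FDTD method based on the Compute Unified Device Architecture (CUDA) is implemented by exploiting the parallelism of FDTD and General-Purpose Graphics Processing Unit (GPGPU) technology. A Convolutional Perfectly Matched Layer (CPML) absorbing boundary condition is used to model open space, and targets with different numbers of grid cells are simulated. The method is further optimized by combining the characteristics of FDTD and CUDA. When the number of cells in the computational domain is on the order of 100,000 or more, GPU computation achieves speedups of more than 10 before optimization and more than 25 after optimization, relative to a CPU of the same period. The results show that the method is well suited to simulating practical electromagnetic problems.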


HU Yuan, LI Kang, KONG Fanmin, DU Liuge. Three dimensional CPML-FDTD parallel algorithm based on CUDA[J]. Computer Engineering and Applications, 2011, 47(25): 220-223.


