Performance evaluation and optimization of cache on fused CPU-GPU architecture

doi:10.3778/j.issn.1002-8331.1503-0333

Abstract

Abstract: The development of CPUs and GPUs has reached a new bottleneck, and integrating both on the same chip has become a popular architectural trend. These heterogeneous architectures put more pressure on the management of shared on-chip resources. In particular, management of the shared last-level cache (LLC) is critical to performance, and the differing characteristics of CPU and GPU applications make it newly challenging. This paper analyzes the memory access characteristics of GPU applications and, drawing on previous cache management schemes, proposes two static partitioning schemes for the LLC on the fused architecture: half-to-half partitioning and optimal partitioning. Experimental results indicate that static cache partitioning effectively avoids interference between CPU and GPU applications. Compared with LRU, half-to-half and optimal partitioning improve performance by 7.68% and 11.68%, respectively.

Key words: heterogeneous architecture, fusion, shared last-level cache, static cache partition
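
The contrast the abstract draws between a shared LRU cache and a statically partitioned one can be illustrated with a toy model. The sketch below is not the paper's simulator or method: it models a single cache set, uses a made-up trace in which the CPU reuses four lines while the GPU streams through data with no reuse, and assumes that an "optimal" static partition can be found by trying every split and keeping the one with the most hits. All names (LRUSet, run, make_trace, best_static_split) are hypothetical.

```python
from collections import OrderedDict


class LRUSet:
    """One cache set with a fixed number of ways and LRU replacement."""

    def __init__(self, ways):
        self.ways = ways
        self.lines = OrderedDict()  # keys are tags, ordered oldest first

    def access(self, tag):
        """Return True on a hit; on a miss, fill the line and return False."""
        if tag in self.lines:
            self.lines.move_to_end(tag)  # mark as most recently used
            return True
        if len(self.lines) >= self.ways:
            self.lines.popitem(last=False)  # evict the least recently used line
        self.lines[tag] = None
        return False


def run(trace, total_ways, cpu_ways=None):
    """Count hits per source.

    cpu_ways=None models one LRU set shared by both sources; otherwise the
    CPU owns cpu_ways ways and the GPU owns the remaining ones.
    """
    if cpu_ways is None:
        shared = LRUSet(total_ways)
        sets = {"cpu": shared, "gpu": shared}
    else:
        sets = {"cpu": LRUSet(cpu_ways), "gpu": LRUSet(total_ways - cpu_ways)}
    hits = {"cpu": 0, "gpu": 0}
    for source, tag in trace:
        # Prefix the tag with its source so CPU and GPU lines never alias.
        if sets[source].access((source, tag)):
            hits[source] += 1
    return hits


def make_trace(rounds=100):
    """Hypothetical trace: CPU reuses 4 lines, GPU streams 8 new lines per round."""
    trace, stream = [], 0
    for _ in range(rounds):
        for tag in range(4):
            trace.append(("cpu", tag))
        for _ in range(8):
            trace.append(("gpu", stream))
            stream += 1
    return trace


def best_static_split(trace, total_ways):
    """Assumed search: try every split that gives each side at least one way."""
    return max(range(1, total_ways),
               key=lambda w: sum(run(trace, total_ways, w).values()))


trace = make_trace()
shared_hits = run(trace, total_ways=8)             # shared LRU
half_hits = run(trace, total_ways=8, cpu_ways=4)   # half-to-half partition
best_cpu_ways = best_static_split(trace, total_ways=8)
print(shared_hits, half_hits, best_cpu_ways)
```

In this toy trace the streaming GPU accesses evict the CPU's working set from the shared set before it is reused, while a fixed partition shields the CPU's ways from that interference. The numbers it prints say nothing about the paper's measured results.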

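Abstract (translated from the Chinese): CPU and GPU development has hit a new bottleneck, and "combining" the two on a single chip has become a new trend. This new heterogeneous architecture puts pressure on the management of shared on-chip resources, and management of the shared last-level cache (LLC) is critical to performance. The differing characteristics of CPU and GPU programs bring new challenges to managing the LLC shared between them. By analyzing the memory access characteristics of GPU programs and drawing on earlier cache management schemes, the paper proposes equal static partitioning and optimal static partitioning of the LLC in a fused CPU-GPU system. Experimental results show that cache partitioning effectively avoids interference between CPU and GPU programs. Compared with the traditional LRU policy, equal static partitioning and optimal static partitioning improve overall system performance by 7.68% and 11.62%, respectively.

Keywords (translated from the Chinese): heterogeneous architecture, fusion, shared last-level cache, static cache partitioning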

SUN Chuanwei, AN Hong, SUN Sun, CHEN Junshi. Performance evaluation and optimization of cache on fused CPU-GPU architecture[J]. Computer Engineering and Applications, 2017, 53(2): 47-52.

孙传伟，安虹，孙荪，陈俊仕. CPU-GPU融合架构上的缓存性能分析与优化[J]. 计算机工程与应用, 2017, 53(2): 47-52.
