高性能Benes网络路由求解算法及硬件加速器

doi:10.3778/j.issn.1002-8331.2409-0250

摘要/Abstract

摘要： 光互连网络使用光交叉开关阵列实现光交换。大规模快速光交叉开关阵列多使用紧凑的Benes网络构建，以避免级联过多开关点导致较高物理链路损耗。但对Benes网络的路由求解将引入数百纳秒至数毫秒的开销，产生交换性能瓶颈。为降低此开销并消除性能瓶颈，提出一种利于高性能硬件实现的Benes网络完全重排求解算法，改进了传统Benes网络求解算法的求解次序，提高了并行度。提出基于该算法的硬件加速器，具有优良的频率特性，FPGA版本加速器固定耗时26?ns可完成一次16×16?Benes网络重构求解。通过流水线优化，将连续求解吞吐量提升至700?MOPs。相比现有同类路由求解算法的FPGA实现，其求解速度提升9.85倍，连续求解吞吐量提升2.8倍。若使用ASIC技术而非FPGA构建加速器芯片，预计可将求解耗时降低至与开关阵列重构耗时相仿的水平，彻底消除性能瓶颈。

关键词: Benes网络, 重排求解算法, 并行化, 硬件加速器, FPGA

Abstract: Optical interconnect networks use optical cross-connects (OXC) to enable optical switching. Large-scale fast optical cross-connects tend to apply a compact Benes network to avoid high link loss brought by concatenating too many optical switching points. However, Benes network also brings heavy latency overhead for calculating corresponding statuses of its internal switch points to satisfy given route requests, which usually takes hundreds of nanoseconds to microseconds, thus causing a performance bottleneck. This paper proposes a route-solving algorithm of Benes network to decrease such overhead. The algorithm is designed for high-performance hardware implementation. It re-orders the solving procedure to achieve a higher degree of parallelism, and higher working frequency is guaranteed by its simple design. The FPGA-version of the algorithm only uses 26 ns to solve a route-resolving problem of a 16×16 Benes network. With optimized pipeline, its route-solving throughput is increased to 700 MOPs. The route-resolving speed is 9.85 times faster than other FPGA-versions based on the same kind of algorithm, and the route-resolving throughput is 2.8 times bigger. If ASIC technique is applied to build the accelerator chip instead of FPGA, route-resolving time will match the reconfigure latency of high-speed OXC, thus eliminating the performance bottleneck.

Key words: Benes network, permutation algorithm, parallelization, hardware accelerator, FPGA

秦梦远, 刘宏伟, 郝沁汾. 高性能Benes网络路由求解算法及硬件加速器[J]. 计算机工程与应用, 2025, 61(14): 163-175.

QIN Mengyuan, LIU Hongwei, HAO Qinfen. High-Performance Route-Resolving Algorithm and Hardware Accelerator for Benes Network[J]. Computer Engineering and Applications, 2025, 61(14): 163-175.

参考文献

[1] PARSONSON C, SHABKA Z, CHLUPKA W, et al. Optimal control of SOAs with artificial intelligence for sub-nanosecond optical switching[J]. Journal of Lightwave Technology, 2020, 38(20): 5563-5573.
[2] XU X J, CHEN S W, XU H H, et al. High-speed 2×2 silicon-based electro-optic switch with nanosecond switch time[J]. Chinese Physics B, 2009, 18(9): 3900.
[3] LU L J, ZHAO S Y, ZHOU L J, et al. 16×16 non-blocking silicon optical switch based on electro-optic Mach-Zehnder interferometers[J]. Optics Express, 2016, 24(9): 9295-9307.
[4] QIAO L, TANG W J, CHU T. 32×32 silicon electro-optic switch with built?in monitors and balanced-status units[J]. Scientific Reports, 2017, 7(1): 1-7.
[5] QIAO L, TANG W J, CHU T. Ultra-large-scale silicon optical switches[C]//Proceedings of the IEEE 13th International Conference on Group IV Photonics, 2016: 1-2.
[6] BENE? V. Mathematical theory of connecting networks and telephone traffic[M]. Pittsburgh: Academic Press, 1965.
[7] WANG B, LIU L B, DENG C C, et al. Exploration of benes network in cryptographic processors: a random infection countermeasure for block ciphers against fault attacks[J]. IEEE Transactions on Information Forensics and Security, 2016, 12(2): 309-322.
[8] LIPMAA H. Efficient NIZK arguments via parallel verification of benes networks[C]//Proceedings of the 9th International Conference on Security and Cryptography for Networks , 2014: 416-434.
[9] KYNIGOS M, PASCUAL J, NAVARIDAS J, et al. Power and energy efficient routing for Mach-Zehnder interferometer based photonic switches[C]//Proceedings of the ACM International Conference on Supercomputing, 2021: 177-189.
[10] LO C, SHAM C, FU C. Novel CNN accelerator design with dual benes network architecture[J]. IEEE Access, 2023, 11: 59524-59529.
[11] WAKSMAN A. A permutation network[J]. Journal of the ACM, 1968, 15(1): 159-163.
[12] NIKOLAIDIS D, GROUMAS P, KOULOUMENTAS C, et al. Novel Benes network routing algorithm and hardware implementation[J]. Technologies, 2022, 10(1): 16.
[13] AGHAKHANI K, KARIMI A. A novel routing algorithm in Benes networks[J]. International Journal of Educational Advancement, 2016, 7(1): 168-177.
[14] JIANG Y, YANG M. Hardware implementation of parallel algorithm for setting up Benes networks[C]//Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2016: 10-16.
[15] KOLOKO L, MATSUMOTO T, OBARA H. Design and implementation of fast and hardware‐efficient parallel processing elements to set full and partial permutations in Bene? networks[J]. The Journal of Engineering, 2021, 2021(6): 312-320.
[16] 高飞, 李红燕, 戴紫彬, 等. 可重构比特置换网络配置信息提取算法研究[J]. 计算机应用研究, 2010, 27(5): 1867-1870.
GAO F, LI H Y, DAI Z B, et al. Research on reconfigurable bit permutation network configuration information extraction[J]. Application Research of Computers, 2010, 27(5): 1867-1870.
[17] KARIMI A, AGHAKHANI K, MANAVI S E, et al. Introduction and analysis of optimal routing algorithm in Benes networks[J]. Procedia Computer Science, 2014, 42: 313-319.
[18] CHAKRABARTY A, COLLIER M, MUKHOPADHYAY S. Matrix-based nonblocking routing algorithm for Bene? networks[C]//Proceedings of the Computation World: Future Computing, Service Computation, 2009: 551-556.
[19] ZULFIN M, PINEM M, DIVO A, et al. Performance comparison of matrix-based and division algorithms implemented on the 8×8 Benes network[C]//Proceedings of the 5th International Conference on Electrical, Telecommunication and Computer Engineering , 2021: 24-27.
[20] NASSIMI D, SAHNI S. Parallel algorithms to set up the Benes permutation network[J]. IEEE Transactions on Computers, 1982, 31(2): 148-154.
[21] LEE K. A new Benes network control algorithm[J]. IEEE Transactions on Computers, 1987, 100(6): 768-772.
[22] KHAN I, CHALONY M, GHILLINO E, et al. Machine learning assisted abstraction of photonic integrated circuits in fully disaggregated transparent optical networks[C]//Proceedings of the 22nd International Conference on Transparent Optical Networks , 2020: 1-4.
[23] KHAN I, TUNESI L, MASOOD M, et al. Automatic management of N×N photonic switch powered by machine learning in software-defined optical transport[J]. IEEE Open Journal of the Communications Society, 2021, 2: 1358-1365.
[24] ZECHARIA R, SHAVITT Y. A Parallel algorithm and scalable architecture for routing in Bene? networks[C]//Proceedings of the IEEE Conference on Computer Communications, 2024: 921-930.
[25] BOUTROS A, BETZ V. FPGA architecture: principles and progression[J]. IEEE Circuits and Systems Magazine, 2021, 21(2): 4-29.