Computer Engineering and Applications ›› 2012, Vol. 48 ›› Issue (31): 30-35.

Previous Articles     Next Articles

Asynchronous optical packet switching based high performance computing system

ZHAO Jun1, XU Liguo1, XU Lei1, SUN Xiaohan2   

  1. 1.R & D Center, Nanjing Nari-Relays Electric Co., LTD., Nanjing 211102, China
    2.School of Electronics Engineering, Southeast University, Nanjing 210096, China
  • Online:2012-11-01 Published:2012-10-30

基于异步光分组交换的高性能计算机系统

赵  俊1,许立国1,须  雷1,孙小菡2   

  1. 1.南京南瑞继保电气有限公司 研发中心,南京 211102
    2.东南大学 电子科学与工程学院,南京 210096

Abstract: An asynchronous optical-packet-switching and optical-multicast based High Performance Computing System(HPCS) with multistage distributed management features is presented, which can process the massive burst services and realize all-optical scalability. A compact and huge-capacity Recycling-Fiber-Delay-Line(Rec-FDL) based collision resolution mechanism is proposed to resolve the contentions for asynchronous arriving packets, and its performances are compared with the traditional Packet Retransmission Algorithm(PRA). The system analysis models for the stabilities, packet loss rates and average packet waiting latencies of the PRA(Trerans) and Rec-FDL(TRec-FDL) schemes are built. The simulations show that, for a two-stage system with 40 320 CPUs, TRec-FDL is fixed as 0.004 ns, while Trerans increases with the packet transmission distance D, and equals to 0.006 ns and 0.01ns when D is 10 m and 20 m, respectively. Hence, as the HPCS is typically highly distributed, the Rec-FDL scheme can induce lesser latencies than the PRA.

Key words: High Performance Computing System(HPCS), asynchronous optical packet switching, optical multicast, Recycling-Fiber-Delay-Line(Rec-FDL), Packet Retransmission Algorithm(PRA), latency

摘要: 基于异步光分组交换及光组播技术,提出一种新型高性能计算机系统(HPCS)。采用分布式控制结构进行系统的分级管理,有利于处理大批量突发业务及系统的全光多级扩展。基于循环光纤延时线(Rec-FDL),提出一种紧凑、大容量的异步全光分组冲突解决机制,并与传统的分组重传冲突解决算法(PRA)进行了性能比较;建立了系统的稳定性、丢包率及时延分析模型。仿真结果表明,对于一个能够实现40 320个CPU互连的两级系统,Rec-FDL机制的平均分组等待时延TRec-FDL约为0.004 ns,而PRA算法的平均分组等待时延Trerans随分组传输距离D增加而增加,在D为10 m和20 m时,Trerans分别等于0.006 ns和0.01 ns。因此,对于分布式HPCS而言,Rec-FDL机制具有更优秀的时延特性。

关键词: 高性能计算机系统, 异步光分组交换, 光组播, 循环光纤延时线, 分组重传算法, 时延