计算机工程与应用 ›› 2018, Vol. 54 ›› Issue (12): 57-62.DOI: 10.3778/j.issn.1002-8331.1705-0436

• 理论与研发 • 上一篇    下一篇

视频阵列处理器多层次分布式存储结构设计

蒋  林,崔朋飞,山  蕊,武  鑫,田汝佳   

  1. 西安邮电大学 电子工程学院,西安 710121
  • 出版日期:2018-06-15 发布日期:2018-07-03

Design of distributed memory architecture for video array processor

JIANG Lin, CUI Pengfei, SHAN Rui, WU Xin, TIAN Rujia   

  1. School of Electronic Engineering, Xi’an University of Posts & Telecommunications, Xi’an 710121, China
  • Online:2018-06-15 Published:2018-07-03

摘要: 随着视频编解码标准的不断演进,算法处理的数据量也随之剧增。多核结构并行化处理技术在提升算法计算速度的同时,使得存储结构成为了整个编解码系统性能的瓶颈。针对视频编解码算法访存的局部性、各算法之间数据交互频繁性、算法内部大量临时数据不交互性的特点,设计并实现了由私有存储层和共享存储层构成的多层次分布式存储结构。通过Xilinx公司的Virtex-6系列xc6vlx550T开发板对设计进行测试,实验结果表明,该结构在保持简洁性和可扩展性的同时,最高可提供9.73 GB/s的访存带宽,能够满足视频编解码算法数据访存的需求。

关键词: 视频阵列处理器, 分布式存储结构, 目录协议, 高速缓存, 层次化

Abstract: With the continuous development of video coding standards, the amount of data processed by video codec algorithm is also increasing sharply. Although the parallel processing technology of multi-core architecture promote the speed of processing of video codec algorithm, it makes the memory structure becomes the bottleneck of the whole codec system. Aiming at the characteristics of video codec algorithm of the locality accessing data, the high frequency of data exchange between different algorithm, a large number of temporary internal data does not need to interact, this paper designs and implements a multi level distributed storage structure which contains private storage layer and shared storage layer. The design has been tested on the FPGA development board of the company of Xilinx, the experimental results show that the structure not only maintain the simplicity and scalability, but also can provide data access bandwidth which can up to be 9.73 GB/s, meet the needs of video codec algorithm data access.

Key words: video array processor, distributed storage architecture, directory protocol, cache, hierarchical