计算机工程与应用 ›› 2016, Vol. 52 ›› Issue (2): 1-6.

• 理论与研发 • 上一篇    下一篇

一种用于虚拟集群的镜像按需分发框架

刘圣卓,姜进磊,杨广文   

  1. 1.清华大学 计算机科学与技术系,北京 100084
    2.清华信息科学与技术国家实验室(筹),北京 100084
  • 出版日期:2016-01-15 发布日期:2016-01-28

On-demand image distribution framework for virtual clusters

LIU Shengzhuo, JIANG Jinlei, YANG Guangwen   

  1. 1.Department of Computer Science & Technology, Tsinghua University, Beijing 100084, China
    2.Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
  • Online:2016-01-15 Published:2016-01-28

摘要: 虚拟集群部署过程中,大量的镜像数据传输导致启动时间过长,可扩展性和效率受到影响。提出了镜像按需分发框架,根据虚拟机启动和运行过程中只对有限数据块访问的特性,仅传输虚拟机启动和运行所需数据块,避免了存储节点和网络瓶颈问题。进一步,提出了差异化的数据块划分方法,兼顾了镜像读写性能和分发效率:去冗余处理采用大数据块,提升管理效率和存取性能;采用小数据块划分启动区数据,提高传输效率。对于50个镜像的分发实验表明,相比于全镜像分发,按需分发速度可以提升13~22倍。

null

关键词: 虚拟集群, 虚拟机, 镜像分发, 按需分发

Abstract: In the process of virtual clusters deployment, transmission of mass image data leads to longer boot time, and the scalability and efficiency are affected. An on-demand image distribution framework is proposed to reduce data transmission and avoid the bottlenecks of storage nodes and network based on the character that virtual machines only access limited data blocks in booting and running. Further, differentiated data block division method is proposed that takes into account the image accessing performance and distribution efficiency:the bigger division block size is used in deduplication processing to enhance index management efficiency and data accessing performance, whereas data in the boot region is divided with smaller data block size to improve data transmission efficiency. Distribution experiment of 50 image files shows that, compared with the full-image distribution, on-demand distribution can improve the distribution speed of 13~22 times.

Key words: virtual clusters, virtual machine, image distribution, on-demand distribution