Computer Engineering and Applications ›› 2017, Vol. 53 ›› Issue (6): 91-95. DOI: 10.3778/j.issn.1002-8331.1508-0106

• Big Data and Cloud Computing •

Design and implementation of error-correcting code storage based on MooseFS

LIU Haibo1, DONG Yujie2

  1. School of Electric Engineering and Automation, Henan Polytechnic University, Jiaozuo, Henan 454000, China
  2. Wanfang College of Science & Technology, Henan Polytechnic University, Jiaozuo, Henan 454000, China
  • Online: 2017-03-15 Published: 2017-05-11

Abstract: In cloud storage, data fault tolerance is critical because it directly determines the availability of the whole system. Most current distributed storage systems rely on multiple replicas to keep data available, but replication multiplies the storage space consumed. To reduce that space while still improving availability, some distributed storage systems have begun to adopt error-correcting codes. Based on an analysis of the MooseFS distributed file system, this paper proposes a method for implementing error-correcting code storage on MooseFS. Storage-efficiency tests show that the method keeps frequently used “hot data” stored as multiple replicas and stores infrequently used “cold data” in error-correcting-code form, greatly reducing the space footprint relative to multi-replica storage while preserving reliability.

Key words: distributed file system, data fault-tolerance, MooseFS, error-correcting code
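
The space trade-off summarized in the abstract can be illustrated with a small sketch. The abstract does not state the code family, the coding parameters, or the rule that separates hot from cold data, so everything below is an assumption made for illustration: a single XOR parity fragment stands in for the error-correcting code (the simplest possible choice, not necessarily the one used in the paper), and `choose_layout` with its 30-day threshold is a hypothetical policy, not a MooseFS API.

```python
from functools import reduce


def split_fragments(data: bytes, k: int) -> list[bytes]:
    """Split data into k equal-length fragments, zero-padding the tail."""
    frag_len = -(-len(data) // k)  # ceiling division
    padded = data.ljust(frag_len * k, b"\0")
    return [padded[i * frag_len:(i + 1) * frag_len] for i in range(k)]


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def xor_parity(fragments: list[bytes]) -> bytes:
    """Single parity fragment: the XOR of all data fragments."""
    return reduce(xor_bytes, fragments)


def recover_missing(surviving: list[bytes], parity: bytes) -> bytes:
    """Rebuild the one lost data fragment from the survivors and the parity."""
    return reduce(xor_bytes, surviving, parity)


def overhead_replicas(copies: int) -> float:
    """Bytes stored per byte of user data under plain replication."""
    return float(copies)


def overhead_coded(k: int, m: int) -> float:
    """Bytes stored per byte of user data with k data and m parity fragments."""
    return (k + m) / k


def choose_layout(days_since_access: int, cold_after_days: int = 30) -> str:
    """Hypothetical hot/cold policy; the threshold is an assumed value."""
    return "coded" if days_since_access >= cold_after_days else "replicas"


if __name__ == "__main__":
    fragments = split_fragments(b"cold data block", k=4)
    parity = xor_parity(fragments)
    # Simulate losing fragment 2 and rebuilding it from the rest.
    rebuilt = recover_missing(fragments[:2] + fragments[3:], parity)
    print(rebuilt == fragments[2])
    print(overhead_replicas(3), overhead_coded(4, 1))
```

Under these assumed parameters, three replicas store 3 bytes per byte of user data, while 4 data fragments plus 1 parity fragment store 1.25 and still survive the loss of any one fragment. Tolerating more simultaneous losses requires a stronger code with more parity fragments, for example Reed-Solomon.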