计算机工程与应用 ›› 2008, Vol. 44 ›› Issue (19): 140-141.

• 数据库、信号与信息处理 • 上一篇    下一篇

用改进的游程编码方法压缩值索引

陈富强1,2,奚建清1   

  1. 1.华南理工大学 计算机科学与工程学院,广州 510641
    2.广东技术师范学院 商务系,广州 510262
  • 收稿日期:2008-02-25 修回日期:2008-04-23 出版日期:2008-07-01 发布日期:2008-07-01
  • 通讯作者: 陈富强

Compress value-list index using improved run-length encode approach

CHEN Fu-qiang1,2,XI Jian-qing1   

  1. 1.School of Computer Science & Engineering,South China University of Technology,Guangzhou 510641,China
    2.Department of Business,Guangdong Polytechnic Normal University,Guangzhou 510262,China
  • Received:2008-02-25 Revised:2008-04-23 Online:2008-07-01 Published:2008-07-01
  • Contact: CHEN Fu-qiang

摘要: 为了进一步提高采用游程编码方法压缩值索引的效率,针对原有方法的不足之处,采取了两个改进措施,一是在索引前排序被索引的数据表使得值索引中的0串和1串更长,二是改进现有编码算法使之可以同时压缩0串和1串。实验结果表明,采用改进后方法压缩值索引形成的压缩文件体积仅为采用原有方法时的11.83%。

Abstract: To compress value-list index more efficiently,this paper analyzes the disadvantages of the traditional run-length encode approach,and then presents two corresponding improvements,one is to sort the data table to prolong the 0’s and 1’s in the value-list index vectors,the other is to modify the encoding algorithm to compress the 0’s and 1’s at the same time.Experimental results show that the volume of the compressed file when new approach adapted is only 11.83% of that when traditional approach adapted.