Computer Engineering and Applications ›› 2019, Vol. 55 ›› Issue (14): 87-97.DOI: 10.3778/j.issn.1002-8331.1806-0002

Previous Articles     Next Articles

Greybox Fuzzing Adaptive Technology Research

XU Hang, ZHAO Shibin, ZHU Junhu, PENG Jianshan, LIN Hongyang   

  1. State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450002, China
  • Online:2019-07-15 Published:2019-07-11

灰盒模糊测试自适应技术研究

许  航,赵世斌,朱俊虎,彭建山,林宏阳   

  1. 数学工程与先进计算国家重点实验室,郑州 450002

Abstract: This paper analyzes the problem of inadaptability in various stages of greybox fuzzing under different test environments, and proposes self-adaptive techniques to solve the problems existing in feedback acquisition, feedback processing and sample generation. Aiming at the defects of format destruction existing in the sample generation mechanism, resulting in low test efficiency, a sample format repair technique is proposed. For the problem that some defects in the feedback acquisition mechanism result in missed reports of some evolutionary samples, a trace bitmap adaptation technique is proposed. For the flaws in the feedback processing mechanism, resulting in a lower efficiency of the preferred corpus, a corpus selecting adaption technique is proposed. Based on the theory proposed in this paper, a demo system is realized. The efficiency of code branch discovery is improved by more than 20% when it tests the software such as MathType, unzip, binutils, etc. And the problem of missing reports of some evolutionary samples is solved. Some commercial software vulnerabilities have been discovered based on the technology proposed in this paper.

Key words: greybox fuzzing test, adaptive technology, sample format repair technique, trace bitmap, corpus selecting

摘要: 分析了在不同测试环境下灰盒模糊测试各个阶段的不适应问题,针对反馈获取、反馈处理和样本生成三个阶段机制存在的问题分别提出了自适应技术进行解决。针对样本生成机制存在的格式破坏缺陷,导致测试效率低下的问题,提出了样本格式修复技术;针对反馈获取机制存在的缺陷,导致部分进化样本被漏报的问题,提出了路径追踪位图自适应技术;针对反馈处理机制存在的缺陷,导致优选语料库效率较低的问题,提出了语料选择自适应技术。基于提出的理论实现了原型系统,该原型系统在针对MathType、unzip、binutils等软件进行测试时代码分支发现的效率提高了20%以上,解决了部分进化样本的漏报问题,发现了一些商用软件的漏洞。

关键词: 灰盒模糊测试, 自适应技术, 样本格式修复, 路径追踪位图, 语料选择