计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (22): 1-17.DOI: 10.3778/j.issn.1002-8331.2404-0372

• 热点与综述 • 上一篇    下一篇

轻量化深度卷积神经网络设计研究进展

周志飞,李华,冯毅雄,陆见光,钱松荣,李少波   

  1. 1.贵州大学 省部共建公共大数据国家重点实验室,贵阳 550025
    2.清华大学 机械工程系,北京 100084
  • 出版日期:2024-11-15 发布日期:2024-11-14

Research Progress on Designing Lightweight Deep Convolutional Neural Networks

ZHOU Zhifei, LI Hua, FENG Yixiong, LU Jianguang, QIAN Songrong, LI Shaobo   

  1. 1.State Key Laboratory of Public Big Data, Guizhou University, Guiyang 550025, China
    2.Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China
  • Online:2024-11-15 Published:2024-11-14

摘要: 轻量化设计是解决深度卷积神经网络(deep convolutional neural network,DCNN)对设备性能和硬件资源依赖性的流行范式,轻量化的目的是在不牺牲网络性能的前提下,提高计算速度和减少内存占用。综述了DCNN的轻量化设计方法,着重回顾了近年来DCNN的研究进展,包括体系设计和模型压缩两大轻量化策略,深入比较了这两类方法的创新性、优势与局限性,并探讨了支撑轻量化模型的底层框架。此外,对轻量化网络已经成功应用的场景进行了描述,并对DCNN轻量化的未来发展趋势进行了预测,旨在为深度卷积神经网络的轻量化研究提供有益的见解和参考。

关键词: 深度卷积神经网络, 轻量化, 体系设计, 模型压缩

Abstract: Lightweight design is a popular paradigm to address the dependence of deep convolutional neural network (DCNN) on device performance and hardware resources, and the purpose of lightweighting is to increase the computational speed and reduce the memory footprint without sacrificing the network performance. An overview of lightweight design approaches for DCNNs is presented, focusing on a review of the research progress in recent years, including two major lightweighting strategies, namely, system design and model compression, as well as an in-depth comparison of the innovativeness, strengths and limitations of these two types of approaches, and an exploration of the underlying framework that supports the lightweighting model. In addition, scenarios in which lightweight networks have been successfully applied are described, and predictions are made for the future development trend of DCNN lightweighting, aiming to provide useful insights and references for the research on lightweight deep convolutional neural networks.

Key words: deep convolutional neural network, lightweight, architecture design, model compression