计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (3): 48-61.DOI: 10.3778/j.issn.1002-8331.2407-0031

• 热点与综述 • 上一篇    下一篇

三维卷积神经网络方法改进及其应用综述

李泽慧,张琳,山显英   

  1. 北京建筑大学 电气与信息工程学院,北京 102616
  • 出版日期:2025-02-01 发布日期:2025-01-24

Review on Improvement and Application of 3D Convolutional Neural Networks

LI Zehui, ZHANG Lin, SHAN Xianying   

  1. School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 102616, China
  • Online:2025-02-01 Published:2025-01-24

摘要: 三维卷积神经网络作为一种深度神经网络,在计算机视觉领域,特别是视频动作识别方面展现了优异的效果。然而三维卷积神经网络仍存在一些问题,针对这些问题,对现有的基于三维卷积的视频动作识别改进方法进行了总结和分析。在轻量化、特征提取、计算效率、组合模型等方面对三维卷积神经网络的改进进行归纳,并介绍了三维卷积神经网络的实际应用,总结了流行的数据集,并对这些改进方法的实验结果进行了比较和分析。展望了视频动作识别未来的发展方向。

关键词: 三维卷积神经网络(3DCNN), 行为识别, 深度学习

Abstract: 3D convolutional neural network, as a kind of deep neural network, has shown excellent results in the field of computer vision, especially in video action recognition. However, there are still some problems in 3D convolutional neural networks. In order to solve these problems, this paper summarizes and analyzes the existing improved methods of video action recognition based on 3D convolution. The improvement of 3D convolutional neural network is summarized in the aspects of lightweight, feature extraction, computational efficiency, combination model, etc. The practical application of 3D convolutional neural network is introduced, the popular data sets are summarized, and the experimental results of these improved methods are compared and analyzed. Finally, the future development direction of video action recognition is prospected.

Key words: 3D convolutional neural networks (3DCNN), behavior recognition, deep learning