Atom action recognition by multi-dimensional adaptive 3D convolutional neural networks

doi:10.3778/j.issn.1002-8331.1707-0347

Computer Engineering and Applications, 2018, Vol. 54, Issue (4): 174-178. DOI: 10.3778/j.issn.1002-8331.1707-0347

GAO Dapeng (高大鹏), ZHU Jiangang (朱建刚)

School of Computer, Civil Aviation Flight University of China, Guanghan, Sichuan 618307, China

Online: 2018-02-15  Published: 2018-03-07

Abstract

Abstract: Existing action recognition algorithms based on 3D Convolutional Neural Networks (3DCNN) split the input video into clips of fixed length, so a clip may carry redundant or incomplete action information. To address this problem, human atom actions are first defined from the characteristics of human motion particle trajectories. The video is then divided using the length of each atom action as the clip length, so that every clip contains the complete information of one action. However, 3DCNN requires all input data to have the same dimensions, whereas atom action clips differ in length. To resolve this conflict, the spatial pyramid pooling technique is improved into 3D Spatial Pyramid Pooling (3D SPP), which can process clips of different lengths. The SPP layer is placed before the fully connected layers, where it converts the variable-length feature maps output by the 3DCNN convolutional layers into feature vectors of the same length. Compared with related algorithms, the experimental results show that the proposed algorithm places lower requirements on the input data and, because each clip carries complete information, achieves a significantly higher recognition rate.

Key words: action recognition, video analysis, 3D spatial pyramid pooling, atom action, 3D convolutional neural networks
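
The role that the abstract assigns to the SPP layer, turning feature maps of different lengths into vectors of one fixed length, can be illustrated with a minimal sketch. It shows generic 3D spatial pyramid max pooling on a single-channel feature map stored as a nested list; it is not the authors' improved 3D SPP, whose details the abstract does not give. The function names, the pyramid levels `(1, 2, 4)`, and the synthetic feature maps are all assumptions made for illustration.

```python
from itertools import product


def bin_edges(size, bins):
    """Split range(size) into `bins` non-empty, possibly overlapping windows."""
    return [((i * size) // bins, ((i + 1) * size + bins - 1) // bins)
            for i in range(bins)]


def spp3d_max(volume, levels=(1, 2, 4)):
    """Max-pool a [T][H][W] nested list into a fixed-length vector.

    For each pyramid level n the volume is divided into n*n*n cells and the
    maximum of each cell is kept, so the output length is sum(n**3 for n in
    levels) for any input with T, H, W >= 1. The levels are an assumption.
    """
    t, h, w = len(volume), len(volume[0]), len(volume[0][0])
    features = []
    for n in levels:
        for (t0, t1), (h0, h1), (w0, w1) in product(
                bin_edges(t, n), bin_edges(h, n), bin_edges(w, n)):
            features.append(max(
                volume[a][b][c]
                for a in range(t0, t1)
                for b in range(h0, h1)
                for c in range(w0, w1)))
    return features


def make_volume(t, h, w):
    """Hypothetical feature map whose values grow with the flat index."""
    return [[[a * h * w + b * w + c for c in range(w)]
             for b in range(h)] for a in range(t)]


# Two clips of different temporal length yield vectors of the same length.
short_clip = spp3d_max(make_volume(5, 6, 6))
long_clip = spp3d_max(make_volume(13, 6, 6))
print(len(short_clip), len(long_clip))  # prints "73 73"
```

Because the number of cells depends only on the pyramid levels, a fully connected layer placed after such a pooling step sees a constant input size whatever the clip length. In a real network the same pooling would be applied to every channel and the results concatenated.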

GAO Dapeng, ZHU Jiangang. Atom action recognition by multi-dimensional adaptive 3D convolutional neural networks[J]. Computer Engineering and Applications, 2018, 54(4): 174-178.
