Atom action recognition by multi-dimensional adaptive 3D convolutional neural networks

doi:10.3778/j.issn.1002-8331.1707-0347

Abstract

Abstract: 3D convolutional neural networks (3DCNN) for action recognition require every input video clip to have the same fixed length, so a clip may either miss part of an action or contain redundant frames. This paper proposes an action recognition algorithm that addresses this limitation. First, a human atom action is defined from the trajectories of human motion particles. The video is then divided into clips whose lengths equal the lengths of the atom actions, so that each clip contains the complete information for one human action. Because these clips differ in length while 3DCNN requires inputs of identical dimensions, the 3D Spatial Pyramid Pooling (3D SPP) technique is improved to handle video data of different lengths. The SPP layer is placed before the fully connected layers of the 3DCNN and outputs representation vectors of the same size. In experiments comparing the method with several related algorithms, it places lower requirements on the input data and achieves a higher recognition rate, which is attributed to the completeness of the information in each clip.
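The way a pyramid pooling layer turns feature maps of different temporal lengths into vectors of one size can be illustrated with a minimal NumPy sketch. It shows generic spatial pyramid max pooling extended to the time axis, not the improved variant proposed in the paper, whose details the abstract does not give. The function names `bin_edges` and `spp3d`, the `(C, T, H, W)` layout, and the pyramid levels `(1, 2, 4)` are all assumptions made for illustration.

```python
import numpy as np


def bin_edges(size, n):
    """Split range(size) into n non-empty (start, stop) windows that cover it."""
    # Floor for the start and ceil for the stop, so neighbouring windows
    # may overlap slightly when size is not a multiple of n.
    return [((i * size) // n, ((i + 1) * size + n - 1) // n) for i in range(n)]


def spp3d(features, levels=(1, 2, 4)):
    """Max-pool a (C, T, H, W) feature map into a fixed-length vector.

    For each pyramid level n, the T, H and W axes are each split into n
    windows and the per-channel maximum of every window is kept, which
    contributes C * n**3 values regardless of the input size.
    """
    _, t, h, w = features.shape
    pooled = []
    for n in levels:
        for t0, t1 in bin_edges(t, n):
            for h0, h1 in bin_edges(h, n):
                for w0, w1 in bin_edges(w, n):
                    window = features[:, t0:t1, h0:h1, w0:w1]
                    pooled.append(window.max(axis=(1, 2, 3)))
    return np.concatenate(pooled)


# Two hypothetical convolutional outputs with different temporal lengths.
rng = np.random.default_rng(0)
short_clip = rng.standard_normal((8, 7, 12, 16))   # 7 time steps
long_clip = rng.standard_normal((8, 19, 12, 16))   # 19 time steps
print(spp3d(short_clip).shape, spp3d(long_clip).shape)
```

Both calls return a vector of length 8 × (1 + 8 + 64) = 584, so the fully connected layers that follow can have a fixed input size even though the clips differ in length.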

Keywords: action recognition, video analysis, 3D spatial pyramid pooling, atom action, 3D convolutional neural networks

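Abstract (translated from the Chinese): Existing action recognition algorithms based on 3D Convolutional Neural Networks (3DCNN) divide the input video into fixed-length clips, whose action information may be redundant or incomplete; a solution to this problem is proposed. Human atom actions are defined using the properties of human motion particle trajectories, and the video is divided using the length of each atom action as the clip length, which yields clips containing complete human actions. 3DCNN requires input data to have identical dimensions, whereas atom-action clips differ in length. To address this, the 3D Spatial Pyramid Pooling (3D SPP) technique is improved so that it can process videos of different lengths. The SPP layer is placed before the fully connected layers, where it processes the variable-length feature maps output by the 3DCNN convolutional layers and outputs feature vectors of the same length. Compared with related algorithms, the experimental data show that the algorithm places lower requirements on the input data and, owing to the completeness of the information in each clip, improves the recognition rate significantly.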
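Why a pyramid pooling layer placed before the fully connected layers yields a fixed-length vector can be seen from a simple count. The notation below is an assumption introduced for illustration and does not come from the paper: a convolutional output has C channels and size T × H × W, and the pyramid has L levels, where level l splits each of the three axes into n_l windows.

```latex
% Window i (i = 0, ..., n-1) along an axis of length T at a level with n windows:
B_i^{(n)} = \left\{ \left\lfloor \tfrac{iT}{n} \right\rfloor, \dots,
            \left\lceil \tfrac{(i+1)T}{n} \right\rceil - 1 \right\}

% One pooled value per channel and per window triple, so the output length is
D = C \sum_{l=1}^{L} n_l^{3}
```

D depends only on C and the chosen levels, not on T, H or W. For example, with C = 8 and levels (1, 2, 4), D = 8 × (1 + 8 + 64) = 584 for a clip of any length.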

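Keywords (translated from the Chinese): action recognition, video analysis, 3D spatial pyramid pooling, atom action, 3D convolutional neural networks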

GAO Dapeng, ZHU Jiangang. Atom action recognition by multi-dimensional adaptive 3D convolutional neural networks[J]. Computer Engineering and Applications, 2018, 54(4): 174-178.

高大鹏，朱建刚. 多维度自适应3D卷积神经网络原子行为识别[J]. 计算机工程与应用, 2018, 54(4): 174-178.

