YU Haigang, HE Ning, LIU Shengjie, HAN Wenjing. Research on Human Behavior Recognition Based on Temporal and Spatial Information Fusion[J]. Computer Engineering and Applications, 2023, 59(3): 202-208.
[1] SIMONYAN K,ZISSERMAN A.Two-stream convolutional networks for action recognition in videos[J].arXiv:1406. 2199,2014.
[2] TRAN D,BOURDEV L,FERGUS R,et al.Learning spatiotemporal features with 3d convolutional networks[C]//Proceedings of the IEEE International Conference on Computer Vision,2015:4489-4497.
[3] FEICHTENHOFER C.X3d:expanding architectures for efficient video recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2020:203-213.
[4] ROY A G,NAVAB N,WACHINGER C.Concurrent spatial and channel ‘squeeze & excitation’in fully convolutional networks[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention.Cham:Springer,2018:421-429.
[5] CHEN Y,GONG S M.Human action recognition network based on improved channel attention mechanism[J].Journal of Electronics & Information Technology,2021:43(12):3538-3545.
[6] CHO S,MAQBOOL M,LIU F,et al.Self-attention network for skeleton-based human action recognition[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision,2020:635-644.
[7] WANG Z,SHE Q,SMOLIC A.ACTION-Net:multipath excitation for action recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2021:13214-13223.
[8] FEICHTENHOFER C,PINZ A,ZISSERMAN A.Convolutional two-stream network fusion for video action recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2016:1933-1941.
[9] LIU C,YING J,YANG H,et al.Improved human action recognition approach based on two-stream convolutional neural network model[J].The Visual Computer,2021,37(6):1327-1341.
[10] WANG L,XIONG Y,WANG Z,et al.Temporal segment networks:towards good practices for deep action recognition[C]//European Conference on Computer Vision.Cham:Springer,2016:20-36.
[11] HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2016:770-778.
[12] XIE S,GIRSHICK R,DOLLáR P,et al.Aggregated residual transformations for deep neural networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2017:1492-1500.
[13] SZEGEDY C,LIU W,JIA Y,et al.Going deeper with convolutions[C]//Proceedings of the IEEE Conferenceon Computer Vision and Pattern Recognition,2015:1-9.
[14] GAO S,CHENG M M,ZHAO K,et al.Res2net:a new multi-scale backbone architecture[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2021,43(2):652-662.
[15] DENG J,DONG W,SOCHER R,et al.Imagenet:a large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition,2009:248-255.
[16] ZACH C,POCK T,BISCHOF H.A duality based approach for realtime tv-l 1 optical flow[C]//Joint Pattern Recognition Symposium.Berlin,Heidelberg:Springer,2007:214-223.
[17] ZHANG C C,HE N.Human motion recognition based on key frame two-stream convolutional network[J].Journal of Nanjing University of Information Science and Technology(Science Edition),2019,11(6):716-721.
[18] ZHOU Y,SUN X,LUO C,et al.Spatiotemporal fusion in 3D CNNs:a probabilistic view[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2020:9829-9838.
[19] WOO S,PARK J,LEE J Y,et al.Cbam:convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision,2018:3-19.
[20] YUAN C M,NIU Y,GUO T,et al.Pedestrian re-recognition based on clothing feature transfer[J].Computer Science and Exploration,2020,15(9):1740-1752.