Discriminative spatio-temporal pyramid compact representations algorithm

doi:10.3778/j.issn.1002-8331.1607-0105

Abstract

Abstract: In Spatio-Temporal Pyramid Representation (STPR), a video is divided into increasingly fine cubic unit cells at each pyramid level. Local features are extracted from every cell and concatenated into a single high-dimensional feature vector, so training and testing incur high computational costs. Moreover, because the partitioning of the video is designed by hand, there is little theoretical basis for choosing the partitioning that is optimal for behavior recognition. This paper proposes discriminative STPR, a new representation that constructs the video feature as a weighted sum of semi-local features over all pyramid levels. The weights are computed automatically by the partial least squares method to maximize discriminative power. The resulting representation is compact and retains high discriminative power. Furthermore, observing the optimal weights of the cubic unit cells generated from the finest cells reveals both the discriminative cells and the number of pyramid levels.
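The difference in descriptor size between concatenation and a weighted sum can be illustrated with a minimal NumPy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the 2^l x 2^l x 2^l split at level l, the three pyramid levels, the 128-dimensional cell features, the random feature values, and the function names. The weights are a uniform placeholder; the partial least squares step that the paper uses to learn them is not implemented here.

```python
import numpy as np


def pyramid_cell_count(num_levels: int) -> int:
    """Total number of cells, assuming level l splits x, y and t into 2**l parts each."""
    return sum((2 ** level) ** 3 for level in range(num_levels))


def concatenated_descriptor(cell_features: np.ndarray) -> np.ndarray:
    """Conventional STPR: stack all cell features, (n_cells, D) -> (n_cells * D,)."""
    return cell_features.reshape(-1)


def weighted_sum_descriptor(cell_features: np.ndarray, weights: np.ndarray) -> np.ndarray:
    """Compact form: weighted sum over cells, (n_cells, D) and (n_cells,) -> (D,)."""
    return weights @ cell_features


rng = np.random.default_rng(0)
num_levels, feat_dim = 3, 128                 # hypothetical settings
n_cells = pyramid_cell_count(num_levels)      # 1 + 8 + 64 = 73 cells

# Stand-in for the pooled semi-local feature of each cell.
cell_features = rng.random((n_cells, feat_dim))

# Placeholder weights; in the paper these are learned, not uniform.
weights = np.full(n_cells, 1.0 / n_cells)

concat = concatenated_descriptor(cell_features)
compact = weighted_sum_descriptor(cell_features, weights)
print(concat.shape, compact.shape)            # (9344,) (128,)
```

Under these assumptions the concatenated descriptor grows with the number of cells, while the weighted sum stays at the per-cell feature dimension regardless of how many levels are used.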

Key words: Spatio-Temporal Pyramid Representation (STPR), Partial Least Squares (PLS) method, sparse coding, pooling, low-level feature

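Abstract (translated from Chinese): When a conventional spatio-temporal pyramid has many levels, the dimensionality of the feature descriptor becomes very high, which makes such descriptors computationally inefficient in both training and testing. In addition, the pyramid levels and the cubic unit cells within each level are still partitioned by hand, so the video partitioning strategy lacks a strong theoretical basis. To address these shortcomings, a highly discriminative compact spatio-temporal pyramid descriptor algorithm is proposed. The new descriptor is a weighted sum of the local features of every cubic unit cell across all pyramid levels, rather than a concatenation of all cell-level descriptors into one huge descriptor. The weight of each cubic unit cell is obtained automatically by the partial least squares method, so the resulting global video descriptor is compact and highly discriminative. Furthermore, observing the weights of the fine cubic unit cells reveals the contribution of each cell and each pyramid level, so the video can be partitioned automatically according to the weights. Experiments on the HMDB51 and YouTube action datasets show that, compared with the spatio-temporal pyramid descriptor and super sparse coding vectors, the proposed descriptor is compact and achieves good recognition performance at low dimensionality.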

Key words (translated from Chinese): spatio-temporal pyramid, partial least squares method, sparse coding, pooling, low-level features

CUI Xuehong, LIU Yun, WANG Chuanxu, LI Hui. Discriminative spatio-temporal pyramid compact representations algorithm[J]. Computer Engineering and Applications, 2018, 54(1): 210-216.

崔雪红，刘云，王传旭，李辉. 高显著性的时空金字塔精简描述符算法研究[J]. 计算机工程与应用, 2018, 54(1): 210-216.

