Research on Multi-resolution Human Pose Estimation with Attention Mechanism

doi:10.3778/j.issn.1002-8331.2010-0317

Abstract

Abstract:

In order to solve the problem that spatial information of feature maps is unable to effectively utilize when multi-resolution feature representations are directly fused in human pose estimation task, the multi-resolution human pose estimation network is proposed based on the High-Resolution Net（HRNet） for structural design, namely GCT-Nonlocal Net （GNNet）, which combines both channel domain and spatial domain attention mechanism and contains improved exchange units, Gateneck module and Gateblock module. The exchange units are improved to extract more useful spatial information from the various feature representations by adding spatial attention mechanism before the multi-scale fusions, which make the information fusions between the different resolution representations better and result in the final high-resolution representation containing richer representation information. In addition, the Gateneck module and the Gateblock module are able to model channel relationships more explicitly to extract channel information more effectively by introducing channel attention mechanism. The verification results on MS COCO VAL 2017 dataset show that the proposed GNNet achieves higher accuracy with the similar parameter and computation complexities, compared with the state-of-the-art human pose estimation network, HRNet, and the mAP is improved by 1.4 percentage points. As a result, the improved exchanged units make multi-scale information fusions more effective between the various resolution representations.

Key words: convolutional neural network, human pose estimation, multi-resolution feature representation fusion, spatial attention mechanism, channel attention mechanism

摘要：

针对人体姿态估计任务中多分辨率特征表征直接融合时存在无法有效利用特征图空间特征信息的问题，基于High-Resolution Net（HRNet）进行结构设计，构建出结合了通道域注意力和空间域注意力机制的多分辨率人体姿态估计网络GCT-Nonlocal Net（GNNet），提出了一种基于注意力机制的多分辨率表征融合方法，在不同分辨率表征融合前由空间注意力提取出各分辨率表征更有用的空间特征信息来改进融合单元，使得各分辨率表征间的信息融合效果更佳，最终输出的高分辨率表征含有更丰富的特征信息，同时构造了Gateneck模块和Gateblock模块，其通过引入通道注意力更明确地对通道关系建模从而高效地提取通道信息。在MS COCOVAL 2017进行验证，结果显示提出的GNNet相较于SOTA级表现的HRNet在相当参数量与运算量的情况下获得了更高的准确度，mAP提高了1.4个百分点。实验结果表明，所提方法有效地提高了多分辨率特征表征融合效果。

关键词: 卷积神经网络, 人体姿态估计, 多分辨率特征表征融合, 空间域注意力机制, 通道域注意力机制

ZHANG Yue, HUANG Yourui, LIU Pengkun. Research on Multi-resolution Human Pose Estimation with Attention Mechanism[J]. Computer Engineering and Applications, 2021, 57(8): 126-132.

张越，黄友锐，刘鹏坤. 引入注意力机制的多分辨率人体姿态估计研究[J]. 计算机工程与应用, 2021, 57(8): 126-132.

[1]	RAN Rong, XU Xinghua, QIU Shaohua, CUI Xiaopeng, OUYANG Bin. Review of Crack Detection Methods Based on Deep Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(9): 23-35.
[2]	MOU Qingping, ZHANG Ying, ZHANG Dongbo, WANG Xinjie, YANG Zhiqiao. Research on Visual Tracking Algorithm and Application of Target Loss Discrimination Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 140-147.
[3]	BAO Zhiqiang, XING Yu, LYU Shaoqing, HUANG Qiongdan. Improved YOLO V2 6D Object Pose Estimation Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 148-153.
[4]	HUANG Dongyi, YANG Bing, WU Zihao, KUANG Jiayi, YAN Zeming. Spatio-Temporal Fully Connected Convolutional Neural Networks for Citywide Cellular Prediction [J]. Computer Engineering and Applications, 2021, 57(9): 168-175.
[5]	ZHAO Zhiyan, YANG Hua, HU Zhiwei, YU Haiping. Identification Model of Pests on Yuluxiang Pear Leaves Based on TACNN [J]. Computer Engineering and Applications, 2021, 57(9): 176-181.
[6]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[7]	ZHANG Cheng, DAI Junfeng, XIONG Wenxin. Improved Handwritten Date Recognition in Scanned Documents Combined with LeNet-5 [J]. Computer Engineering and Applications, 2021, 57(9): 207-211.
[8]	MA Zhexu, YANG Feng, QIAO Xu. Intelligent Detection Method of Railway Subgrade Defect [J]. Computer Engineering and Applications, 2021, 57(9): 272-278.
[9]	LIANG Fangxuan, YANG Feng, LU Liyun, YIN Mengxiao. Review of Brain Tumor Segmentation Methods Based on Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(7): 34-43.
[10]	YANG Peiwei, ZHOU Yuhong, XING Gang, TIAN Zhiqiang, XU Xiayu. Applications of Convolutional Neural Network in Biomedical Image [J]. Computer Engineering and Applications, 2021, 57(7): 44-58.
[11]	CHANG Hao, CHEN Xiaolei, ZHANG Aihua, LI Ce, LIN Dongmei. Continuous Blood Pressure Prediction Based on Improved SENet Convolutional Neural Network [J]. Computer Engineering and Applications, 2021, 57(7): 130-135.
[12]	WANG Chong, HAN Zhenqi, XU Haoyu, ZHU Yongxin, XU Sheng, CHEN Xia. Efficient Crack Detection Algorithm Based on Improved Saliency Map [J]. Computer Engineering and Applications, 2021, 57(6): 219-224.
[13]	HUANG Jinjie, LIN Jiangquan, HE Yongjun, HE Jinjie, WANG Yajun. Chinese Short Text Classification Algorithm Based on Local Semantics and Context [J]. Computer Engineering and Applications, 2021, 57(6): 94-100.
[14]	ZHANG Liang, ZHANG Zeng, SHU Weihua, MEI Kuizhi. Convolutional Layered Pruning Based on YOLOv3 [J]. Computer Engineering and Applications, 2021, 57(6): 131-137.
[15]	HU Wei’an, ZOU Junzhong, GUO Yucheng, ZHANG Jian, WANG Bei. Lesion Recognition Method of Pathological Images Based on Multidimensional Features [J]. Computer Engineering and Applications, 2021, 57(6): 144-151.

Research on Multi-resolution Human Pose Estimation with Attention Mechanism

引入注意力机制的多分辨率人体姿态估计研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics