Improved Semantic Segmentation Network for Indoor Scenes

doi:10.3778/j.issn.1002-8331.2009-0295

Abstract

Abstract:

Aiming at the problem that the current indoor scene semantic segmentation method cannot well integrate the RGB information and depth information of the image, an improved indoor scene semantic segmentation network is proposed. In order to enable the model to selectively fuse the depth features and RGB features of the image, and introduce the idea of attention mechanism, a feature fusion module is designed. According to the characteristics of depth feature map and RGB feature map, the module can adjust network parameters learning, and more effectively carry out deep fusion of depth features and RGB features. At the same time, multi-scale joint training is used to accelerate network convergence and improve segmentation accuracy. Through the verification on the SUNRGB-D and NYUDV2 datasets, compared to the current mainstream semantic segmentation networks such as RGB-D Fully Convolutional Neural Network（DFCN） with a Depth-sensitive fully-connected Conditional Random Field（DCRF）, Depth-aware convolutional neural networks （Depth-aware CNN）, Multi-path Refinement Network （RefineNet）, etc., the proposed network has higher segmentation accuracy, Mean Intersection over Union （mIoU） reached 46.6% and 48.0%, respectively.

Key words: indoor scene semantic segmentation, deep learning, attention mechanism, feature fusion, multi-scale joint training

摘要：

针对目前室内场景语义分割网络无法很好融合图像的RGB信息和深度信息的问题，提出一种改进的室内场景语义分割网络。为使网络能够有选择性地融合图像的深度特征和RGB特征，引入注意力机制的思想，设计了特征融合模块。该模块能够根据深度特征图和RGB特征图的特点，学习性地调整网络参数，更有效地对深度特征和RGB特征进行融合；同时使用多尺度联合训练，加速网络收敛，提高分割准确率。通过在SUNRGB-D和NYUDV2数据集上验证，相比于包含深度敏感全连接条件随机场的RGB-D全卷积神经网络（DFCN-DCRF）、深度感知卷积神经网络（Depth-aware CNN）、多路径精炼网络（RefineNet）等目前主流的语义分割网络，所提网络具有更高的分割精度，平均交并比（mIoU）分别达到46.6%和48.0%。

关键词: 室内场景语义分割, 深度学习, 注意力机制, 特征融合, 多尺度联合训练

HE Zhaomeng, KONG Guangqian, WU Yun. Improved Semantic Segmentation Network for Indoor Scenes[J]. Computer Engineering and Applications, 2021, 57(16): 197-202.

贺照蒙，孔广黔，吴云. 一种改进的室内场景语义分割网络[J]. 计算机工程与应用, 2021, 57(16): 197-202.

[1]	HUANG Dongyi, YANG Bing, WU Zihao, KUANG Jiayi, YAN Zeming. Spatio-Temporal Fully Connected Convolutional Neural Networks for Citywide Cellular Prediction [J]. Computer Engineering and Applications, 2021, 57(9): 168-175.
[2]	ZHOU Lungang, SUN Yifeng, WANG Kun, WU Jiang, HUANG Weigui, LI Binglong. End to End Object Recognition Algorithm for Multi-attributes of Multi-values [J]. Computer Engineering and Applications, 2021, 57(9): 182-190.
[3]	ZHANG Cheng, DAI Junfeng, XIONG Wenxin. Improved Handwritten Date Recognition in Scanned Documents Combined with LeNet-5 [J]. Computer Engineering and Applications, 2021, 57(9): 207-211.
[4]	ZHANG Zhentong, SHAN Yugang, YUAN Jie. Remote Sensing Image Detection Algorithm Combining Multi-scale and Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(9): 212-216.
[5]	LU Lixia, ZOU Junzhong, GUO Yucheng, ZHANG Jian, WANG Bei. Prediction of Knee Injury Based on Multimodal Fusion [J]. Computer Engineering and Applications, 2021, 57(9): 225-232.
[6]	WU Wenjie, SONG Wen’ai, GAO Xuemei, YANG Jijiang, WANG Qing, HUANG Liping, LEI Yi. Review of X-Ray-Based Computer-Aided Diagnosis of Adult OSA [J]. Computer Engineering and Applications, 2021, 57(9): 1-8.
[7]	XU Hao, ZHANG Kai, TIAN Yingjie, CHONG Faguang, WANG Zichao. Review of Deep Neural Network-Based Image Caption [J]. Computer Engineering and Applications, 2021, 57(9): 9-22.
[8]	RAN Rong, XU Xinghua, QIU Shaohua, CUI Xiaopeng, OUYANG Bin. Review of Crack Detection Methods Based on Deep Convolutional Neural Networks [J]. Computer Engineering and Applications, 2021, 57(9): 23-35.
[9]	LI Xiaoxiao, HU Xiaoguang, WANG Ziqiang, DU Zhuoqun. Survey of Instance Segmentation Based on Deep Learning [J]. Computer Engineering and Applications, 2021, 57(9): 60-67.
[10]	LI Mingshan, HAN Qingpeng, ZHANG Tianyu, WANG Daolei. Safety Helmet Detection Method of Improved SSD [J]. Computer Engineering and Applications, 2021, 57(8): 192-197.
[11]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[12]	ZENG Chunyan, YAN Kang, WANG Zhifeng, YU Yan, JI Chunmei. Survey of Interpretability Research on Deep Learning Models [J]. Computer Engineering and Applications, 2021, 57(8): 1-9.
[13]	XU Degang, WANG Lu, LI Fan. Review of Typical Object Detection Algorithms for Deep Learning [J]. Computer Engineering and Applications, 2021, 57(8): 10-25.
[14]	JIANG Bin, ZHONG Rui, ZHANG Qiuwen, ZHANG Huanlong. Survey of Non-frontal Facial Expression Recognition by Using Deep Learning Methods [J]. Computer Engineering and Applications, 2021, 57(8): 48-61.
[15]	ZHAO Yuanli, LIANG Zhijian. Research on Stance Detection Based on Dual Attention Mechanism of Heteronuclear Convolution [J]. Computer Engineering and Applications, 2021, 57(8): 119-125.

Improved Semantic Segmentation Network for Indoor Scenes

一种改进的室内场景语义分割网络

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics