Improved DeepLabv2 Real-time Image Semantic Segmentation Algorithm

doi:10.3778/j.issn.1002-8331.1911-0392

Abstract

Abstract:

Image semantic segmentation is one of the important components of computer vision perception system. Aiming at the problem of slow segmentation speed of existing semantic segmentation algorithms, an improved real-time image semantic segmentation algorithm based on DeepLabv2 is proposed. Compared with DeepLabv2, this algorithm uses light-weight convolution neural network which is Xception as the encoder, adds the decode process by the Feature Pyramid Net （FPN）, reduces the number of parameters of Atrous convolution Spatial Pyramid Pooling （ASPP）, so that greatly compresses the algorithm model and improves the algorithm’s segmentation speed. In addition, this paper improves the problem that the Focal Loss function is difficult to select hyper-parameters in multi-classification tasks and applies it to the algorithm in this paper to improve the segmentation accuracy of the algorithm. The experimental results on Cityscapes and Pascal VOC2012 show that the proposed algorithm can achieve real-time segmentation speed and has the advantage of high segmentation accuracy. Meanwhile, it also shows that the proposed hyper-parameter selection method can further improve the segmentation accuracy of the algorithm.

Key words: semantic segmentation, Convolution Neural Network（CNN）, image segmentation, unmanned

摘要：

图像语义分割是计算机视觉感知系统的重要组成之一，针对现有的语义分割算法存在分割速度慢的问题提出基于DeepLabv2改进的实时图像语义分割算法。与DeepLabv2相比，改进后的算法使用轻量卷积神经网络Xception作为编码器，增加特征金字塔网络（Feature Pyramid Net，FPN）解码特征的过程，减少空洞金字塔池化网络（Atrous convolution Spatial Pyramid Pooling，ASPP）参数的数量，进而大幅度压缩了算法模型，提升了算法分割速度。此外，还对Focal Loss损失函数在多分类任务中难以选择超参数的问题做出改进，并用于提升算法分割精度。在Cityscapes和Pascal VOC2012数据集上的实验结果表明改进后的算法可达到实时分割速度且具有分割精度高的优点，同时还表明提出的超参数选择方法可进一步提升算法分割精度。

关键词: 语义分割, 卷积神经网络, 图像分割, 无人驾驶

MA Shuhao, AN Jubai, YU Bo. Improved DeepLabv2 Real-time Image Semantic Segmentation Algorithm[J]. Computer Engineering and Applications, 2020, 56(18): 157-164.

马书浩，安居白，于博. 改进DeepLabv2的实时图像语义分割算法[J]. 计算机工程与应用, 2020, 56(18): 157-164.

[1]	LI Xiaoxiao, HU Xiaoguang, WANG Ziqiang, DU Zhuoqun. Survey of Instance Segmentation Based on Deep Learning [J]. Computer Engineering and Applications, 2021, 57(9): 60-67.
[2]	XU Shaojie, CAO Chuqing, WANG Yongjuan. Application Research of Visual SLAM in Indoor Dynamic Scenes [J]. Computer Engineering and Applications, 2021, 57(8): 175-179.
[3]	ZHAO Yang, ZHANG Junhua. Multi-scale Feature Fusion Method for Spinal X-Ray Image Segmentation [J]. Computer Engineering and Applications, 2021, 57(8): 214-219.
[4]	PAN Peixin, PAN Zhongliang. Active Contour Image Segmentation Combined with Saliency [J]. Computer Engineering and Applications, 2021, 57(8): 225-230.
[5]	DONG Peng, ZHOU Feng, ZHAO Congcong, WANG Yafei, MI Zetian, FU Xianping. Automatic Measurement of Underwater Sea Cucumber Size Based on Binocular Vision [J]. Computer Engineering and Applications, 2021, 57(8): 271-278.
[6]	SHI Chuntian, ZENG Yanyang, HOU Shouming. Summary of Application of Swarm Intelligence Algorithms in Image Segmentation [J]. Computer Engineering and Applications, 2021, 57(8): 36-47.
[7]	LI Xianguo, FENG Xinxin, LI Jianxiong. Sigle Image Super-Resolution Reconstruction Based on Multi-scale Residual Network [J]. Computer Engineering and Applications, 2021, 57(7): 215-221.
[8]	YANG Bo, TAO Qingchuan, DONG Peijun. Surgical Instrument Segmentation Method Based on Improved Deeplab v3+ Network [J]. Computer Engineering and Applications, 2021, 57(7): 222-227.
[9]	HOU Xuan, XUE Fei, CHEN Tao. UAV Target Detection on Quantum Multi-pattern Recognition Optimization Algorithm [J]. Computer Engineering and Applications, 2021, 57(7): 228-236.
[10]	YUAN Mingyang, HUANG Hongbo, ZHOU Changsheng. Research Progress of Image Semantic Segmentation Based on Fully Supervised Learning [J]. Computer Engineering and Applications, 2021, 57(4): 43-54.
[11]	YU Xiaojie, HE Yong, LIU Shenghua. Improved ORB Feature Optical Flow Algorithm for Indoor Positioning of Unmanned Aerial Vehicle [J]. Computer Engineering and Applications, 2021, 57(4): 266-271.
[12]	PENG Jing, LUO Haoyu, ZHAO Gansen, LIN Chengchuang, YI Xusheng, CHEN Shaojie. Survey of Medical Image Segmentation Algorithm in Deep Learning [J]. Computer Engineering and Applications, 2021, 57(3): 44-57.
[13]	YANG Yanan, ZHANG Hongming, LI Hanghao, YANG Jiangtao, QUAN Kai. Research on UAV Terrace Recognition Method Based on FCN and DenseCRF Model [J]. Computer Engineering and Applications, 2021, 57(3): 222-230.
[14]	LIN Shubin, WU Guishan, XU Jiayun, YANG Wenyuan. Multi-frame Surveillance of Correlation Filter in UAV Object Tracking [J]. Computer Engineering and Applications, 2021, 57(24): 152-160.
[15]	GU Haiyan, CHEN Liang, WANG Duodian. Space-Time Cooperative Path Planning for Multi-UAV Using Model Predictive Control [J]. Computer Engineering and Applications, 2021, 57(23): 270-279.

Improved DeepLabv2 Real-time Image Semantic Segmentation Algorithm

改进DeepLabv2的实时图像语义分割算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics