改进DeepLabv2的实时图像语义分割算法

doi:10.3778/j.issn.1002-8331.1911-0392

计算机工程与应用 ›› 2020, Vol. 56 ›› Issue (18): 157-164.DOI: 10.3778/j.issn.1002-8331.1911-0392

改进DeepLabv2的实时图像语义分割算法

马书浩，安居白，于博

大连海事大学信息科学技术学院，辽宁大连 116026

出版日期:2020-09-15 发布日期:2020-09-10

Improved DeepLabv2 Real-time Image Semantic Segmentation Algorithm

MA Shuhao, AN Jubai, YU Bo

College of Information Science and Technology, Dalian Maritime University, Dalian, Liaoning 116026, China

Online:2020-09-15 Published:2020-09-10

摘要/Abstract

摘要：

图像语义分割是计算机视觉感知系统的重要组成之一，针对现有的语义分割算法存在分割速度慢的问题提出基于DeepLabv2改进的实时图像语义分割算法。与DeepLabv2相比，改进后的算法使用轻量卷积神经网络Xception作为编码器，增加特征金字塔网络（Feature Pyramid Net，FPN）解码特征的过程，减少空洞金字塔池化网络（Atrous convolution Spatial Pyramid Pooling，ASPP）参数的数量，进而大幅度压缩了算法模型，提升了算法分割速度。此外，还对Focal Loss损失函数在多分类任务中难以选择超参数的问题做出改进，并用于提升算法分割精度。在Cityscapes和Pascal VOC2012数据集上的实验结果表明改进后的算法可达到实时分割速度且具有分割精度高的优点，同时还表明提出的超参数选择方法可进一步提升算法分割精度。

关键词: 语义分割, 卷积神经网络, 图像分割, 无人驾驶

Abstract:

Image semantic segmentation is one of the important components of computer vision perception system. Aiming at the problem of slow segmentation speed of existing semantic segmentation algorithms, an improved real-time image semantic segmentation algorithm based on DeepLabv2 is proposed. Compared with DeepLabv2, this algorithm uses light-weight convolution neural network which is Xception as the encoder, adds the decode process by the Feature Pyramid Net （FPN）, reduces the number of parameters of Atrous convolution Spatial Pyramid Pooling （ASPP）, so that greatly compresses the algorithm model and improves the algorithm’s segmentation speed. In addition, this paper improves the problem that the Focal Loss function is difficult to select hyper-parameters in multi-classification tasks and applies it to the algorithm in this paper to improve the segmentation accuracy of the algorithm. The experimental results on Cityscapes and Pascal VOC2012 show that the proposed algorithm can achieve real-time segmentation speed and has the advantage of high segmentation accuracy. Meanwhile, it also shows that the proposed hyper-parameter selection method can further improve the segmentation accuracy of the algorithm.

Key words: semantic segmentation, Convolution Neural Network（CNN）, image segmentation, unmanned

马书浩，安居白，于博. 改进DeepLabv2的实时图像语义分割算法[J]. 计算机工程与应用, 2020, 56(18): 157-164.

MA Shuhao, AN Jubai, YU Bo. Improved DeepLabv2 Real-time Image Semantic Segmentation Algorithm[J]. Computer Engineering and Applications, 2020, 56(18): 157-164.

[1]	王文曦, 李乐林. 深度学习在点云分类中的研究综述[J]. 计算机工程与应用, 2022, 58(1): 26-40.
[2]	张欣, 朱江. 面向样本不平衡的网络安全态势要素获取[J]. 计算机工程与应用, 2022, 58(1): 134-142.
[3]	张鹏, 孔韦韦, 滕金保. 基于多尺度特征注意力机制的人脸表情识别[J]. 计算机工程与应用, 2022, 58(1): 182-189.
[4]	杨有为, 周刚. 面向自然场景文本检测的改进NMS算法[J]. 计算机工程与应用, 2022, 58(1): 204-208.
[5]	谢宏, 王立宸, 袁小芳, 陈海滨. 机械臂卷积神经网络滑模轨迹跟踪控制[J]. 计算机工程与应用, 2022, 58(1): 268-273.
[6]	牟清萍，张莹，张东波，王新杰，杨知桥. 目标丢失判别机制的视觉跟踪算法及应用研究[J]. 计算机工程与应用, 2021, 57(9): 140-147.
[7]	包志强，邢瑜，吕少卿，黄琼丹. 改进YOLO V2的6D目标姿态估计算法[J]. 计算机工程与应用, 2021, 57(9): 148-153.
[8]	赵志焱，杨华，胡志伟，宇海萍. 基于TACNN的玉露香梨叶虫害识别[J]. 计算机工程与应用, 2021, 57(9): 176-181.
[9]	周伦钢，孙怡峰，王坤，吴疆，黄维贵，李炳龙. 目标多种多值属性的端端快速识别网络[J]. 计算机工程与应用, 2021, 57(9): 182-190.
[10]	张成，戴俊峰，熊闻心. 融合LeNet-5改进的扫描文档手写日期识别[J]. 计算机工程与应用, 2021, 57(9): 207-211.
[11]	麻哲旭，杨峰，乔旭. 铁路路基病害智能检测方法[J]. 计算机工程与应用, 2021, 57(9): 272-278.
[12]	冉蓉，徐兴华，邱少华，崔小鹏，欧阳斌. 基于深度卷积神经网络的裂纹检测方法综述[J]. 计算机工程与应用, 2021, 57(9): 23-35.
[13]	李晓筱，胡晓光，王梓强，杜卓群. 基于深度学习的实例分割研究进展[J]. 计算机工程与应用, 2021, 57(9): 60-67.
[14]	徐少杰，曹雏清，王永娟. 视觉SLAM在室内动态场景中的应用研究[J]. 计算机工程与应用, 2021, 57(8): 175-179.
[15]	赵阳，张俊华. 多尺度特征融合的脊柱X线图像分割方法[J]. 计算机工程与应用, 2021, 57(8): 214-219.

改进DeepLabv2的实时图像语义分割算法

Improved DeepLabv2 Real-time Image Semantic Segmentation Algorithm

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics