计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (20): 215-223.DOI: 10.3778/j.issn.1002-8331.2307-0174

• 图形图像处理 • 上一篇    下一篇

边缘优化和注意力融合的遥感地物分割算法

闵锋,彭伟明,况永刚,毛一新,郝琳琳   

  1. 1.武汉工程大学 计算机科学与工程学院,武汉 430205
    2.武汉工程大学 智能机器人湖北省重点实验室,武汉 430205
  • 出版日期:2024-10-15 发布日期:2024-10-15

Remote Sensing Ground Object Segmentation Algorithm Based on Edge Optimization and Attention Fusion

MIN Feng, PENG Weiming, KUANG Yonggang, MAO Yixin, HAO Linlin   

  1. 1.School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China
    2.Hubei Province Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan 430205, China
  • Online:2024-10-15 Published:2024-10-15

摘要: 针对遥感地物图像种类众多且目标边缘较复杂的特点,以及现有分割网络中局部卷积的感受野有限,对图像上下文信息利用不足,导致分割目标边缘模糊以及分割精度低等问题,提出一种基于UNet3+网络的遥感地物分割算法。在解码过程中引入相似性感知点关联算子作为上采样方式,通过聚合特征金字塔中的多个建议,改善目标边界细节的分割能力;在编码过程中引入选择性内核模块,优化下采样方式,以实现神经元的自适应感受野大小,充分地获取目标特征的多尺度信息,精准捕捉有用的细节语义信息;在跳跃连接阶段添加双多尺度注意力模块,对不同尺度的特征进行加权融合,使模型更好地关注局部细节和全局上下文信息。在WHDLD、ISPRS Potsdam数据集上的实验表明,改进算法的平均交并比分别达到了64.4%、75.4%,较基线模型分别提升了约2.6个百分点、3.2个百分点,同时验证了改进算法在分割边缘模糊问题上的有效性。

关键词: 遥感地物, UNet3+, 相似性感知点关联, 选择性内核模块, 双多尺度注意力

Abstract: Considering the characteristics of remote sensing land cover images with a wide variety of types and complex object edges as well as the limited receptive field of local convolutions in existing segmentation networks resulting in inadequate utilization of contextual information, leading to issues such as blurred object edges and low segmentation accuracy, this paper proposes a remote sensing land cover segmentation algorithm based on the UNet3+ network architecture. Firstly, during the decoding process, a similarity-aware point affiliation operator is introduced as an upsampling method. This operator aggregates multiple proposals from the feature pyramid to enhance the segmentation capability for object boundary details. Secondly, in the encoding process, a selective kernel module is introduced to optimize the downsampling approach. This module enables neurons to achieve an adaptive receptive field size, facilitating the acquisition of multi-scale information from target features and precise capture of valuable detailed semantic information. Finally, in the skip-connection phase, a dual multi-scale attention module is added to perform weighted fusion of features from different scales, enabling the model to better focus on both local details and global contextual information. Experimental results on the WHDLD and ISPRS Potsdam datasets demonstrate that the proposed algorithm achieves mean intersection over union (MIoU) improvements of 64.4% and 75.4% respectively, compared to baseline models, the improvement is about 2.6 percentage points and 3.2 percentage points respectively. This also validates the effectiveness of the proposed algorithm in addressing the issue of blurry segmentation edges.

Key words: remote sensing land cover, UNet3+, similarity-aware point affiliation, selective kernel module, dual multi-scale attention