Computer Engineering and Applications ›› 2024, Vol. 60 ›› Issue (4): 173-182.DOI: 10.3778/j.issn.1002-8331.2301-0012

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Few-Shot Scene Classification with Attention Mechanism in Remote Sensing

ZHANG Duona, ZHAO Hongjia, LU Yuanyao, CUI Jian, ZHANG Baochang   

  1. 1. School of Information Science and Technology, North China University of Technology, Beijing 100144, China
    2. Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
  • Online:2024-02-15 Published:2024-02-15

融入注意力机制的小样本遥感图像场景分类

张多纳,赵宏佳,鲁远耀,崔健,张宝昌   

  1. 1. 北方工业大学 信息学院,北京  100144
    2. 北京航空航天大学 人工智能研究院,北京  100191

Abstract: Remote sensing scene classification is a hot research topic in the field of computer vision, and it is of great significance to semantic understanding of remote sensing images. At present, remote sensing scene classification methods based on deep learning occupy a dominant position in this field. However, it suffers from the lack of samples and poor model generalization ability in actual application scenarios. Therefore, this paper proposes a few-shot remote scene classification method based on attention mechanism, and designs a structure of dual-branches similarity measurement. This method is based on the meta-learning training strategy to divide the dataset into tasks. At the meantime, the input images are divided into blocks in order to preserve the feature distribution in the remote sensing image. Then the lightweight attention module is introduced into the feature extraction network to reduce the risk of overfitting and ensure the acquisition of discriminative features. Finally, based on earth mover’s distance (EMD), a dual-branches similarity measurement module is added to improve the discriminative ability of the classifier. The results show that compared with the classic small-sample learning method, the few-shot remote scene classification method proposed in this paper can significantly improve the classification performance.

Key words: remote sensing scene classification, few-shot learning, meta-learning, attention mechanism, dual-branches similarity measurement

摘要: 遥感图像场景分类是计算机视觉领域的热点研究方向,对遥感图像场景及其语义理解意义重大。目前,基于深度学习的遥感图像场景分类方法在该领域占据主导地位。然而实际应用场景面临着样本数据较少、模型泛化能力较差的问题,致使基于深度学习的遥感图像场景分类方法实现难度较大,性能大幅下降。针对上述难点,提出了基于注意力机制的小样本遥感图像场景分类方法,设计了一种双分支判别结构进行相似性度量。该方法基于元学习训练策略对数据集进行任务制划分;为最大限度保留遥感图像中的特征分布,对输入图像进行重叠分块;在特征提取网络中引入轻量级注意力模块,降低过拟合风险并保证判别性特征的获取;在EMD(earth mover’s distance)距离的基础上设计添加双分支相似性度量模块,提升分类器的判别能力。实验结果表明,相较于经典小样本学习方法,所提出的小样本遥感图像场景分类方法能够显著提升分类性能。

关键词: 遥感图像场景分类, 小样本学习, 元学习, 注意力机制, 双分支判别