计算机工程与应用 ›› 2021, Vol. 57 ›› Issue (19): 164-170.DOI: 10.3778/j.issn.1002-8331.2104-0275

• 模式识别与人工智能 • 上一篇    下一篇

基于注意力机制和图卷积的小样本分类网络

王晓茹,张珩   

  1. 1.北京邮电大学 计算机学院,北京 100876
    2.北京市网络系统与网络文化重点实验室,北京 100876
  • 出版日期:2021-10-01 发布日期:2021-09-29

Relation Network Based on Attention Mechanism and Graph Convolution for Few-Shot Learning

WANG Xiaoru, ZHANG Heng   

  1. 1.College of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2.Beijing Key Laboratory of Network System and Network Culture, Beijing 100876, China
  • Online:2021-10-01 Published:2021-09-29

摘要:

Deep neural networks have dominated image recognition task with large amounts of labeled data. But training a well-performing network on a smaller dataset is still a very challenging task. How to learn from limited labeled data is a key research with excellent scenarios and potential applications. There are many ways to solve few-shot recognition problem, but there is still a problem of low recognition accuracy. The fundamental reason is that in few-shot learning, the traditional neural network can only accept a small amount of labeled data, which makes the network unable to obtain enough information for identification. Therefore, the paper proposes a few-shot classification model based on attention mechanism and graph convolutional neural network, which can not only extract features better, but also make full use of the features to classify the target image. Through the attention mechanism, it can guide the neural network to pay attention to more useful information, and graph convolution enables the network to make more accurate judgments by using the information from other classes of support set. Through many experiments, it is proved that the classification accuracy of the model on the Omniglot dataset and the miniImageNet dataset surpasses the original relational network which based on traditional neural network.

关键词: few-shot learning, image recognition, attention mechanism, graph convolutional network

Abstract:

Deep neural networks have dominated image recognition task with large amounts of labeled data. But training a well-performing network on a smaller dataset is still a very challenging task. How to learn from limited labeled data is a key research with excellent scenarios and potential applications. There are many ways to solve few-shot recognition problem, but there is still a problem of low recognition accuracy. The fundamental reason is that in few-shot learning, the traditional neural network can only accept a small amount of labeled data, which makes the network unable to obtain enough information for identification. Therefore, the paper proposes a few-shot classification model based on attention mechanism and graph convolutional neural network, which can not only extract features better, but also make full use of the features to classify the target image. Through the attention mechanism, it can guide the neural network to pay attention to more useful information, and graph convolution enables the network to make more accurate judgments by using the information from other classes of support set. Through many experiments, it is proved that the classification accuracy of the model on the Omniglot dataset and the miniImageNet dataset surpasses the original relational network which based on traditional neural network.

Key words: few-shot learning, image recognition, attention mechanism, graph convolutional network