Computer Engineering and Applications ›› 2023, Vol. 59 ›› Issue (10): 94-103.DOI: 10.3778/j.issn.1002-8331.2208-0486

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Multi-Modal Meteorological Forecasting Based on Transformer

XIANG Deping, ZHANG Pu, XIANG Shiming, PAN Chunhong   

  1. 1.Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
    2.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100190, China
  • Online:2023-05-15 Published:2023-05-15

基于Transformer的多模态气象预测

向德萍,张普,向世明,潘春洪   

  1. 1.中国科学院 自动化研究所,北京 100190
    2.中国科学院大学 人工智能学院,北京 100190

Abstract: Thanks to the rapid development of meteorological observation technology, the meteorological industry has accumulated massive meteorological data, which provides an opportunity to build new data-driven meteorological forecasting methods. Due to the long-term dependence and large-scale spatial correlation hidden in meteorological data, and due to the complex coupling relationship between different modalities, meteorological forecasting with deep learning is still a challenging research topic. This paper presents a deep learning model for meteorological forecasting based on multi-modal fusion, using sequential multi-modal data in same atmospheric pressure levels composed of four classical meteorological elements:temperature, relative humidity, U-component of wind and V-component of wind. Specifically, convolutional network is used to learn features from every modality, and with those features, the gating mechanism is introduced to multi-modal weighted fusion. Secondly, the attention mechanism is introduced, which replaces the traditional attention mechanism with parallel spatial-temporal axial attention, in order to effectively learn long-term dependencies and large-scale spatial associations. Architecturally, the Transformer encoder-decoder structure is employed as the overall framework. Extensive comparative experiments have been conducted on the regional ERA5 reanalysis dataset, demonstrating that the proposed method is effective and superior in the prediction of temperature, relative humidity and wind.

Key words: meteorological forecasting, multi-modal fusion, Transformer, axial attention

摘要: 随着气象观测技术的快速发展,气象行业积累了海量的气象大数据,为构建新型的数据驱动的气象预测模型提供了机遇。由于气象数据中存在的长时依赖关系和大范围空间关联关系,以及多模态气象要素间存在的复杂跨模态耦合关系,基于深度学习的气象预测是一个具有挑战性的研究课题。针对“温度、相对湿度、纬向风速、经向风速”四种经典气象要素组成的等气压层时序多模态数据,提出了一种基于多模态融合的气象预测深度学习模型。首先采用卷积网络来学习各个模态的特征,并在此基础上引入门控机制实现多模态加权融合;然后引入注意力机制,以并行时空轴向注意力代替传统的注意力机制,从而有效地学习长时依赖关系和大范围空间关联关系。整体结构上,采用了基于Transformer的编码器-解码器结构。在ERA5再分析数据集(子区域)上进行了对比实验,实验结果表明了所提方法在温度、相对湿度、风速等预测任务上的有效性和优越性。

关键词: 气象预测, 多模态融合, Transformer, 轴向注意力