计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (24): 166-176.DOI: 10.3778/j.issn.1002-8331.2406-0301

• 模式识别与人工智能 • 上一篇    下一篇

融合大模型微调与图神经网络的知识图谱问答

陈俊臻,王淑营,罗浩然   

  1. 1.西南交通大学 计算机与人工智能学院,成都 611756
    2.北京邮电大学 计算机学院,北京 100876
  • 出版日期:2024-12-15 发布日期:2024-12-12

Combining Large Model Fine-Tuning and Graph Neural Networks for Knowledge Graph Question Answering

CHEN Junzhen, WANG Shuying, LUO Haoran   

  1. 1.School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu 611756, China
    2.School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Online:2024-12-15 Published:2024-12-12

摘要: 传统知识图谱问答系统在处理自然语言问句时,常因语义解析不精确而导致错误。为解决这一问题,提出一种融合大模型微调和图神经网络的知识图谱问答方法。收集问题并定义问题的逻辑形式;利用大型预训练语言模型的强大语义解析能力,通过对问题及其对应逻辑形式构成的问答对进行微调,提升问题解析的精度;采用模糊集方法增强微调后的逻辑形式,提高其检索精度;利用图神经网络对这些逻辑形式进行关系投影和逻辑运算获取最终答案。在通用领域标准数据集WebQSP和ComplexWebQuestions上的实验验证表明,该方法在F1、Hit@1和ACC这三个指标上均优于基准模型。同时,方法也在垂直领域风电装备数据集、高速列车数据集上进行了应用和验证。

关键词: 知识图谱问答, 大模型微调, 逻辑形式, 图神经网络检索

Abstract: To address the challenges posed by inaccurate semantic parsing in traditional knowledge graph question answering systems when processing natural language queries, this paper proposes a method that integrates large model fine-tuning with graph neural networks. The approach begins with the collection of questions and the definition of their corresponding logical forms. Leveraging the robust semantic parsing capabilities of large pre-trained language models, the accuracy of question parsing is significantly enhanced through fine-tuning on question-answer pairs, where each pair includes a question and its associated logical form. Subsequently, the fuzzy set method is applied to further refine the fine-tuned logical forms, improving retrieval precision. Finally, graph neural networks are employed to perform relation projection and logical operations on these enhanced logical forms to derive the final answers. Experimental validation on standard general-domain datasets, such as WebQSP and ComplexWebQuestions, demonstrates that this method surpasses baseline models in terms of F1, Hit@1, and ACC metrics. Additionally, the method has been successfully applied and validated on domain-specific datasets, including those related to wind power equipment and high-speed trains, confirming its effectiveness in specialized domains.

Key words: knowledge graph Q&, A, large model fine-tuning, logical form, graph neural network retrieval