Computer Engineering and Applications ›› 2025, Vol. 61 ›› Issue (12): 120-128.DOI: 10.3778/j.issn.1002-8331.2409-0336

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Retrieval?Augmented Question?Answering Generation Based on Question?Oriented Prompt Learning and Multi-Channel Reasoning

WANG Yuting, CHEN Bo, YAN Qiang, FAN Yixing, YU Zhihua, GUO Jiafeng   

  1. 1.CAS Key Laboratory of Network Data Science & Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
    2.University of Chinese Academy of Sciences, Beijing 100049, China
  • Online:2025-06-15 Published:2025-06-13

基于问题导向提示学习和多路推理的检索增强生成问答

王昱婷,陈波,闫强,范意兴,余智华,郭嘉丰   

  1. 1.中国科学院 计算技术研究所 中国科学院网络数据科学与技术重点实验室,北京 100190
    2.中国科学院大学,北京 100049

Abstract: The current large language models have important applications in various fields due to their superior performance. However, large language models have problems such as outdated knowledge, insufficient knowledge, and incorrect generated results. Retrieval augmented generation alleviates these issues by providing accurate and timely professional external knowledge input to large language models. However, how to improve the accuracy of generated answers is still a hot problem worth paying attention to. This paper designs question-oriented prompts and optimizes the prompt engineering to effectively stimulate the understanding ability of large language models for different types of questions, and fully utilizes external information to improve the accuracy of retrieval augmented generation in answering question and answer tasks for different types of questions. This paper uses auxiliary multi-way reasoning to optimize prompts and generate answer summaries to reduce the illusion of model generated answers. The experimental results on simple questions, comparison questions, set questions, multi-hop questions, and total data in the English retrieval augmented generation benchmark dataset show that the proposed method can achieve good implementation results.

Key words: retrieval-augmented generation, prompt learning, large language model

摘要: 当前大语言模型因其优越的性能,在各个领域都有着重要的应用。但大语言模型存在知识过时、知识不充分及生成结果错误等问题。检索增强生成通过给大语言模型输入精准及时的专业外部知识缓解了这些问题。然而,如何提高生成答案的准确性仍是值得关注的热点问题。设计问题导向提示,通过优化提示工程,有效激发了大语言模型对于不同类型问题的理解能力,并充分利用外部信息,提高检索增强生成在回答不同类型问题的问答任务的准确性。同时使用辅助多路推理优化提示与生成答案总结降低模型生成答案的幻觉。在英文检索增强生成基准数据集中的简单问题、比较问题、集合问题与多跳问题以及总体数据进行的实验结果表明,提出的方法能够取得比较好的实现效果。

关键词: 检索增强生成, 提示学习, 大语言模型