计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (7): 1-24.DOI: 10.3778/j.issn.1002-8331.2409-0300

• 热点与综述 • 上一篇    下一篇

基于大语言模型的智能问答系统研究综述

任海玉,刘建平,王健,顾勋勋,陈曦,张越,赵昌顼   

  1. 1.北方民族大学 计算机科学与工程学院,银川 750021
    2.北方民族大学 图像图形智能处理国家民委重点实验室,银川 750021
    3.中国农业科学院 农业信息研究所,北京 100081
  • 出版日期:2025-04-01 发布日期:2025-04-01

Research on Intelligent Question Answering System Based on Large Language Model

REN Haiyu, LIU Jianping, WANG Jian, GU Xunxun, CHEN Xi, ZHANG Yue, ZHAO Changxu   

  1. 1.College of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China
    2.The Key Laboratory of Images & Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan 750021, China
    3. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
  • Online:2025-04-01 Published:2025-04-01

摘要: 智能问答是自然语言处理中的一个核心的子领域,旨在理解并回答用户提出的自然语言问题的系统。传统的问答系统通常依赖于预定义的规则和有限的语料库,无法处理复杂的多轮对话。大语言模型是一种基于深度学习技术的自然语言处理模型,拥有数十亿甚至上千亿个参数,不仅能够理解和生成自然语言,还能显著提升问答系统的准确性和效率,推动智能问答技术的发展。近年来,基于大模型技术的智能问答逐渐成为研究热点,但对该领域的系统性综述仍然较为欠缺。因此,针对大模型的智能问答系统进行系统综述,介绍了问答系统的基本概念和数据集及其评价指标;介绍了基于大模型的问答系统,其中包括基于提示学习的问答系统、基于知识图谱的问答系统、基于检索增强生成的问答系统和基于智能代理的问答系统以及微调在问答任务中的技术路线,并对比了五种方法在问答系统中的优缺点和应用场景;对于当前基于大语言模型的问答系统面临的研究挑战和未来发展趋势进行了总结。

关键词: 大语言模型, 智能问答, 自然语言处理, 检索增强生成, 提示学习, 知识图谱

Abstract: Intelligent question answering is a core subfield in natural language processing, aiming at systems that understand and answer natural language questions posed by users. Traditional question answering systems usually rely on predefined rules and limited corpora and are unable to handle complex multi-round dialogues. Large language models are natural language processing models based on deep learning technology, with billions or even hundreds of billions of parameters. They can not only understand and generate natural language but also significantly improve the accuracy and efficiency of question answering systems, promoting the development of intelligent question answering technology. In recent years, intelligent question answering based on large model technology has gradually become a research hotspot, but a systematic review in this field is still relatively lacking. Therefore, this article conducts a systematic review of intelligent question answering systems based on large models. Firstly, it introduces the basic concepts of question answering systems, datasets, and their evaluation metrics. Secondly, it presents question answering systems based on large models, including those based on prompt learning, knowledge graphs, retrieval-augmented generation, and intelligent agents, as well as the technical route of fine-tuning in question answering tasks, and compares the advantages, disadvantages, and application scenarios of the five methods in question answering systems. Finally, it summarizes the current research challenges and future development trends of question answering systems based on large language models.

Key words: large language model, intelligent question-answering, natural language processing, retrieval-augmented generation, prompt learning, knowledge graph