计算机工程与应用 ›› 2025, Vol. 61 ›› Issue (16): 146-159.DOI: 10.3778/j.issn.1002-8331.2412-0021

• 模式识别与人工智能 • 上一篇    下一篇

LLMs与ML优势互补:政务回复质量检测及可解释的算法框架

方岢愿,许珂维   

  1. 香港科技大学(广州) 创新创业与公共政策学域,广州 511400
  • 出版日期:2025-08-15 发布日期:2025-08-15

Complementary Strengths of LLMs and ML: Government Service Response Quality Detection and Explanation Algorithm Framework

FANG Keyuan, XU Kewei   

  1. Innovation, Policy and Entrepreneurship Thrust, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou 511400, China
  • Online:2025-08-15 Published:2025-08-15

摘要: 为了克服传统政务平台回复的人工质量检测难以满足庞大市民政务需求的问题,以及以文本特征为主的机器学习方法缺乏可解释性的问题,研究采用了一种结合大语言模型(large language models,LLMs)和机器学习(machine learning,ML)模型,并以非文本特征为主的政务回复质量检测算法框架。实验表明该算法能达到比其他单一使用LLMs或ML的方法更高的准确度,接近政务问答质量的人工判断结果,并且收敛速度快,训练成本更低。同时采用SHAP(Shapley additive explanations)框架提高了该方法的可解释性,结合八种非文本特征的重要性和对最终结果的贡献度,识别出影响政务问答质量的关键因素和特征作用机制,以此筛选得到高质量回复和提高平台服务质量,增进市民对政务问答平台的满意度和信任度。

关键词: 电子政务, 回复质量检测, 大语言模型, 机器学习, 特征工程

Abstract: To overcome the issues of the manual quality detection of digital government platform responses not being able to meet the vast demands of citizens for government services and the lack of interpretability in text-feature-based machine learning methods, this study adopts a government response quality detection algorithm framework that combines large language models (LLMs) with machine learning (ML) models, mainly based on non-text features. Experiments show that this method can achieve higher accuracy than other methods solely using LLMs or ML, approaching the results of manual evaluation on the quality of government responses, with faster convergence and lower training costs. The study also introduces the SHAP (Shapley additive explanations) model to enhance the interpretability of the proposed method, integrating the importance ranking of eight non-text features and their impact on the quality detection results, identifying the significant features that can improve the quality of government responses and feature mechanisms, which helps to efficiently select high-quality replies and improve the service quality of government platforms, promoting citizens?? trust in digital government platforms.

Key words: digital government, response quality detection, large language models, machine learning, feature engineering