Computer Engineering and Applications ›› 2024, Vol. 60 ›› Issue (6): 155-162.DOI: 10.3778/j.issn.1002-8331.2210-0071

• Pattern Recognition and Artificial Intelligence • Previous Articles     Next Articles

Medical Report Extraction Generation Model Integrated with BioCopy Mechanism

LIU Lan, TAN Hongye   

  1. 1.School of Computer and Information Technology, Shanxi University, Taiyuan 030006, China
    2.Key Laboratory of Ministry of Education Intelligence and Chinese Information Processing, Shanxi University, Taiyuan 030006, China
  • Online:2024-03-15 Published:2024-03-15

融入BioCopy机制的医疗报告抽取生成模型

刘岚,谭红叶   

  1. 1.山西大学 计算机与信息技术学院,太原 030006
    2.山西大学 计算智能与中文信息处理教育部重点实验室,太原 030006

Abstract: Wise information technology of med (WITMED) is a new health care service mode that integrates information technologies such as artificial intelligence. Among them, automatic generation of medical reports is an important task in the field of WITMED. This task generates semi-structured medical reports based on patient self-report and doctor-patient dialogue. The medical report not only contains the chief complaint and other sub parts, but also contains a large number of medical terms from the original text. In view of these characteristics, a summary model integrating extraction and abstraction of BioCopy mechanism is adopted. Firstly, the model extracts key sentences for each sub-part to eliminate the interference of irrelevant information. Then, the BioCopy mechanism is added when generating the medical report to copy the medical terms in the key sentences to ensure the accuracy of the results. The experimental results on CCL 2021 datasets show that this model is superior to the main baseline and has achieved good results.

Key words: automatic generation of medical reports, extraction and generation, BioCopy

摘要: 智慧医疗是融合了人工智能技术的新型健康医疗服务模式,其中医疗报告自动生成是智慧医疗领域的一项重要任务,该任务依据病人自述和医患对话,生成半结构化的医疗报告。医疗报告不仅包含主诉等多个子部分,而且包含大量来自原文的医疗术语。针对这些特点,采用了融入BioCopy机制的抽取与生成结合的摘要模型,模型首先对每个子部分进行关键句抽取,排除无关信息的干扰;然后在生成医疗报告时加入BioCopy机制以复制关键句中的医疗术语,保证结果的准确性。在CCL 2021相关数据集上的实验结果表明:该模型优于主要baseline,取得了较好的效果。

关键词: 医疗报告自动生成, 抽取与生成, BioCopy