Computer Engineering and Applications ›› 2021, Vol. 57 ›› Issue (8): 1-9. DOI: 10.3778/j.issn.1002-8331.2012-0357

• Hot Topics and Reviews •

Survey of Interpretability Research on Deep Learning Models

ZENG Chunyan, YAN Kang, WANG Zhifeng, YU Yan, JI Chunmei

  1. Hubei Key Laboratory for High-efficiency Utilization of Solar Energy and Operation Control of Energy Storage System, Hubei University of Technology, Wuhan 430068, China
  2. Department of Digital Media Technology, Central China Normal University, Wuhan 430079, China
  3. Shantou Branch, China Mobile Group Guangdong Co., Ltd., Shantou, Guangdong 515041, China
  • Online: 2021-04-15    Published: 2021-04-23

Abstract:

With its data-driven learning paradigm, deep learning has achieved great success in natural language processing, image processing, speech recognition, and other fields. However, because deep learning models have very deep networks, large numbers of parameters, and high complexity, the decisions they make and the intermediate processes behind those decisions are difficult for humans to understand, so exploring the interpretability of deep learning has become a new research topic in artificial intelligence. Taking the interpretability of deep learning models as its subject, this survey summarizes the progress of the field. The main interpretability methods are reviewed and analyzed from four perspectives: self-explanatory models, model-specific explanations, model-agnostic explanations, and causal interpretability. Applications of interpretability-related techniques are then enumerated, and the open problems of current interpretability research are discussed together with an outlook, in order to promote the further development of a research framework for deep learning interpretability.

Key words: deep learning, interpretability, artificial intelligence, causal interpretability, self-explanatory
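
As a purely illustrative sketch, not drawn from the article itself, the following Python snippet shows the spirit of one of the four method families named above, model-agnostic explanation: it estimates per-feature importance for an arbitrary black-box classifier by perturbing one input feature at a time and measuring how much the predicted score changes. The `black_box_predict` function and all parameter names are hypothetical stand-ins, not part of the surveyed work.

```python
import numpy as np

def black_box_predict(x):
    """Hypothetical black-box model returning a score in [0, 1] for input vector x.
    It stands in for any trained deep network whose internals we cannot inspect."""
    w = np.array([0.8, -0.5, 0.1, 0.0])  # hidden "true" weights, unknown to the explainer
    return 1.0 / (1.0 + np.exp(-x @ w))

def perturbation_importance(predict_fn, x, noise=0.5, n_samples=200, seed=0):
    """Model-agnostic explanation by perturbation: the importance of feature i is the
    mean absolute change in the model output when only feature i is randomly perturbed."""
    rng = np.random.default_rng(seed)
    base = predict_fn(x)
    importances = np.zeros(len(x))
    for i in range(len(x)):
        for _ in range(n_samples):
            x_pert = x.copy()
            x_pert[i] += rng.normal(0.0, noise)  # perturb a single feature
            importances[i] += abs(predict_fn(x_pert) - base)
        importances[i] /= n_samples
    return importances

x = np.array([1.0, 2.0, -1.0, 0.5])
print(perturbation_importance(black_box_predict, x))
# Features with larger scores influence this particular prediction more strongly.
```

Widely used model-agnostic methods such as LIME and SHAP refine this basic idea by fitting a local surrogate model or computing Shapley values over such perturbations, rather than reporting raw output changes.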