Computer Engineering and Applications ›› 2021, Vol. 57 ›› Issue (24): 212-218. DOI: 10.3778/j.issn.1002-8331.2007-0385

• Graphics and Image Processing •


DR Classification Model Fusing Attention Mechanism and Multi-tasking Learning

XU Changzhuan, WU Yun, LAN Lin, HUANG Zimeng   

  1. School of Computer Science and Technology, Guizhou University, Guiyang 550025, China
  • Online:2021-12-15 Published:2021-12-13


Abstract:

In diabetic patients, Diabetic Retinopathy (DR) is the leading cause of blindness. To address the problem that fundus images contain tiny pathological features, such as microaneurysms, that are extremely difficult to detect, an attention mechanism module is proposed. The module fuses the original feature information of the feature map with the channel information obtained by an attention unit, so that the network assigns greater weight to tiny features, and then applies a division operation to remove redundant information from the feature map; the resulting attention features are used as the input to the two tasks. To address the facts that the Mean Square Error (MSE) loss is difficult to optimize and that the Cross Entropy (CE) loss does not consider the cost of misclassifying DR grades, a multi-task learning module is designed that performs a weighted fusion of the regression task's MSE loss and the classification task's CE loss. Based on these two modules, the Fusion of Attention mechanism and Multi-Tasking learning network (FAMT) is proposed. Experiments on the Kaggle dataset show that the Kappa of the FAMT network on the validation set is 2% higher than that of the network using only the regression task and 4% higher than that of the network using only the classification task; on the test set, the Kappa of FAMT is 1% higher than that of the EfficientNet network and 5% higher than that of the M2CNN network.
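
The abstract describes two concrete components: a channel-attention block whose output is combined with the original feature map and then passed through a division step to suppress redundancy, and a training objective that mixes a regression MSE term with a classification CE term. The sketch below is only a minimal PyTorch reading of that description, assuming a squeeze-and-excitation-style attention unit, a placeholder division step, and a hypothetical fixed weight alpha; none of these details are taken from the paper's actual implementation.

```python
# Hedged sketch of the two modules outlined in the abstract.
# ChannelAttention, multi_task_loss, and alpha are illustrative names only.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ChannelAttention(nn.Module):
    """Squeeze-and-excitation style channel attention unit (assumed form)."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):                        # x: (N, C, H, W)
        w = self.fc(x.mean(dim=(2, 3)))          # per-channel weights, (N, C)
        w = w.view(x.size(0), -1, 1, 1)
        attended = x * w                         # fuse original features with channel info
        # The abstract's "division operation" for removing redundancy is not
        # specified; this normalization is only a placeholder for it.
        return attended / (x.abs().mean(dim=1, keepdim=True) + 1e-6)

def multi_task_loss(reg_out, cls_out, target, alpha=0.5):
    """Weighted fusion of the regression MSE loss and the classification CE loss."""
    mse = F.mse_loss(reg_out.squeeze(1), target.float())
    ce = F.cross_entropy(cls_out, target)
    return alpha * mse + (1 - alpha) * ce

# Example usage with hypothetical shapes (batch of 4, 5 DR grades).
feats = torch.randn(4, 64, 32, 32)
out = ChannelAttention(64)(feats)                # (4, 64, 32, 32), fed to both task heads
reg_out, cls_out = torch.randn(4, 1), torch.randn(4, 5)
grades = torch.randint(0, 5, (4,))
loss = multi_task_loss(reg_out, cls_out, grades, alpha=0.5)
```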

Key words: diabetic retinopathy grading, deep learning, attention mechanism, multi-task learning, convolutional neural network