Computer Engineering and Applications ›› 2020, Vol. 56 ›› Issue (17): 93-99.DOI: 10.3778/j.issn.1002-8331.1908-0298

Previous Articles     Next Articles

Application Research of Deep Auto Encoder in Data Anomaly Detection

ZHANG Changhua, ZHOU Xiongtu, ZHANG Yong’ai, YAO Jianmin, GUO Tailiang, YAN Qun   

  1. 1.College of Physics and Information Engineering, Fuzhou University, Fuzhou 350108, China
    2.RichSense Electronic Technology Co., Ltd., Jinjiang, Fujian 362200, China
  • Online:2020-09-01 Published:2020-08-31



  1. 1.福州大学 物理与信息工程学院,福州 350108
    2.博感电子科技有限公司,福建 晋江 362200


Normal data for training is usually required in Auto Encoder(AE) network, which limits its applications in data anomaly detection. This paper proposes an unsupervised data anomaly detection method based on a Deep Auto Encoder(DAE) network model. In this model, Principal Components Analysis(PCA) is introduced, and the anomaly data is isolated by differencing each reconstruction output of AE and the input data. That is, the input data is divided into normal data and anomaly data, where the normal data is reconstructed via the AE network, and the anomaly data is optimized before outputting. Finally, the whole model is trained by the Alternating Direction Method of Multipliers(ADMM), and the results are outputted when the predetermined number of training times is sucessfully achieved. The DAE model is compared with eight machine learning models and AE model in seven real datasets. The results show that the DAE model can effectively carry out model training without inputting normal data and prevent model from overfitting, and the overall performances are better than those using the traditional machine learning model and AE model. The AUC values of DAE model are optimal in 4 datasets, among which, the AUC value obtained from the DAE model is 10.93% higher than that from the Isolated Forest(IF) method in the mnist datasets.

Key words: data anomaly detection, auto encoder network, Deep Auto Encoder network(DAE), Area Under the Curve(AUC)



关键词: 数据异常检测, 自编码网络, 深度自编码网络, 曲线下面积(AUC)