### Automatic data preprocessing technology for Dendritic Cell Algorithm

DANG Huazheng, FANG Xianjin

1. School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, Anhui 232001, China
Online: 2014-10-01; Published: 2014-09-29

### Research on automatic data preprocessing technology for the DCA

1. School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, Anhui 232001

Abstract: The Dendritic Cell Algorithm (DCA) can process datasets that are large in terms of data size both efficiently and effectively. However, size is not the only concern when handling complex datasets; high dimensionality is often a bigger problem. Complexity arises at the data preprocessing stage of the DCA, where dimensionality reduction is required. Until now, data preprocessing for the DCA has been performed manually, based on users' expert knowledge of a given problem domain, which is time-consuming and sometimes difficult to achieve. This paper proposes automating data preprocessing for the DCA using Principal Component Analysis (PCA), which extracts and selects relevant features and adapts the algorithm to the characteristics of the underlying data. Applying PCA to the DCA on the KDDCUP'99 data set shows that the approach is feasible and generates useful and accurate classification results.
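As a rough illustration of the dimensionality-reduction step the abstract refers to, the following is a minimal sketch of PCA written with the Python standard library only. It is not the authors' implementation: the function names (`center`, `covariance`, `top_component`, `pca`), the use of power iteration with deflation, and the parameter `k` are all assumptions made for this example, and the step that maps the reduced features onto DCA input signals is not shown.

```python
import math
import random


def center(rows):
    """Subtract the per-column mean from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def covariance(centered):
    """Sample covariance matrix (d x d) of mean-centered rows."""
    n, d = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def top_component(cov, iters=500, seed=0):
    """Dominant eigenvector and eigenvalue of cov via power iteration."""
    rng = random.Random(seed)
    d = len(cov)
    v = [rng.random() + 0.1 for _ in range(d)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # no variance left in any direction
            break
        v = [x / norm for x in w]
    # Rayleigh quotient v^T C v gives the eigenvalue for a unit vector v.
    eigval = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d))
                 for i in range(d))
    return v, eigval


def pca(rows, k):
    """Project rows onto their first k principal components.

    Returns (projected rows, component vectors, explained variances).
    """
    centered = center(rows)
    cov = covariance(centered)
    d = len(cov)
    components, variances = [], []
    for _ in range(k):
        v, lam = top_component(cov)
        components.append(v)
        variances.append(lam)
        # Deflation: remove the variance explained by the component found.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    projected = [[sum(r[j] * c[j] for j in range(d)) for c in components]
                 for r in centered]
    return projected, components, variances
```

Calling `pca(rows, k)` on a list of numeric feature rows returns the rows expressed in `k` dimensions together with the variance each component explains. In practice a library routine based on a full eigendecomposition or SVD would usually be preferred; power iteration is used here only to keep the sketch dependency-free.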