计算机工程与应用 ›› 2017, Vol. 53 ›› Issue (4): 19-24.DOI: 10.3778/j.issn.1002-8331.1607-0331

• 热点与综述 • 上一篇    下一篇

基于双隐层极限学习机的模糊XML文档分类

赵  震1,2,马宗民1,张  富1,林晓庆1   

  1. 1.东北大学 计算机科学与工程学院,沈阳 110819
    2.渤海大学 信息科学与技术学院,辽宁 锦州 121013
  • 出版日期:2017-02-15 发布日期:2017-05-11

 Classification of fuzzy XML documents based on double hidden layer ELM

ZHAO Zhen1,2, MA Zongmin1, ZHANG Fu1, LIN Xiaoqing1   

  1. 1.School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
    2.College of Information Science and Technology, Bohai University, Jinzhou, Liaoning 121013, China
  • Online:2017-02-15 Published:2017-05-11

摘要: 随着大数据时代的到来,对异构和分布式的模糊XML数据管理显得越来越重要。在模糊XML数据的管理中,模糊XML文档的分类是关键问题。针对模糊XML文档的分类,提出采用双隐层极限学习机模型来实现模糊XML文档自动分类。这个模型可以分为两个部分:第一层采用极限学习机提取模糊XML文档的相应特征,第二层利用核极限学习机根据这些特征进行最终的模糊XML文档分类。通过实验验证了所提方法的性能优势。首先对主要的调节参数包括隐藏层节点的数目[L],常量[C]和核参数[γ]进行了研究,接下来的对比实验说明提出的基于双隐层ELM(Extreme Learning Machine)的方法相较于传统单隐层ELM以及SVM(Support Vector Machine)方法,分类精度得到较大提高,训练时间进一步缩减。

关键词: 模糊, XML文档, 分类, 特征提取, 极限学习机

Abstract:  With the arrival of the era of big data, the management of distributed and heterogeneous fuzzy XML data is also becoming more and more important. In the management of fuzzy XML data, the classification of fuzzy XML documents is the key problem. In order to study the classification for fuzzy XML documents, in this paper, a new ELM-based double hidden layer framework is proposed. The proposed architecture is divided into two main components:the feature extraction of fuzzy XML documents are performed using Extreme Learning Machine in first layer, and then use these characteristics to classify the fuzzy XML documents by KELM Kernel Extreme Learning Machine in second layer. Finally, the performance advantages of the proposed method are verified by experiments. Firstly, the parameters including the number of hidden neuron, and the constant parameter [C] and kernel parameter [γ] are investigated in detail. Compared with the traditional single hidden layer ELM(Extreme Learning Machine) and SVM(Support Vector Machine) method, the classification accuracy has been greatly improved and the training time has been decreased by approach based on the double hidden layer ELM proposed in this paper.

Key words: fuzzy, XML documents, classification, feature extracting, Extreme Learning Machine(ELM)