Manifold Extreme Learning Machine Autoencoder with Feature Representation

doi:10.3778/j.issn.1002-8331.2006-0007

Abstract

Extreme learning machine (ELM) is an effective classification algorithm with fast learning speed, high generalization performance, and good approximation ability. With the development of unsupervised learning, integrating ELM with autoencoders has become a new approach to extracting features from unlabeled data sets. For example, the extreme learning machine autoencoder (ELM-AE) is an unsupervised neural network that finds, without iteration, the main components representing the original samples and their learning process. It reconstructs the input signal to obtain the main features of the original samples, and it takes the global information of the original data into account to avoid information loss. However, methods of this type do not consider the intrinsic manifold structure of the data, that is, the neighborhood relationships among samples. Drawing on the idea of ELM-AE, this paper proposes a manifold-based extreme learning machine autoencoder algorithm (M-ELM). The algorithm is a nonlinear unsupervised feature extraction method that incorporates manifold learning to preserve the local information of the data, and it learns the similarity matrix during feature extraction rather than computing the similarity between samples with a fixed formula. Experiments are conducted on the IRIS data set, EEG data sets, and gene expression data sets. The accuracy obtained after k-means clustering is compared against that of other unsupervised methods (PCA, LPP, NPE, LE, and ELM-AE) to demonstrate the effectiveness of the algorithm.

Key words: extreme learning machine, extreme learning machine autoencoder, manifold learning, unsupervised learning, feature extraction
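
The non-iterative ELM-AE baseline mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used formulation: random orthogonalized input weights, a sigmoid hidden layer, and output weights obtained in closed form by ridge-regularized least squares so that the hidden layer reconstructs the input. The function name `elm_ae_features` and the parameters `n_hidden`, `C`, and `seed` are hypothetical and are not taken from the paper. The sketch covers only ELM-AE; the manifold term and similarity-matrix learning of M-ELM are not specified in the abstract and are not reproduced here.

```python
import numpy as np


def elm_ae_features(X, n_hidden, C=1e3, seed=0):
    """Illustrative ELM-AE feature extraction (assumed formulation).

    X        : (n_samples, n_features) data matrix
    n_hidden : number of hidden nodes, assumed <= n_features
    C        : ridge regularization strength (hypothetical default)
    """
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]

    # Random input weights with orthonormal columns (reduced QR),
    # plus a unit-norm random bias vector.
    W, _ = np.linalg.qr(rng.standard_normal((n_features, n_hidden)))
    b = rng.standard_normal(n_hidden)
    b /= np.linalg.norm(b)

    # Hidden-layer output with a sigmoid activation: (n_samples, n_hidden).
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))

    # Closed-form output weights that reconstruct X from H (no iteration):
    # beta = (I / C + H^T H)^{-1} H^T X, shape (n_hidden, n_features).
    beta = np.linalg.solve(np.eye(n_hidden) / C + H.T @ H, H.T @ X)

    # Project the data with the learned output weights to get the features.
    return X @ beta.T, beta


# Usage on synthetic data with the same shape as IRIS (150 samples, 4 features).
X_demo = np.random.default_rng(1).standard_normal((150, 4))
Z_demo, beta_demo = elm_ae_features(X_demo, n_hidden=2)
print(Z_demo.shape)
```

The resulting low-dimensional features could then be passed to k-means, which is the evaluation route the abstract describes for all compared methods.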

CHEN Yuan, CHEN Xiaoyun. Manifold Extreme Learning Machine Autoencoder with Feature Representation[J]. Computer Engineering and Applications, 2020, 56(17): 150-155.

