计算机工程与应用 ›› 2009, Vol. 45 ›› Issue (17): 125-128.DOI: 10.3778/j.issn.1002-8331.2009.17.038
• 数据库、信息处理 • 上一篇 下一篇
史东辉
收稿日期:
修回日期:
出版日期:
发布日期:
通讯作者:
SHI Dong-hui
Received:
Revised:
Online:
Published:
Contact:
摘要: 对统计数据的散度情况,即数据变异指标,进行了说明,变异指标可以使我们对数据的总体特征有更进一步的了解,进而对数据的分布情况有所了解,变异指标对发现数据中的离群数据有一定的作用。作者使用变异指标对基于偏差的离群数据的发现方法进行改进,改进后的算法适合于多维数值数据。
关键词: 统计变异, 离群数据, 偏差数据
Abstract: In this paper,the degree to which numeric data tend to spread is called the dispersion,or variance of the data.It allows us to make better understanding of the data’s overall features,and thus understanding the distribution of data which is used to find the outliers data.A kind of outlier mining methods is improved,based on the deviation of data.The improved algorithm is suitable for multi-dimensional numerical data.
Key words: statistical dispersion, outlier mining, deviation of data
史东辉. 使用统计变异指标研究离群数据挖掘方法[J]. 计算机工程与应用, 2009, 45(17): 125-128.
SHI Dong-hui. Research on methods of outlier data mining of using statistical dispersion[J]. Computer Engineering and Applications, 2009, 45(17): 125-128.
0 / 推荐
导出引用管理器 EndNote|Ris|BibTeX
链接本文: http://cea.ceaj.org/CN/10.3778/j.issn.1002-8331.2009.17.038
http://cea.ceaj.org/CN/Y2009/V45/I17/125