计算机工程与应用 ›› 2017, Vol. 53 ›› Issue (24): 64-68.DOI: 10.3778/j.issn.1002-8331.1611-0153

• 大数据与云计算 • 上一篇    下一篇

基于数学形态学的时间序列相似性度量研究

臧艺超1,邱  菡1,2,周天阳1,2,朱俊虎1,2   

  1. 1.信息工程大学 数学工程与先进计算国家重点实验室,郑州 450001
    2.国家数字交换系统工程技术研究中心,郑州 450001
  • 出版日期:2017-12-15 发布日期:2018-01-09

Mathematical morphology based time series similarity measure

ZANG Yichao1, QIU Han1,2, ZHOU Tianyang1,2, ZHU Junhu1,2   

  1. 1.State Key Laboratory of Mathematical Engineering and Advanced Computing, Information & Engineering University, Zhengzhou 450001, China
    2.National Engineering Technology Research Center of the National Digital Switching System, Zhengzhou 450001, China
  • Online:2017-12-15 Published:2018-01-09

摘要: 时间序列相似性度量在挖掘时间序列模式,提取时间序列关联关系上发挥着重要作用。分析了当前主流的时间序列相似性度量算法,分别指出了各度量算法在度量时序数据相似性时存在的缺陷,并提出了基于数学形态学的时间序列相似性度量算法。通过将归一化的时间序列二值图像化表示,再引入了图像处理领域中的膨胀、腐蚀操作对时序数据进行形态变换分析,提高相似时序数据部分的抗噪性,同时又不降低时序数据非相似部分间的差异度,实现时序数据相似性度量分类精度的提高。在八种时间序列测试数据集合上进行分类实验,实验结果表明提出的基于数学形态学的时间序列相似性度量算法在时间序列分类精度上得到有效改善,相比于DTW相似性度量算法,分类精度平均水平提升了8.74%,最高提升20%。

关键词: 数学形态学, 时间序列, 相似性度量

Abstract: Similarity measure is a keystone in mining time series data. Numerous methods have been proposed to deal with it over the past decades. This paper summarizes the mainstream similarity measure algorithms so far, points out the defects within each of them. A newly mathematical morphology based similarity measure method is proposed to overcome the low discriminated precision problem. The core part of the mentioned method is dilation and erosion operation, which can strengthen the noise-resistance performance while keeping the difference among different time series at the same time, improving the precision of measurement. The experiment is tested on 8 dataset using KNN classification as evaluation metrics, which turns out that the proposed method improves at most 20% in classification precision, compared with DTW algorithm.

Key words: mathematical morphology, time series, similarity measure