Computer Engineering and Applications ›› 2013, Vol. 49 ›› Issue (15): 123-129.

• Databases, Data Mining, Machine Learning •

Continuous Probabilistic Skyline Queries in a Distributed Environment

樊明锁,汤志俊,陈华辉,钱江波,董一鸿   

  1. 宁波大学 信息科学与工程学院,浙江 宁波 315211
  • Publication date: 2013-08-01 Release date: 2013-07-31

Continuous probabilistic Skyline queries under distributed environment

FAN Mingsuo, TANG Zhijun, CHEN Huahui, QIAN Jiangbo, DONG Yihong   

  1. College of Information Science and Engineering, Ningbo University, Ningbo, Zhejiang 315211, China
  • Online:2013-08-01 Published:2013-07-31

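Abstract: Skyline computation is an important operation in multi-criteria decision making, data mining, and database visualization. As moving objects travel, the uncertainty of their position information makes the dominance relationships among local data points unstable, which in turn affects the global probabilistic Skyline set. This work studies the updating of continuous probabilistic Skyline queries over uncertain moving objects in a distributed environment and proposes CDPS-UMO, an effective continuous probabilistic Skyline query algorithm that reduces communication overhead. The algorithm tracks changes in the local probabilistic Skyline points at each local node. An effective sorting method and a feedback mechanism are proposed, which greatly reduce communication overhead and computation cost. A basic algorithm, naive, is also proposed and compared experimentally with CDPS-UMO; the experimental results demonstrate the effectiveness of the algorithm.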

Keywords: probabilistic Skyline, distributed database, uncertain data, dominance probability, moving objects

Abstract: Skyline computation plays a significant role in multi-criteria decision making, data mining, and database visualization. The position uncertainty of moving objects makes the dominance relationships among local data points unstable, which affects the global probabilistic Skyline set. This paper studies the updating of continuous probabilistic Skyline queries over uncertain moving objects in a distributed environment. An algorithm called CDPS-UMO is proposed to answer such queries while reducing communication cost. It tracks changes to the local probabilistic Skyline points at the local sites. A sort method (SM) and feedback rules are introduced to reduce communication and computation costs. A baseline algorithm, naive, is also proposed for comparison with CDPS-UMO. The experimental results show the effectiveness of the proposed algorithm.

Key words: probabilistic Skyline, distributed database, uncertain data, dominance probability, moving objects
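
The dominance relationship and the probabilistic Skyline mentioned in the abstract can be illustrated with a small sketch. It is not the CDPS-UMO algorithm, whose details are not given on this page. It only computes the commonly used skyline probability on a single node, under assumptions made for illustration: each uncertain object is a set of equally likely instances, objects are independent, and smaller values are better in every dimension. The names `dominates`, `skyline_probability`, and `probabilistic_skyline`, and the threshold parameter, are hypothetical.

```python
from math import prod
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a is no worse than b in every dimension and
    strictly better in at least one (assumption: smaller is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def skyline_probability(name: str, objects: Dict[str, List[Point]]) -> float:
    """Probability that the named object is not dominated by any other object.

    Assumption: every instance of an object is equally likely and objects
    are independent of each other.
    """
    instances = objects[name]
    total = 0.0
    for u in instances:
        # For each other object, take the chance that none of its
        # instances dominates u, then multiply across objects.
        total += prod(
            1.0 - sum(dominates(v, u) for v in other) / len(other)
            for other_name, other in objects.items()
            if other_name != name
        )
    return total / len(instances)


def probabilistic_skyline(objects: Dict[str, List[Point]], threshold: float) -> List[str]:
    """Names of objects whose skyline probability reaches the threshold."""
    return sorted(n for n in objects if skyline_probability(n, objects) >= threshold)


if __name__ == "__main__":
    # Hypothetical 2-D data: each object has two possible positions.
    objects = {
        "A": [(1.0, 4.0), (2.0, 2.0)],
        "B": [(3.0, 3.0), (1.0, 5.0)],
        "C": [(4.0, 4.0), (5.0, 1.0)],
    }
    for name in objects:
        print(name, skyline_probability(name, objects))
    print(probabilistic_skyline(objects, 0.6))
```

When an object moves, its instances change, so the skyline probabilities of the other objects may change too. This is why a continuous query has to keep tracking the local probabilistic Skyline points.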