Vector data index method supporting efficient parallel compute

doi:10.3778/j.issn.1002-8331.1603-0223

Computer Engineering and Applications, 2017, Vol. 53, Issue 11: 79-84.


CHU Longxian 1,3; LI Xiaoying 2,3; CHEN Xu 3; CHU Chunjie 4

1. School of Software, Pingdingshan University, Pingdingshan, Henan 467000, China
2. Campus of Nanning, Guilin University of Technology, Nanning 530001, China
3. State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, China
4. School of Resources and Environmental Science, Pingdingshan University, Pingdingshan, Henan 467000, China

Online: 2017-06-01. Published: 2017-06-13.


Abstract

By analyzing the HBase storage model and the parallel computing mechanism of Spark, this paper proposes a method for distributed storage, indexing, and parallel regional querying of vector spatial data. A row key storage scheme based on the center point of each spatial object is designed: it combines the Hilbert code of the center point with the decimal places of its longitude and latitude, which makes the row key unique and ensures that features close together geographically are stored in adjacent rows. A Spark-based method for parallel spatial index construction and regional query is implemented; it builds the index quickly from the Hilbert codes of the objects' center points and filters query results by the minimum bounding rectangle of the polygonal query region. Experimental results show that parallel index construction is reliable and fast, and that the parallel regional query algorithm is feasible and efficient.

Key words: Spark, Hilbert, vector data, spatial index, distributed storage
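
The row key scheme and the bounding-rectangle filter described in the abstract can be illustrated with a small sketch. Everything specific here is an assumption made for illustration, since the abstract does not give these details: the grid order, the key layout (zero-padded Hilbert code, underscore, fractional digits of longitude and latitude), and all function names. HBase and Spark are left out so that the block runs on its own.

```python
def hilbert_index(order, x, y):
    """Position of grid cell (x, y) along a Hilbert curve on a 2^order x 2^order grid."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if x & s else 0
        ry = 1 if y & s else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate or flip the quadrant so the sub-curve has the standard orientation.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s >>= 1
    return d


def row_key(lon, lat, order=16, digits=6):
    """Hypothetical row key: padded Hilbert code of the center point plus fractional digits."""
    n = 1 << order
    # Map the center point to a grid cell (assumed global lon/lat extent).
    col = min(n - 1, int((lon + 180.0) / 360.0 * n))
    row = min(n - 1, int((lat + 90.0) / 180.0 * n))
    code = hilbert_index(order, col, row)
    # Zero-pad so that lexicographic key order matches numeric Hilbert order.
    width = len(str(n * n - 1))
    lon_frac = f"{abs(lon):.{digits}f}".split(".")[1]
    lat_frac = f"{abs(lat):.{digits}f}".split(".")[1]
    return f"{code:0{width}d}_{lon_frac}{lat_frac}"


def mbr(polygon):
    """Minimum bounding rectangle (min_x, min_y, max_x, max_y) of a list of (x, y) vertices."""
    xs = [p[0] for p in polygon]
    ys = [p[1] for p in polygon]
    return min(xs), min(ys), max(xs), max(ys)


def in_mbr(x, y, box):
    """Cheap pre-filter: True if the point lies inside the rectangle (edges included)."""
    min_x, min_y, max_x, max_y = box
    return min_x <= x <= max_x and min_y <= y <= max_y
```

In this sketch the fractional digits only make collisions between objects in the same grid cell less likely; how the paper guarantees uniqueness is not stated in the abstract. The rectangle test is a coarse filter, so candidates that pass it would still need an exact point-in-polygon check.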

CHU Longxian, LI Xiaoying, CHEN Xu, CHU Chunjie. Vector data index method supporting efficient parallel compute[J]. Computer Engineering and Applications, 2017, 53(11): 79-84.

