OLAP cache mechanism based on naive Bayesian

doi:10.3778/j.issn.1002-8331.1508-0105

Abstract

Abstract: In the era of the big data, cache can be seen as one of the most effective ways to enhance data processing technique, and therefore it is widely researched. The majority of cache mechanism saves the query results as the file, thus there is nearly no way to reuse the partial data in the cache under specific situations, and consequently cache resources are wasted. Based on learning the cache techniques both here and abroad, this project designs one data warehouse cache mechanism by using incremental learning naive Bayesian algorithm. This cache mechanism can decide whether to cache the current query results according to users’ recent operations, and ultimately can increase the hit rate of cache. Finally, the results of the experiment illustrate the effectiveness and efficiency of this cache mechanism by analyzing both average query time and the hit rate of cache.

Key words: On-Line Analytical Processing（OLAP）, cache, On-Line Analytical Processing（OLAP） cache, naive Bayesian algorithm, caching mechanism, data warehouse

摘要： 大数据时代，缓存作为一种提高数据处理性能的有效技术而被广泛研究。目前大多数缓存机制将查询结果以文件的形式保存了下来，命中率较低，造成了缓存资源的浪费。以国内外的缓存技术为基础，结合用户的查询习惯，借助增量朴素贝叶斯算法设计了一种新的数据仓库缓存机制，此缓存机制可根据用户的操作习惯判断每次查询的结果是否需要被缓存，以此提高缓存命中率。并通过实验从平均查询时间以及缓存命中率两方面验证了该缓存机制的有效性。

关键词: 联机分析处理（OLAP）, 缓存, 联机分析处理（OLAP）缓存, 朴素贝叶斯算法, 缓存机制, 数据仓库

MAN Yi, ZHANG Jiongmin, XU Xiaojin. OLAP cache mechanism based on naive Bayesian[J]. Computer Engineering and Applications, 2017, 53(6): 85-90.

满毅，章炯民，徐晓锦. 一种基于朴素贝叶斯算法的OLAP缓存机制[J]. 计算机工程与应用, 2017, 53(6): 85-90.

[1]	SUN Ming, CHEN Xin. Design Method of Convolutional Neural Network Accelerator [J]. Computer Engineering and Applications, 2021, 57(13): 77-84.
[2]	AN Weipeng, CHENG Xiaobo, LIU Yu. Application of Fleiss’ Kappa Coefficient in Bayesian Decision Tree Algorithm [J]. Computer Engineering and Applications, 2020, 56(7): 137-140.
[3]	WANG Huiyu, YANG Wang. Cache Design of Streaming Application Distribution System Based on User Behavior [J]. Computer Engineering and Applications, 2020, 56(4): 37-43.
[4]	CHAI Xiaofei, LIU Song, QU Bin, WANG Qian, WU Weiguo. Vectorization-Friendly Tile Size Selection Algorithm [J]. Computer Engineering and Applications, 2020, 56(15): 37-42.
[5]	AN Likui, HAN Liyan. Instruction Prefetching and Cache Partitioning for Multicore Cache WCEC Optimization [J]. Computer Engineering and Applications, 2020, 56(1): 69-75.
[6]	XU Qi1，2, WANG Cong1，2, CHENG Yaodong1, CHEN Gang1. Cross-Domain File System for Distributed Sites [J]. Computer Engineering and Applications, 2019, 55(8): 1-8.
[7]	ZHOU Linteng, LIU Ming. Cloud to Side Hybrid Caching Strategy for Content Heterogeneous 5G Wireless Networks [J]. Computer Engineering and Applications, 2019, 55(6): 94-100.
[8]	LIU Qilie, LI Jianxiong. Multiple Parameter Detection Algorithm of Cache Pollution Attack in Content Centric Networking [J]. Computer Engineering and Applications, 2019, 55(4): 130-136.
[9]	DAI Min. Web Cache Replacement Strategy Based on NB Classifier for Re-access Probability Prediction [J]. Computer Engineering and Applications, 2019, 55(19): 134-140.
[10]	SUN Dandan1，2, LUO Yonglong1，2, FAN Guoting1，2, GUO Liangmin1，2, ZHENG Xiaoyao1，2. Location privacy-preserving method against attacks under P2P communication [J]. Computer Engineering and Applications, 2018, 54(9): 75-83.
[11]	HU Sensen, SU Jiafu. Optimizing adaptive run-time reconfigurable cache [J]. Computer Engineering and Applications, 2018, 54(4): 25-30.
[12]	MA Zhen, HALIDAN Abudureyimu, LI Xitong. Research on access optimization of small files in massive sample data sets [J]. Computer Engineering and Applications, 2018, 54(22): 80-84.
[13]	JIANG Lin, CUI Pengfei, SHAN Rui, WU Xin, TIAN Rujia. Design of distributed memory architecture for video array processor [J]. Computer Engineering and Applications, 2018, 54(12): 57-62.
[14]	LIU Jiaxing, CHEN Feixiang, CHEN Xinghan . Tile cache strategy based on geographic unit heat [J]. Computer Engineering and Applications, 2017, 53(5): 90-96.
[15]	MIAO Xiaolong1, CHEN Hao1, ZHONG Jiang2. Energy-conserving strategies of file storage based on cluster scale adjustment [J]. Computer Engineering and Applications, 2017, 53(24): 80-85.

OLAP cache mechanism based on naive Bayesian

一种基于朴素贝叶斯算法的OLAP缓存机制

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics