Knowledge reduction algorithm for boundary region partition in cloud computing

Computer Engineering and Applications ›› 2015, Vol. 51 ›› Issue (24): 159-164.

Previous Articles Next Articles

Knowledge reduction algorithm for boundary region partition in cloud computing

CHANG Yuhui1，2, LV Ping1，2, QIAN Jin1，2

1.School of Computer Engineering, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China
2.Key Laboratory of Cloud Computing & Intelligent Information Processing of Changzhou City, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China

Online:2015-12-15 Published:2015-12-30

云计算下保持边界域划分的知识约简算法研究

常玉慧1，2，吕萍1，2，钱进1，2

1.江苏理工学院计算机工程学院，江苏常州 213001
2.江苏理工学院云计算与智能信息处理常州市重点实验室，江苏常州 213001

Abstract

Abstract: Knowledge reduction in rough set theory is the critical process of knowledge acquisition among data mining applications. Classical knowledge reduction algorithms assume all the datasets can be loaded into the main memory, while the existing parallel knowledge reduction algorithms only implement reduction tasks concurrently, which are infeasible for large-scale datasets. Massive data with high dimension makes attribute reduction a challenging task. To solve this problem, the concept of indiscernibility object pairs is defined and a new knowledge reduction algorithm for boundary region partition preserving is proposed. The relationship among these algorithms is illustrated in detail. Then, the parallelism strategies of data and task parallel are implemented and discussed. The corresponding attribute reduction framework model for boundary region partition preserving is presented. The experimental results demonstrate that knowledge reduction algorithms in cloud computing can efficiently process massive datasets on Hadoop platform.

Key words: cloud computing, rough set, knowledge reduction, data parallel, task parallel

摘要： 知识约简是数据挖掘应用中知识获取的重要步骤。经典的知识约简算法是一次性将小数据集装入内存中进行知识约简，而传统的并行知识约简仅仅利用任务并行来提高约简算法效率，都无法处理海量数据。通过分析经典的知识约简算法，构建了不可辨识的对象对，提出了保持边界域划分的知识约简算法，并探讨了保持边界域划分的知识约简算法之间的关系。深入剖析了知识约简算法中数据和任务同时并行的可行性，提出了云计算环境下保持边界域划分的知识约简算法框架模型，在Hadoop平台上构建了云计算环境并进行了相关实验。实验结果表明该知识约简算法可以处理海量数据集。

关键词: 云计算, 粗糙集, 知识约简, 数据并行, 任务并行

CHANG Yuhui1，2, LV Ping1，2, QIAN Jin1，2. Knowledge reduction algorithm for boundary region partition in cloud computing[J]. Computer Engineering and Applications, 2015, 51(24): 159-164.

常玉慧1，2，吕萍1，2，钱进1，2. 云计算下保持边界域划分的知识约简算法研究[J]. 计算机工程与应用, 2015, 51(24): 159-164.

[1]	WENG Xiaoyong. Research on Blockchain Based Cloud Computing Data Sharing System [J]. Computer Engineering and Applications, 2021, 57(3): 120-124.
[2]	GAO Tianyu, WANG Qingrong, YANG Lei. Data Mining Model Based on Attribute Dependability Enhancement of Rough Set [J]. Computer Engineering and Applications, 2021, 57(3): 87-93.
[3]	WANG Qingrong, MA Chenkun. Forecast of Emergency Supplies for Case Consumption Reasoning [J]. Computer Engineering and Applications, 2021, 57(22): 281-287.
[4]	TIAN Zhuojing, HUANG Zhenchun, ZHANG Yinong. Review of Task Scheduling Methods in Cloud Computing Environment [J]. Computer Engineering and Applications, 2021, 57(2): 1-11.
[5]	HU Heng, JIN Fenglin, LANG Siqi. Survey of Research on Computation Offloading Technology in Mobile Edge Computing Environment [J]. Computer Engineering and Applications, 2021, 57(14): 60-74.
[6]	LIU Yufeng, SUN Wenxin. Generalized Multi-granulation Quantization Soft Rough Set Model [J]. Computer Engineering and Applications, 2021, 57(12): 137-143.
[7]	LIU Guizhi. Incremental Attribute Reduction of Incomplete Hybrid Data Based on Dimension Change [J]. Computer Engineering and Applications, 2021, 57(12): 161-169.
[8]	YU Bo, TAI Xianqing, MA Zhijie. Study on Attribute and Trust-Based RBAC Model in Cloud Computing [J]. Computer Engineering and Applications, 2020, 56(9): 84-92.
[9]	TONG Le, HAO Rong, YU Jia. Secure Outsourcing Scheme for Bilinear Pairing Based on Single Untrusted Server [J]. Computer Engineering and Applications, 2020, 56(9): 131-135.
[10]	JIANG Jiao, CAI Linqin, WEI Pengcheng, LI Li. Aretrieval Scheme Supporting Verifiable Ciphertext Fuzzy Keyword [J]. Computer Engineering and Applications, 2020, 56(7): 74-80.
[11]	WANG Jie, CHEN Zhigang, LIU Jialing, CHENG Hongbing. Privacy Behavior Mining Technology for Cloud Computing Based on Clustering [J]. Computer Engineering and Applications, 2020, 56(5): 80-84.
[12]	ZHANG Bo, JIA Huayu, MA Jun. Estimation of Air Condition for Unmanned Aerial Vehicle Based on RS-GA Neural Network [J]. Computer Engineering and Applications, 2020, 56(4): 209-213.
[13]	MOU En, ZHANG Xianyong, YAO Yuesong, DENG Qie. Class-Specific Attribute Reduct and Its Heuristic Algorithm of Neighborhood Approximation Condition-Entropy [J]. Computer Engineering and Applications, 2020, 56(24): 175-180.
[14]	ZHANG Jianhua, LI Fangfang, YANG Lan. Research on Matching Supply and Demand of Case Knowledge Based on PFS and RS [J]. Computer Engineering and Applications, 2020, 56(23): 139-145.
[15]	LUO Gongzhi, MEI Tao. Multi-granulation Decision-Theoretic Rough Set Method Based on Supervisory Mechanism and Its Application [J]. Computer Engineering and Applications, 2020, 56(18): 214-220.

Knowledge reduction algorithm for boundary region partition in cloud computing

云计算下保持边界域划分的知识约简算法研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics