Research on Logistics Path Frequent Patterns Based on Parallel Apriori

doi:10.3778/j.issn.1002-8331.1803-0236

Abstract

Abstract: The traditional method of frequent path mining analysis is realized by the association rule algorithm. However, when dealing with large data sets, the traditional association rules algorithm will take up too much memory and process data slowly. In this paper, a parallel Apriori algorithm based on Fuzzy [c]-means clustering algorithm is proposed. The model performs clustering analysis of the original data set by Fuzzy [c]-means algorithm, divides the logistics path data which is considered as the same district into a data cluster with high similarity. Then the model utilizes the Apriori algorithm to mine the frequent paths in this district, so as to obtain the frequent logistics path of each area. Meanwhile, the algorithm is parallelized through the Hadoop platform, which can effectively improve the efficiency and the quality of the algorithm. Through the analysis of the frequent path of logistics, managers can better understand the flow of goods and make the decision of the optimization of the delivery path.

Key words: big data, frequent path, Hadoop, Fuzzy [c]-means clustering algorithm, Apriori algorithm

摘要： 传统的频繁路径挖掘分析主要通过关联规则算法实现，但其在处理大型数据集时，会产生占用内存过多，数据处理速度慢等问题，对此提出一种基于Fuzzy [c]-means聚类算法的并行Apriori算法模型。该模型通过Fuzzy [c]-means算法完成对原始数据集的聚类分析，将同一区域的物流路径数据划分到内部相似度较高的数据类，并利用Apriori算法对各数据类中的频繁模式进行挖掘分析，进而获得各区域的物流频繁路径。同时通过Hadoop平台实现算法的并行化，有效提高算法运行效率和质量。通过对物流频繁路径的挖掘分析，使管理者更清楚货物流向，可为配送路径优化等决策提供支持。

关键词: 大数据, 频繁路径, Hadoop, Fuzzy [c]-means聚类算法, Apriori算法

CAO Jingjing1, REN Xinxin2, XU Xianhao2. Research on Logistics Path Frequent Patterns Based on Parallel Apriori[J]. Computer Engineering and Applications, 2019, 55(11): 257-264.

曹菁菁1，任欣欣2，徐贤浩2. 基于并行Apriori的物流路径频繁模式研究[J]. 计算机工程与应用, 2019, 55(11): 257-264.

[1]	WU Hao, XU Xingjian, MENG Fanjun. Knowledge Graph-Assisted Multi-task Feature-Based Course Recommendation Algorithm [J]. Computer Engineering and Applications, 2021, 57(21): 132-139.
[2]	WU Dongyang, DOU Jianping, LI Jun. Design of Digital Twin System for Quadrotor [J]. Computer Engineering and Applications, 2021, 57(16): 237-244.
[3]	LI Leixiao, DENG Dan, LI Jie, WANG Yongsheng. All-to-All Comparison Computing Data Distribution Strategy Based on Particle Swarm Optimization [J]. Computer Engineering and Applications, 2021, 57(15): 109-117.
[4]	LI Ling, GU Xiaomei, LIU Zihao. Application Research of Multi-subdomain Random Forest in Context-Aware Recommendation [J]. Computer Engineering and Applications, 2020, 56(22): 132-141.
[5]	WANG Yonggui, GUO Xintong. Efficient Frequent Set Mining Algorithm for Adaptive Data Sets on SparkSql [J]. Computer Engineering and Applications, 2020, 56(21): 72-78.
[6]	ZHANG Meng, SUN Bingzhen, CHU Xiaoli. Gout Diagnosis Model Based on Neighborhood Cost Sensitive Three-Way Decision [J]. Computer Engineering and Applications, 2020, 56(16): 218-225.
[7]	WU Yangyang, TANG Jianguo. Research Progress of Attribute Reduction Based on Rough Set in Context of Big Data [J]. Computer Engineering and Applications, 2019, 55(6): 31-38.
[8]	YANG Zhen, GENG Xiuli. Research on Mining Association Rules Based on Multi-Granularity Attribute Reduction [J]. Computer Engineering and Applications, 2019, 55(6): 133-139.
[9]	LIU Jun, LI Wei, WU Mengting, CHEN Qifeng. New Design of Image Parallel Processing Model Based on Hadoop Platform [J]. Computer Engineering and Applications, 2019, 55(6): 186-190.
[10]	WANG Jingyu, LUAN Junqing, TAN Yuesheng. Research on Big Data Access Control Model Based on Data Sensitivity [J]. Computer Engineering and Applications, 2019, 55(23): 70-77.
[11]	HOU Yu1，2, QIN Xiaolin2, PENG Haoyue1，2, ZHANG Lige1，2. Feature Selection Based on Global Pitch Adjusting Harmony Search Algorithm [J]. Computer Engineering and Applications, 2019, 55(2): 21-27.
[12]	CAO Weidong1，2, XU Daidai2, WANG Jing2, WANG Jialiang2. NOSHOW Prediction and Strong Factor Association Analysis in Civil Aviation [J]. Computer Engineering and Applications, 2019, 55(2): 221-227.
[13]	WANG Dexian, HE Xianbo, HE Chunlin, ZHOU Kun, CHEN Minzhi. Latent Factor Prediction Model Combining L1 and L2 Regularization Constraints [J]. Computer Engineering and Applications, 2019, 55(19): 121-127.
[14]	WANG Yuan, PENG Chenhui, WANG Zhiqiang, FAN Qiang, YAO Yiyang, HUA Zhaoyun. Application of Knowledge Graph in Full-Service Unified Data Center of National Grid [J]. Computer Engineering and Applications, 2019, 55(15): 104-109.
[15]	LI Yufan1, ZHANG Huifu2, LIU Shangli2, TANG Bing1. Research Progress on Educational Data Mining [J]. Computer Engineering and Applications, 2019, 55(14): 15-23.

Research on Logistics Path Frequent Patterns Based on Parallel Apriori

基于并行Apriori的物流路径频繁模式研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics