Collaborative filtering algorithm based on clustering and random forests

doi:10.3778/j.issn.1002-8331.1712-0089

Abstract

Abstract: To handle the inefficiency problem of online recommendation of neighborhood-based collaborative filtering algorithms, this paper proposes a method to train a rating prediction model offline. The method firstly reduces the dimensions of the user vectors and the item vectors in the user-item rating matrix, and transforms this matrix so as to use supervised learning models. A random forest model is then trained by using the transformed data, and the online rating prediction is made by the previous trained model without the search of the nearest neighborhoods. The experiment results show that the method performs much better than neighborhood-based collaborative filtering algorithms in term of online recommendation efficiency without decreasing the precision of rating prediction.

Key words: collaborative filtering, recommendation algorithm, clustering, random forests

摘要： 针对基于邻近关系的协同过滤算法在线推荐效率低的问题，提出了一种可离线训练评分预测模型的算法。通过聚类算法降低用户-项目评分矩阵中用户向量和项目向量的维数，并对数据进行转换使其适用于监督模型；利用转换后的数据离线训练随机森林模型，在线推荐时只需根据随机森林模型的规则进行评分预测，无需查找最邻近用户或项目。实验结果表明，该算法在不降低评分预测精度的情况下，在线推荐效率远高于基于邻近关系的协同过滤算法。

关键词: 协同过滤, 推荐算法, 聚类, 随机森林

YANG Xingyu, LI Huaping, ZHANG Yubo. Collaborative filtering algorithm based on clustering and random forests[J]. Computer Engineering and Applications, 2018, 54(16): 152-157.

杨兴雨，李华平，张宇波. 基于聚类和随机森林的协同过滤推荐算法[J]. 计算机工程与应用, 2018, 54(16): 152-157.

[1]	LAN Hong, HUANG Min. Fusion of KNN Optimized Density Peaks and FCM Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(9): 81-88.
[2]	ZHANG Qishan, CHEN Lulu. Slope One Algorithm Based on Grey Correlational Analysis by Method of Degree of Balance and Approach [J]. Computer Engineering and Applications, 2021, 57(9): 96-102.
[3]	WANG Yonggui, LI Qianyu. Hybrid Collaborative Filtering Recommendation Algorithm Based on KNN-GBDT [J]. Computer Engineering and Applications, 2021, 57(9): 103-108.
[4]	GUO Xiaojing, SUI Haoda. Application of Improved YOLOv3 in Foreign Object Debris Target Detection on Airfield Pavement [J]. Computer Engineering and Applications, 2021, 57(8): 249-255.
[5]	LI Li, JI Xinyuan, SONG Song. Prediction Model for Number of Software Defects in Loop [J]. Computer Engineering and Applications, 2021, 57(7): 158-163.
[6]	HUO Guangyu, ZHANG Yong, SUN Yanfeng, YIN Baocai. Research on Archive Data Intelligent Classification Based on Semantic [J]. Computer Engineering and Applications, 2021, 57(6): 247-253.
[7]	YANG Fang, YIN Xi, SI Jianhui, LIU Hongyuan, WANG Xue. Mathematical Expression Similarity Calculation Method Based on Focus Clustering [J]. Computer Engineering and Applications, 2021, 57(6): 88-93.
[8]	ZHAO Fan, ZHANG Lin, WEN Zhiquan, YANG Linlin, LIN Guangfeng. Direct and Efficient Natural Scene Chinese Character Approaching Spotting Method [J]. Computer Engineering and Applications, 2021, 57(6): 159-167.
[9]	YANG Yemin, ZHANG Huijun, ZHANG Xiaolong. Research on Interpretable Visual Analysis Method of Random Forest [J]. Computer Engineering and Applications, 2021, 57(6): 168-175.
[10]	PENG Qihui, XUAN Shibin, GAO Qing. Distribution Automatic Threshold Density Peak Clustering Algorithm [J]. Computer Engineering and Applications, 2021, 57(5): 71-78.
[11]	LI Yongzhen, LIAO Husheng. Multi-view Clustering via Graph Convolutional Neural Network [J]. Computer Engineering and Applications, 2021, 57(5): 115-122.
[12]	WANG Changlong, ZHANG Yuandong, MIAO Hong, YANG Yuheng. Application of Double Channel Convolutional Neural Network in Pumpkin Diseases Identification [J]. Computer Engineering and Applications, 2021, 57(5): 183-189.
[13]	HU Xiaomin, WANG Mingfeng, ZHANG Shourong, LI Min. New Differential Evolution with Particle Swarm Optimization Algorithm for Text Clustering [J]. Computer Engineering and Applications, 2021, 57(4): 61-67.
[14]	WANG Junling, LU Xinming. Video Key Frame Extraction Algorithm Based on Semantic Correlation [J]. Computer Engineering and Applications, 2021, 57(4): 192-198.
[15]	WANG Fuyin, ZHANG Desheng, ZHANG Xiao. Adaptive Density Peaks Clustering Algorithm Combining with Whale Optimization Algorithm [J]. Computer Engineering and Applications, 2021, 57(3): 94-102.

Collaborative filtering algorithm based on clustering and random forests

基于聚类和随机森林的协同过滤推荐算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics