计算机工程与应用 ›› 2024, Vol. 60 ›› Issue (10): 47-60.DOI: 10.3778/j.issn.1002-8331.2308-0014
高广尚
出版日期:
2024-05-15
发布日期:
2024-05-15
GAO Guangshang
Online:
2024-05-15
Published:
2024-05-15
摘要: 探讨神经网络如何结合注意力机制及其变种,以更好地学习用户和物品间复杂和隐含的关系,从而提高推荐的准确性和个性化水平。从多层感知机、卷积神经网络、循环神经网络、自编码器、图神经网络以及反向传播神经网络这六类典型神经网络出发,研究它们与注意力机制相结合进行推荐的过程,具体结合点击率预测、标签推荐和评论评分预测等典型应用场景进行优缺点分析。通过将神经网络与注意力机制相结合,模型能够聚焦于输入数据中的关键信息,降低对次要信息的注意程度,甚至直接过滤掉无关信息。现有将注意力机制与神经网络结合的推荐模型,在很大程度上能够满足常见的推荐任务需求。但是这类模型在跨域推荐、深度强化学习推荐以及多模态推荐等复杂推荐场景中,仍面临一些挑战,例如跨域推荐需要模型具备迁移学习的能力,强化学习推荐需要进行长期奖励建模。
高广尚. 推荐系统中神经网络结合注意力机制研究综述[J]. 计算机工程与应用, 2024, 60(10): 47-60.
GAO Guangshang. Review of Research on Neural Network Combined with Attention Mechanism in Recommendation System[J]. Computer Engineering and Applications, 2024, 60(10): 47-60.
[1] 王喆. 深度学习推荐系统[M]. 北京: 电子工业出版社, 2020. WANG Z. Deep learning recommender systems[M]. Beijing: Electronics Industry Press, 2020. [2] 黄昕, 赵伟, 王本友. 推荐系统与深度学习[M]. 北京: 清华大学出版社, 2019. HUANG X, ZHAO W, WANG B Y. Recommender systems and deep learning[M]. Beijing: Tsinghua University Press, 2019. [3] ROY D, DUTTA M. A systematic review and research perspective on recommender systems[J]. Journal of Big Data, 2022, 9(1): 59. [4] XU K, BA J, KIROS R, et al. Show, attend and tell: neural image caption generation with visual attention[C]//Proceedings of the 32nd International Conference on Machine Learning, 2015: 2048-2057. [5] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]//Advances in Neural Information Processing Systems 30, 2017: 5998-6008. [6] JHAMB Y, EBESU T, FANG Y. Attentive contextual denoising autoencoder for recommendation[C]//Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018: 27-34. [7] TAY Y, LUU A T, HUI S C. Multi-pointer co-attention networks for recommendation[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018: 2309-2318. [8] SOYDANER D. Attention mechanism in neural networks: where it comes and where it goes[J]. Neural Computing and Applications, 2022, 34(16): 13371-13385. [9] 朱张莉, 饶元, 吴渊, 等. 注意力机制在深度学习中的研究进展[J]. 中文信息学报, 2019, 33(6): 1-11. ZHU Z L, RAO Y, WU Y, et al. Research progress of attention mechanism in deep learning[J]. Journal of Chinese Information Processing, 2019, 33(6): 1-11. [10] ZHANG S, YAO L, SUN A, et al. Deep learning based recommender system: a survey and new perspectives[J]. ACM Computing Surveys, 2019, 52(1): 5. [11] 黄立威, 江碧涛, 吕守业, 等. 基于深度学习的推荐系统研究综述[J]. 计算机学报, 2018, 41(7): 1619-1647. HUANG L W, JIANG B T, LV S Y, et al. Survey on deep learning based recommender systems[J]. Chinese Journal of Computers, 2018, 41(7): 1619-1647. [12] XIAO J, YE H, HE X, et al. Attentional factorization machines: learning the weight of feature interactions via attention networks[J]. arXiv:1708.04617, 2017. [13] ZHOU G, ZHU X, SONG C, et al. Deep interest network for click-through rate prediction[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018: 1059-1068. [14] 吴俊杰, 刘冠男, 王静远, 等. 数据智能: 趋势与挑战[J]. 系统工程理论与实践, 2020, 40(8): 2116-2149. WU J J, LIU G N, WANG J Y, et al. Data intelligence: trends and challenges[J]. Systems Engineering-Theory & Practice, 2020, 40(8): 2116-2149. [15] MU R. A survey of recommender systems based on deep learning[J]. IEEE Access, 2018, 6: 69009-69022. [16] FANG H, GUO G, ZHANG D, et al. Deep learning-based sequential recommender systems: concepts, algorithms, and evaluations[C]//Proceedings of the 19th International Conference on Web Engineering. Cham: Springer, 2019: 574-577. [17] DA’U A, SALIM N. Recommendation system based on deep learning methods: a systematic review and new directions[J]. Artificial Intelligence Review, 2020, 53(4): 2709-2748. [18] BATMAZ Z, YUREKLI A, BILGE A, et al. A review on deep learning for recommender systems: challenges and remedies[J]. Artificial Intelligence Review, 2019, 52(1):1-37. [19] SUN Z, GUO Q, YANG J, et al. Research commentary on recommendations with side information: a survey and research directions[J]. Electronic Commerce Research and Applications, 2019, 37: 100879. [20] 黄敏, 齐海涛, 蒋春林. 基于注意力机制的耦合协同过滤模型[J]. 华南理工大学学报 (自然科学版), 2021, 49(7): 59-65. HUANG M, QI H T, JIANG C L. Coupled collaborative filtering model based on attention mechanism[J]. Journal of South China University of Technology (Natural Science Edition), 2021, 49(7): 59-65. [21] CHENG H T, KOC L, HARMSEN J, et al. Wide & deep learning for recommender systems[C]//Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, 2016: 7-10. [22] DENG L, YU D. Deep learning: methods and applications[J]. Found Trends Signal Process, 2014, 7(3/4): 197-387. [23] GUO H, TANG R, YE Y, et al. DeepFM: a factorization-machine based neural network for CTR prediction[C]//Proceedings of the 26th International Joint Conference on Artificial Intelligence, 2017: 1725-1731. [24] COVINGTON P, ADAMS J, SARGIN E. Deep neural networks for Youtube recommendations[C]//Proceedings of the 10th ACM Conference on Recommender Systems, 2016: 191-198. [25] ZHOU G, MOU N, FAN Y, et al. Deep interest evolution network for click-through rate prediction[C]//Proceedings of the 33rd AAAI Conference on Artificial Intelligence, 2019: 5941-5948. [26] FENG Y, LV F, SHEN W, et al. Deep session interest network for click-through rate prediction[J]. arXiv:1905.06482, 2019. [27] OUYANG W, ZHANG X, LI L, et al. Deep spatio-temporal neural networks for click-through rate prediction[C]//Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019: 2078-2086. [28] LI X, WANG C, TONG B, et al. Deep time-aware item evolution network for click-through rate prediction[C]//Proceedings of the 29th ACM International Conference on Information and Knowledge Management, 2020: 785-794. [29] LEI K, FU Q, YANG M, et al. Tag recommendation by text classification with attention-based capsule network[J]. Neurocomputing, 2020, 391: 65-73. [30] 徐鹏宇, 刘华锋, 刘冰, 等. 标签推荐方法研究综述[J]. 软件学报, 2021, 33(4): 1244-1266. XU P Y, LIU H F, LIU B, et al. Survey of tag recommendation methods[J]. Journal of Software, 2022, 33(4): 1244-1266. [31] YUAN J, JIN Y, LIU W, et al. Attention-based neural tag recommendation[C]//Proceedings of the 24th International Conference on Database Systems for Advanced Applications, 2019: 350-365. [32] CHEN X, ZHANG Y, AI Q, et al. Personalized key frame recommendation[C]//Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017: 315-324. [33] OORD A V D, DIELEMAN S, SCHRAUWEN B. Deep content-based music recommendation[C]//Advances in Neural Information Processing Systems 26, 2013: 2643-2651. [34] WANG S, WANG Y, TANG J, et al. What your images reveal: exploiting visual contents for point-of-interest recommendation[C]//Proceedings of the 26th International Conference on World Wide Web, 2017: 391-400. [35] HE R, MCAULEY J. Ups and downs: modeling the visual evolution of fashion trends with one-class collaborative filtering[C]//Proceedings of the 25th International Conference on World Wide Web, 2016: 507-517. [36] HE R, MCAULEY J. VBPR: visual Bayesian personalized ranking from implicit feedback[C]//Proceedings of the 30th AAAI Conference on Artificial Intelligence, 2016: 144-150. [37] HE X, DU X, WANG X, et al. Outer product-based neural collaborative filtering[C]//Proceedings of the 27th International Joint Conference on Artificial Intelligence, 2018: 2227-2233. [38] SEO S, HUANG J, YANG H, et al. Interpretable convolutional neural networks with dual local and global attention for review rating prediction[C]//Proceedings of the 11th ACM Conference on Recommender Systems, 2017: 297-305. [39] ASGHAR N. Yelp dataset challenge: review rating prediction[J]. arXiv:1605.05362, 2016. [40] SEO S, HUANG J, YANG H, et al. Representation learning of users and items for review rating prediction using attention-based convolutional neural network[C]//Proceedings of the 2007 International Workshop on Machine Learning Methods for Recommender Systems, 2017: 124-135. [41] LIU D, LI J, DU B, et al. DAML: dual attention mutual learning between ratings and reviews for item recommendation[C]//Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019: 344-352. [42] HUANG J, HAN Z, XU H, et al. Adapted transformer network for news recommendation[J]. Neurocomputing, 2022, 469: 119-129. [43] WANG H, ZHANG F, XIE X, et al. DKN: deep knowledge-aware network for news recommendation[C]//Proceedings of the 2018 World Wide Web Conference, 2018: 1835-1844. [44] WU C, WU F, AN M, et al. NPA: neural news recommendation with personalized attention[C]//Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019: 2576-2584. [45] WU G, LI Y, YAN W, et al. Hashtag recommendation with attention-based neural image hashtagging network[C]//Proceedings of the 25th International Conference on Neural Information Processing, 2018: 52-63. [46] GONG Y, ZHANG Q. Hashtag recommendation using attention-based convolutional neural network[C]//Proceedings of the 25th International Joint Conference on Artificial Intelligence, 2016: 2782-2788. [47] HIDASI B, KARATZOGLOU A, BALTRUNAS L, et al. Session-based recommendations with recurrent neural networks[J]. arXiv:1511.06939, 2015. [48] JANNACH D, LUDEWIG M. When recurrent neural networks meet the neighborhood for session-based recommendation[C]//Proceedings of the 11th ACM Conference on Recommender Systems, Como, 2017: 306-310. [49] WU C-Y, AHMED A, BEUTEL A, et al. Recurrent recommender networks[C]//Proceedings of the 10th ACM International Conference on Web Search and Data Mining, Cambridge, 2017: 495-503. [50] XIA L, LI Z. A new method of abnormal behavior detection using LSTM network with temporal attention mechanism[J]. The Journal of Supercomputing, 2021, 77(4): 3223-3241. [51] CHOROWSKI J, BAHDANAU D, SERDYUK D, et al. Attention-based models for speech recognition[C]//Advances in Neural Information Processing Systems 28, Montreal, 2015: 577-585. [52] SHI M, SHEN D, KOU Y, et al. Next point-of-interest recommendation by sequential feature mining and public preference awareness[J]. Journal of Intelligent & Fuzzy Systems, 2021, 40(3): 4075-4090. [53] MANOTUMRUKSA J, MACDONALD C, OUNIS I. A contextual attention recurrent architecture for context-aware venue recommendation[C]//Proceedings of the 41st International ACM SIGIR Conference on Research, Development in Information Retrieval, Ann Arbor, 2018: 555-564. [54] 李全, 许新华, 刘兴红, 等. 融合时空感知GRU和注意力的下一个地点推荐[J]. 计算机应用, 2020, 40(3): 677-682. LI Q, XU X H, LIU X H, et al. Next location recommendation based on spatiotemporal-aware GRU and attention[J]. Journal of Computer Applications, 2020, 40(3): 677-682. [55] ZENG J, HE X, TANG H, et al. A next location predicting approach based on a recurrent neural network and self-attention[C]//Proceedings of the 15th EAI International Conference on Collaborative Computing: Networking, Applications and Worksharing, London, Aug 19-22, 2019: 309-322. [56] CHENG X, LI N, RYSBAYEVA G, et al. Influence-aware successive point-of-interest recommendation[J]. World Wide Web, 2023, 26(2): 615-629. [57] GUO Q, SUN Z, ZHANG J, et al. An attentional recurrent neural network for personalized next location recommendation[C]//Proceedings of the 34th AAAI Conference on Artificial Intelligence, 2020: 83-90. [58] CHEN P, LI J. Weighted sequence loss based recurrent model for repurchase recommendation[C]//Materials Science and Engineering Conference Series, 2019: 062062. [59] BAI T, ZOU L, ZHAO W X, et al. CTrec: a long-short demands evolution model for continuous-time recommendation[C]//Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019: 675-684. [60] CHEN P, LI J. A recurrent model with self-attention for product repurchase recommendation[C]//Proceedings of the 2019 4th International Conference on Mathematics and Artificial Intelligence, 2019: 199-203. [61] YIN F, JI M, LI S, et al. Neural TV program recommendation with heterogeneous attention[J]. Knowledge and Information Systems, 2022, 64(7): 1759-1779. [62] YIN F, LI S, JI M, et al. Neural TV program recommendation with label and user dual attention[J]. Applied Intelligence, 2022, 52(1): 19-32. [63] GOODFELLOW I, BENGIO Y, COURVILLE A. Deep learning[M]. Cambridge: MIT Press, 2016. [64] ZHANG G, LIU Y, JIN X. A survey of autoencoder-based recommender systems[J]. Frontiers of Computer Science, 2020, 14(2): 430-450. [65] CHEN J, ZHANG H, HE X, et al. Attentive collaborative filtering: multimedia recommendation with item-and component-level attention[C]//Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017: 335-344. [66] SHEN T, JIA J, LI Y, et al. Enhancing music recommendation with social media content: an attentive multimodal autoencoder approach[C]//Proceedings of the 2020 International Joint Conference on Neural Networks, Jul 19-24, 2020: 1-8. [67] LI L, TAO D, ZHENG C, et al. Attentive auto-encoder for content-aware music recommendation[J]. CCF Transactions on Pervasive Computing and Interaction, 2022, 4(1): 76-87. [68] YIN H, ZHOU X, SHAO Y, et al. Joint modeling of user check-in behaviors for point-of-interest recommendation[C]//Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015: 1631-1640. [69] MA C, ZHANG Y, WANG Q, et al. Point-of-interest recommendation: exploiting self-attentive autoencoders with neighbor-aware influence[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management, Torino, 2018: 697-706. [70] WEI Y, WANG X, NIE L, et al. Graph-refined convolutional network for multimedia recommendation with implicit feedback[C]//Proceedings of the 28th ACM International Conference on Multimedia, 2020: 3541-3549. [71] TING C C, BOWLES M, IDEWU I. Micro-video recommendation model based on graph neural network and attention mechanism[J]. arXiv:2205.10588, 2022. [72] MA J, BIAN K, WEN J, et al. SRDPR: social relation-driven dynamic network for personalized micro-video recommendation[J]. Expert Systems with Applications, 2023, 226: 120157. [73] SHOKEEN J, RANA C. A study on features of social recommender systems[J]. Artificial Intelligence Review, 2020, 53: 965-988. [74] FAN W, MA Y, LI Q, et al. Graph neural networks for social recommendation[C]//Proceedings of the 2019 World Wide Web Conference, San Francisco, 2019: 417-426. [75] MA X, DONG L, WANG Y, et al. An enhanced attentive implicit relation embedding for social recommendation[J]. Data & Knowledge Engineering, 2023, 145: 102142. [76] SONG W, XIAO Z, WANG Y, et al. Session-based social recommendation via dynamic graph attention networks[C]//Proceedings of the 12th ACM International Conference on Web Search and Data Mining, Melbourne, 2019: 555-563. [77] LIU C, LI Y, LIN H, et al. GNNRec: gated graph neural network for session-based social recommendation model[J]. Journal of Intelligent Information Systems, 2023, 60(1): 137-156. [78] GOH A T. Back-propagation neural networks for modeling complex systems[J]. Artificial Intelligence in Engineering, 1995, 9(3): 143-151. [79] XI W D, HUANG L, WANG C D, et al. BPAM: recommendation based on BP neural network with attention mechanism[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence, 2019: 3905-3911. [80] WANG C D, XI W D, HUANG L, et al. A BP neural network based recommender framework with attention mechanism[J]. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(7): 3029-3043. [81] HU Z Y, HUANG J, DENG Z H, et al. BCFNet: a balanced collaborative filtering network with attention mechanism[J]. arXiv:2103.06105, 2021. [82] KHAN M M, IBRAHIM R, GHANI I. Cross domain recommender systems: a systematic literature review[J]. ACM Computing Surveys, 2017, 50(3): 1-34. [83] FERNáNDEZ-TOBíAS I, CANTADOR I, KAMINSKAS M, et al. Cross-domain recommender systems: a survey of the state of the art[C]//Proceedings of the 2012 Spanish Conference on Information Retrieval, 2012. [84] LI L. A perspective on off-policy evaluation in reinforcement learning[J]. Frontiers of Computer Science, 2019, 13(5): 911-912. [85] 梁星星, 冯旸赫, 黄金才, 等. 基于自回归预测模型的深度注意力强化学习方法[J]. 软件学报, 2020, 31(4): 948-966. LIANG X X, FENG Y H, HUANG J C, et al. Novel deep reinforcement learning algorithm based on attention-based value function and autoregressive environment model[J]. Journal of Software, 2020, 31(4): 948-966. [86] TAY Y, LUU A T, HUI S C. Multi-pointer co-attention networks for recommendation[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018: 2309-2318. [87] HU B, SHI C, ZHAO W X, et al. Leveraging meta-path based context for top-n recommendation with a neural co-attention model[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018: 1531-1540. |
[1] | 张洋宁, 朱静, 董瑞, 尤泽顺, 王震. 多层级信息增强异构图的篇章级话题分割模型[J]. 计算机工程与应用, 2024, 60(9): 203-211. |
[2] | 陶林娟, 华庚兴, 李波. 基于位置增强词向量和GRU-CNN的方面级情感分析模型研究[J]. 计算机工程与应用, 2024, 60(9): 212-218. |
[3] | 江结林, 朱永伟, 许小龙, 崔燕, 赵英男. 混合特征及多头注意力的中文短文本分类[J]. 计算机工程与应用, 2024, 60(9): 237-243. |
[4] | 车运龙, 袁亮, 孙丽慧. 基于强语义关键点采样的三维目标检测方法[J]. 计算机工程与应用, 2024, 60(9): 254-260. |
[5] | 邱云飞, 王宜帆. 双分支结构的多层级三维点云补全[J]. 计算机工程与应用, 2024, 60(9): 272-282. |
[6] | 叶彬, 朱兴帅, 姚康, 丁上上, 付威威. 面向桌面交互场景的双目深度测量方法[J]. 计算机工程与应用, 2024, 60(9): 283-291. |
[7] | 李钟华, 林初俊, 朱恒亮, 廖诗宇, 白云起. 基于结构感知和全局上下文信息的小目标检测[J]. 计算机工程与应用, 2024, 60(9): 292-298. |
[8] | 王彩玲, 闫晶晶, 张智栋. 基于多模态数据的人体行为识别方法研究综述[J]. 计算机工程与应用, 2024, 60(9): 1-18. |
[9] | 廉露, 田启川, 谭润, 张晓行. 基于神经网络的图像风格迁移研究进展[J]. 计算机工程与应用, 2024, 60(9): 30-47. |
[10] | 杨晨曦, 庄旭菲, 陈俊楠, 李衡. 基于深度学习的公交行驶轨迹预测研究综述[J]. 计算机工程与应用, 2024, 60(9): 65-78. |
[11] | 史涛, 崔杰, 李松. 优化改进YOLOv8实现实时无人机车辆检测的算法[J]. 计算机工程与应用, 2024, 60(9): 79-89. |
[12] | 窦智, 高浩然, 刘国奇, 常宝方. 轻量化YOLOv8的小样本钢板缺陷检测算法[J]. 计算机工程与应用, 2024, 60(9): 90-100. |
[13] | 蔡腾, 陈慈发, 董方敏. 结合Transformer和动态特征融合的低照度目标检测[J]. 计算机工程与应用, 2024, 60(9): 135-141. |
[14] | 张俊三, 肖森, 高慧, 邵明文, 张培颖, 朱杰. 基于邻域采样的多任务图推荐算法[J]. 计算机工程与应用, 2024, 60(9): 172-180. |
[15] | 许智宏, 张天润, 王利琴, 董永峰. 融合图谱重构的时序知识图谱推理[J]. 计算机工程与应用, 2024, 60(9): 181-187. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||