计算机工程与应用 ›› 2023, Vol. 59 ›› Issue (10): 123-133.DOI: 10.3778/j.issn.1002-8331.2201-0094

• 模式识别与人工智能 • 上一篇    下一篇

多特征融合的MOOC推荐模型

舒新峰,曹望美,王曙燕   

  1. 西安邮电大学 计算机学院,西安 710121
  • 出版日期:2023-05-15 发布日期:2023-05-15

Multi-Feature Fusion Based Model for MOOC Recommendation

SHU Xinfeng, CAO Wangmei, WANG Shuyan   

  1. School of Computer Science & Technology, Xi’an University of Posts & Telecommunications, Xi’an 710121, China
  • Online:2023-05-15 Published:2023-05-15

摘要: 为充分利用MOOC(massive open online course)上下文信息,精确表示学习者和课程特征,提出一种多特征融合的MOOC推荐模型(multi-feature fusion based model for MOOC recommendation,MFF-MOOCREC)。利用文本卷积神经网络和双向长短时记忆网络捕获数据中的文本和时序特征,并设计多级注意力机制提取学习者交互序列、评论文本和课程多元属性中的关键信息;基于前缀投影的模式挖掘和亲和力传播算法对原始课程类别进行关联聚类分析以增加推荐的覆盖率;采用概率矩阵分解训练模型参数,将优化后的学习者隐向量和课程隐向量点乘产生预测评分。实验表明,和现有推荐方法相比,MFF-MOOCREC的命中率、归一化折损累计增益和覆盖率指标在Coursera数据集上平均提高46.86%、41.19%和10.95%,在iCourse数据集上平均提高44.08%、28.79%和9.81%,对于缓解数据稀疏问题,提升推荐质量具有一定优势。

关键词: MOOC推荐, 文本卷积神经网络, 双向长短时记忆网络, 注意力机制, 关联聚类分析, 概率矩阵分解

Abstract: In order to make full use of the contextual information of MOOC(massive open online course) and accurately represent the feature of learners and courses, a multi-feature fusion based model for MOOC recommendation(MFF-MOOCREC) is proposed. Text convolutional neural network as well as bidirectional long short-term memory network are introduced respectively to capture the textual and sequential feature from data, and a multi-level attention mechanism is designed to extract key information from learning records, review texts and multiple attributes of course. In order to increase the coverage ratio of recommendation, prefix-projected pattern growth and affinity propagation algorithm are adopted in combination for a relevant clustering analysis on the original courses’ category labels. Probabilistic matrix factorization is used for parameters training, and the predicted ratings are obtained from the dot product between learners’ latent vectors and courses’ latent vectors. Experiments show that, compared with the available methods, MFF-MOOCREC achieves the improvements of average 46.86%, 41.19%, 10.95% and 44.08%, 28.79%, 9.81% on hit ratio, normalized discounted cumulative gain and coverage ratio indicators over Coursera dataset and iCourse dataset respectively, which indicates the proposed model can effectively alleviate the problem of data sparseness as well as improve the performance of recommendation to some extent.

Key words: MOOC recommendation, text convolutional neural network, bidirectional long short-term memory network, attention mechanism, relevant clustering analysis, probabilistic matrix factorization