基于空洞因果卷积的学生成绩预测及分析方法

    Student Performance Prediction and Analysis Method Based on Dilated Causal Convolution

    • 摘要: 针对使用循环神经网络对学生长序列行为数据进行特征提取存在梯度消失或者爆炸、提取长期依赖关系能力不足、深度学习模型缺乏解释归因能力等问题,提出一种面向长序列数据的空洞因果卷积(dilated causal convolution,DCC)成绩预测及分析方法。首先,采用生成对抗网络(generative adversarial network,GAN)生成符合少数类学生原始行为数据分布规律的新样本,并将新样本加入到学生数据集中以达到均衡数据集的目的;然后,提出一种基于DCC的成绩预测模型,DCC和门控循环单元(gated recurrent unit,GRU)相结合的结构提高了模型对长序列数据的依赖关系提取能力;最后,使用沙普利加性解释(Shapley additive explanations,SHAP)方法并结合三因素理论对影响学生成绩的因素进行重要性分析和解释。在公开数据集上的实验结果表明,在成绩预测任务中,提出的方法与基线方法相比在加权F1指标上提高了约6%,并进一步验证了所提方法中关键模块的有效性和模型的泛化能力。此外,通过对比优秀学生和风险学生的学习特点发现,良好的学习习惯、课堂学习的主动性以及不同行为环境等因素会对学生成绩产生重要影响。

       

      Abstract: To address gradient vanishing or explosion, inadequate capacity for capturing long-term dependencies, and lack of interpretability in deep learning models when using recurrent neural networks for extracting features from long sequential student behavioral data, a method for student performance prediction and analysis is proposed based on dilated causal convolution (DCC) tailored for long sequential data. First, employing a generative adversarial network (GAN) to generate new samples of the minority class conforming to the distribution pattern of original student behavioral data, these new samples were merged into the student dataset to achieve class balance. Second, a prediction model based on DCC was introduced, wherein the combination of DCC and gated recurrent unit (GRU) enhances the model􀆳s capability to extract dependencies from long sequential data. Finally, the Shapley additive explanations method combined with three-factor theory was employed to analyze and interpret the importance of factors influencing student performance. The experimental results on public datasets indicate that the proposed method outperforms the baselines methods in student performance prediction by raising the weighted F1 score by approximately 6% , and further validates the effectiveness of key modules proposed in the method and the generalization ability of the model. Additionally, the comparison of the learning characteristics between high-achieving and at-risk students show that factors such as good study habits, proactive classroom engagement, and different behavioral environments exert significant impacts on students􀆳 academic performance.

       

    /

    返回文章
    返回