论文标题
预测,解释和改善学习成果的框架
A framework for predicting, interpreting, and improving Learning Outcomes
论文作者
论文摘要
长期以来,人们已经认识到,学术成就是认知和非认知维度共同起作用的结果。因此,任何旨在改善学习成果的智能学习平台(LOS)都必须在这些方面为学习者提供可行的输入。但是,在可扩展的生产环境中操作此类输入并不是微不足道的。我们开发了一个Essibe评分商模型(ESQ),以根据学生观察到的学术,行为和测试特征来预测测试分数。 ESQ可用于预测学生的未来评分潜力,并提供个性化的学习午睡,既对改善LOS至关重要。为预测任务评估了多个机器学习模型。为了向学习者提供有意义的反馈,计算每个功能的个性化Shapley功能归因。通过应用非参数分位回归来获得预测间隔,以量化预测中的不确定性。我们将上述建模策略应用于由Ebmibe Learning平台上超过一亿学习者互动组成的数据集。我们观察到,观察到的分数和预测分数之间的中值绝对误差为4.58%,而预测响应和观察到的响应之间的相关性为0.93。在反事实示例上,类似游戏的场景是什么样的场景来查看LOS的变化。我们简要讨论如何通过将上述模型视为Oracle来应用最佳策略来影响学习成果。
It has long been recognized that academic success is a result of both cognitive and non-cognitive dimensions acting together. Consequently, any intelligent learning platform designed to improve learning outcomes (LOs) must provide actionable inputs to the learner in these dimensions. However, operationalizing such inputs in a production setting that is scalable is not trivial. We develop an Embibe Score Quotient model (ESQ) to predict test scores based on observed academic, behavioral and test-taking features of a student. ESQ can be used to predict the future scoring potential of a student as well as offer personalized learning nudges, both critical to improving LOs. Multiple machine learning models are evaluated for the prediction task. In order to provide meaningful feedback to the learner, individualized Shapley feature attributions for each feature are computed. Prediction intervals are obtained by applying non-parametric quantile regression, in an attempt to quantify the uncertainty in the predictions. We apply the above modelling strategy on a dataset consisting of more than a hundred million learner interactions on the Embibe learning platform. We observe that the Median Absolute Error between the observed and predicted scores is 4.58% across several user segments, and the correlation between predicted and observed responses is 0.93. Game-like what-if scenarios are played out to see the changes in LOs, on counterfactual examples. We briefly discuss how a rational agent can then apply an optimal policy to affect the learning outcomes by treating the above model like an Oracle.