利用启动来识别最佳类订购以减轻灾难性遗忘

论文标题

利用启动来识别最佳类订购以减轻灾难性遗忘

Utilizing Priming to Identify Optimal Class Ordering to Alleviate Catastrophic Forgetting

论文作者

Mantione-Holmes, Gabriel, Leo, Justin, Kalita, Jugal

论文摘要

为了使人工神经网络开始准确模仿生物学网络，它们必须能够适应新的紧急情况，而不会忘记他们从以前的培训中学到的东西。人工神经网络的终身学习方法试图努力实现这一目标，但进展不足以现实地部署在自然语言处理任务中。灾难性遗忘的众所周知的障碍仍然来自适当的终身学习模型的守门研究人员。尽管正在努力平息灾难性的遗忘，但缺乏研究来研究课堂订购的重要性，以培训新课程以进行增量学习。这是令人惊讶的，因为人类学习的“阶级”的顺序受到了严格的监控，而且非常重要。尽管已经研究了开发理想班级顺序的启发式方法，但本文研究了类订单，因为它涉及启动作为逐步学习的方案。通过研究人类发现的各种启动方法之间的联系，以及如何模仿这些方法，但在终身的机器学习中仍无法解释，本文可以更好地理解我们的生物系统与合成系统之间的相似性，同时改善当前的实践以打击灾难性的遗忘。通过将心理启动实践与课堂排序的合并，本文能够确定一种可概括的方法，用于在NLP增量学习任务中始终优于随机类订购。

In order for artificial neural networks to begin accurately mimicking biological ones, they must be able to adapt to new exigencies without forgetting what they have learned from previous training. Lifelong learning approaches to artificial neural networks attempt to strive towards this goal, yet have not progressed far enough to be realistically deployed for natural language processing tasks. The proverbial roadblock of catastrophic forgetting still gate-keeps researchers from an adequate lifelong learning model. While efforts are being made to quell catastrophic forgetting, there is a lack of research that looks into the importance of class ordering when training on new classes for incremental learning. This is surprising as the ordering of "classes" that humans learn is heavily monitored and incredibly important. While heuristics to develop an ideal class order have been researched, this paper examines class ordering as it relates to priming as a scheme for incremental class learning. By examining the connections between various methods of priming found in humans and how those are mimicked yet remain unexplained in life-long machine learning, this paper provides a better understanding of the similarities between our biological systems and the synthetic systems while simultaneously improving current practices to combat catastrophic forgetting. Through the merging of psychological priming practices with class ordering, this paper is able to identify a generalizable method for class ordering in NLP incremental learning tasks that consistently outperforms random class ordering.

下载PDF全文

下载文献需遵守相关版权规定

论文标题