论文标题
未观察到的局部结构使组成概括很难
Unobserved Local Structures Make Compositional Generalization Hard
论文作者
论文摘要
尽管最近的工作令人信服地表明,序列到序列模型难以推广到新的组成(称为组成的概括),但在特定测试实例上使组成概括很难的原因知之甚少。在这项工作中,我们调查了什么因素将某些测试实例概括为具有挑战性。我们首先证实,确实有些示例比其他示例更困难,表明不同的模型在同一测试实例上始终如一地失败或成功。然后,我们提出了一个示例难度的标准:如果测试实例包含在训练时间未观察到的局部结构,则很难。我们基于此标准制定了一个简单的决策规则,并从经验上表明,它可以很好地预测5个不同语义解析数据集的实例级概括,这比替代决策规则要好得多。最后,我们表明,可以利用本地结构来创建困难的对抗性构图,并通过战略性地选择培训集的示例来改善有限培训预算下的组成概括。
While recent work has convincingly showed that sequence-to-sequence models struggle to generalize to new compositions (termed compositional generalization), little is known on what makes compositional generalization hard on a particular test instance. In this work, we investigate what are the factors that make generalization to certain test instances challenging. We first substantiate that indeed some examples are more difficult than others by showing that different models consistently fail or succeed on the same test instances. Then, we propose a criterion for the difficulty of an example: a test instance is hard if it contains a local structure that was not observed at training time. We formulate a simple decision rule based on this criterion and empirically show it predicts instance-level generalization well across 5 different semantic parsing datasets, substantially better than alternative decision rules. Last, we show local structures can be leveraged for creating difficult adversarial compositional splits and also to improve compositional generalization under limited training budgets by strategically selecting examples for the training set.