新兴语言的构图和概括

论文标题

新兴语言的构图和概括

Compositionality and Generalization in Emergent Languages

论文作者

Chaabouni, Rahma, Kharitonov, Eugene, Bouchacourt, Diane, Dupoux, Emmanuel, Baroni, Marco

论文摘要

自然语言使我们能够通过根据系统规则（称为\ emph {compositionality}的属性组合表达其部分表示零件来参考新颖的综合概念。在本文中，我们研究了深度多代理模拟中出现的语言是否具有类似的参考新原始组合的能力，以及它是否通过类似于人类语言组成的策略来实现这一壮举。配备了新的方法来衡量受到代表学习中分离启发的新兴语言的组成性，我们建立了三个主要结果。首先，鉴于足够大的输入空间，新兴语言自然会发展出参考新型复合概念的能力。其次，新兴语言的组成程度与其概括能力之间没有相关性。第三，虽然构成性对于概括不是必需的，但它在语言传输方面提供了优势：语言的组成越多，即使后者在架构上与原始代理人不同，它就会越容易被新学习者拾取。我们得出的结论不是由简单的概括压力引起的，但是如果出现的语言确实有机会，那么它将更有可能生存和壮成长。

Natural language allows us to refer to novel composite concepts by combining expressions denoting their parts according to systematic rules, a property known as \emph{compositionality}. In this paper, we study whether the language emerging in deep multi-agent simulations possesses a similar ability to refer to novel primitive combinations, and whether it accomplishes this feat by strategies akin to human-language compositionality. Equipped with new ways to measure compositionality in emergent languages inspired by disentanglement in representation learning, we establish three main results. First, given sufficiently large input spaces, the emergent language will naturally develop the ability to refer to novel composite concepts. Second, there is no correlation between the degree of compositionality of an emergent language and its ability to generalize. Third, while compositionality is not necessary for generalization, it provides an advantage in terms of language transmission: The more compositional a language is, the more easily it will be picked up by new learners, even when the latter differ in architecture from the original agents. We conclude that compositionality does not arise from simple generalization pressure, but if an emergent language does chance upon it, it will be more likely to survive and thrive.

下载PDF全文

下载文献需遵守相关版权规定

论文标题