论文标题
用胶囊网络分解单词嵌入
Decomposing Word Embedding with the Capsule Network
论文作者
论文摘要
词感官歧义试图在给定的上下文中学习适当的模棱两可的词。现有的预训练的语言方法和基于单词多插入的方法并未充分探索无监督的单词嵌入的力量。 在本文中,我们讨论了一种基于胶囊网络的方法,利用胶囊识别高度重叠的功能和处理细分的潜力。我们提出了一种基于胶囊网络的方法,将模棱两可的单词嵌入无监督的单词嵌入到上下文特定的意义嵌入中,称为capsdece2s。在这种方法中,无监督的模棱两可的嵌入被输入到胶囊网络中,以产生其多种词素般的矢量,这些媒介被定义为基本的语义语言单位。通过注意操作,CapSdece2S将单词上下文集成在一起,以将多个类似词素的向量重构为上下文特定的意义嵌入。为了培训CapSdece2,我们提出了一种感官匹配的训练方法。在这种方法中,我们将感觉学习转换为二进制分类,该分类明确地通过匹配和非匹配标签来了解感官之间的关系。 CapsDece2在两个感官学习任务中进行了实验评估,即上下文和单词意义上的歧义。结果是两个公共语料库中的语言和英语全字词sense sense dismampaution表明,capsdece2s模型在上下文和单词sense sense消除歧义任务中实现了该单词的新最新技术。
Word sense disambiguation tries to learn the appropriate sense of an ambiguous word in a given context. The existing pre-trained language methods and the methods based on multi-embeddings of word did not explore the power of the unsupervised word embedding sufficiently. In this paper, we discuss a capsule network-based approach, taking advantage of capsule's potential for recognizing highly overlapping features and dealing with segmentation. We propose a Capsule network-based method to Decompose the unsupervised word Embedding of an ambiguous word into context specific Sense embedding, called CapsDecE2S. In this approach, the unsupervised ambiguous embedding is fed into capsule network to produce its multiple morpheme-like vectors, which are defined as the basic semantic language units of meaning. With attention operations, CapsDecE2S integrates the word context to reconstruct the multiple morpheme-like vectors into the context-specific sense embedding. To train CapsDecE2S, we propose a sense matching training method. In this method, we convert the sense learning into a binary classification that explicitly learns the relation between senses by the label of matching and non-matching. The CapsDecE2S was experimentally evaluated on two sense learning tasks, i.e., word in context and word sense disambiguation. Results on two public corpora Word-in-Context and English all-words Word Sense Disambiguation show that, the CapsDecE2S model achieves the new state-of-the-art for the word in context and word sense disambiguation tasks.