文档级关系提取与关系相关性

论文标题

文档级关系提取与关系相关性

Document-level Relation Extraction with Relation Correlations

论文作者

Han, Ridong, Peng, Tao, Wang, Benyou, Liu, Lu, Wan, Xiang

论文摘要

文档级别的关系提取面临两个被忽视的挑战：长尾问题和多标签问题。以前的工作主要是为实体对获得更好的上下文表示，几乎无法解决上述挑战。在本文中，我们分析了关系的同时相关性，并首次将其引入DOCRE任务。我们认为，相关性不仅可以在数据富的关系和数据筛选之间转移知识，以帮助训练尾部关系，而且反映了语义距离指导分类器以识别多标签实体对的语义紧密关系。具体而言，我们将嵌入关系嵌入为介质，并提出了从粗粒和细粒度的角度来捕获关系相关性的两个共同出现的预测子任务。最后，学习的相关性嵌入用于指导关系事实的提取。对两个流行的DOCRE数据集进行了大量实验，与基准相比，我们的方法取得了优越的结果。洞察力分析还证明了关系相关性的潜力，以应对上述挑战。

Document-level relation extraction faces two overlooked challenges: long-tail problem and multi-label problem. Previous work focuses mainly on obtaining better contextual representations for entity pairs, hardly address the above challenges. In this paper, we analyze the co-occurrence correlation of relations, and introduce it into DocRE task for the first time. We argue that the correlations can not only transfer knowledge between data-rich relations and data-scarce ones to assist in the training of tailed relations, but also reflect semantic distance guiding the classifier to identify semantically close relations for multi-label entity pairs. Specifically, we use relation embedding as a medium, and propose two co-occurrence prediction sub-tasks from both coarse- and fine-grained perspectives to capture relation correlations. Finally, the learned correlation-aware embeddings are used to guide the extraction of relational facts. Substantial experiments on two popular DocRE datasets are conducted, and our method achieves superior results compared to baselines. Insightful analysis also demonstrates the potential of relation correlations to address the above challenges.

下载PDF全文

下载文献需遵守相关版权规定

论文标题