论文标题
脚本事件预测的丰富事件建模
Rich Event Modeling for Script Event Prediction
论文作者
论文摘要
脚本是从文本中提取的一种结构化知识,其中包含一系列事件。基于此类知识,脚本事件预测旨在预测后续事件。为此,应考虑事件的两个方面,即事件描述(即事件应包含的内容)和事件编码(即应如何编码它们)。大多数现有方法通过动词以及仅几个核心参数(即主题,对象和间接对象)描述事件,这是不精确的。此外,现有事件编码器仅限于固定数量的参数,这些参数无法灵活地处理额外的信息。因此,在本文中,我们提出了脚本事件预测的丰富事件预测(REP)框架。从根本上讲,它基于提出的丰富事件描述,该描述丰富了现有的事件描述,即具有三种重要信息,即动词的感觉,额外的语义角色和参与者的类型。 REP包含一个事件提取器,以从文本中提取此类信息。基于提取的丰富信息,预测器然后选择最可能的后续事件。预测变量的核心组件是基于变压器的事件编码器,可灵活地处理任意数量的参数。广泛使用的Gigaword语料库的实验结果显示了所提出的框架的有效性。
Script is a kind of structured knowledge extracted from texts, which contains a sequence of events. Based on such knowledge, script event prediction aims to predict the subsequent event. To do so, two aspects should be considered for events, namely, event description (i.e., what the events should contain) and event encoding (i.e., how they should be encoded). Most existing methods describe an event by a verb together with only a few core arguments (i.e., subject, object, and indirect object), which are not precise. In addition, existing event encoders are limited to a fixed number of arguments, which are not flexible to deal with extra information. Thus, in this paper, we propose the Rich Event Prediction (REP) framework for script event prediction. Fundamentally, it is based on the proposed rich event description, which enriches the existing ones with three kinds of important information, namely, the senses of verbs, extra semantic roles, and types of participants. REP contains an event extractor to extract such information from texts. Based on the extracted rich information, a predictor then selects the most probable subsequent event. The core component of the predictor is a transformer-based event encoder to flexibly deal with an arbitrary number of arguments. Experimental results on the widely used Gigaword Corpus show the effectiveness of the proposed framework.