用教学文本评估计划生成方法

论文标题

用教学文本评估计划生成方法

Towards Evaluating Plan Generation Approaches with Instructional Texts

论文作者

Chowdhury, Debajyoti Paul, Biswas, Arghya, Sosnowski, Tomasz, Yordanova, Kristina

论文摘要

通过语言基础了解行为理解的最新研究表明，可以自动从文本指令中生成行为模型。这些模型通常具有面向目标的结构，并以计划域的不同形式主义（例如计划域定义语言）进行建模。仍然存在的一个主要问题是，没有用于比较不同模型生成方法的基准数据集，因为通常在特定于域的应用程序上评估每种方法。为了允许从文本说明中对模型生成的不同方法进行客观比较，在本报告中，我们介绍了一个由83种英语文字说明组成的数据集，它们以更结构化的形式进行了完善，并为每个说明提供了手动制定计划。该数据集可公开提供社区。

Recent research in behaviour understanding through language grounding has shown it is possible to automatically generate behaviour models from textual instructions. These models usually have goal-oriented structure and are modelled with different formalisms from the planning domain such as the Planning Domain Definition Language. One major problem that still remains is that there are no benchmark datasets for comparing the different model generation approaches, as each approach is usually evaluated on domain-specific application. To allow the objective comparison of different methods for model generation from textual instructions, in this report we introduce a dataset consisting of 83 textual instructions in English language, their refinement in a more structured form as well as manually developed plans for each of the instructions. The dataset is publicly available to the community.

下载PDF全文

下载文献需遵守相关版权规定

论文标题