论文标题
朝鲜语的普遍依赖性的另一种格式
Yet Another Format of Universal Dependencies for Korean
论文作者
论文摘要
在这项研究中,我们提出了一种基于词素的依赖性解析的计划,并采用拟议的计划来普遍依赖。我们介绍了语言原理,该基本原理说明了采用基于词素的格式的动机和必要性,并开发了脚本,这些脚本会在通用依赖项使用的原始格式和所提出的基于词素的格式自动之间进行转换。然后,统计和神经模型(包括udpipe和stanza)都证明了提议格式对韩国依赖解析的有效性,并以我们精心构建的基于词素的单词嵌入韩语。 Morphud的表现优于所有韩国UD Treebanks的解析结果,我们还提供了详细的错误分析。
In this study, we propose a morpheme-based scheme for Korean dependency parsing and adopt the proposed scheme to Universal Dependencies. We present the linguistic rationale that illustrates the motivation and the necessity of adopting the morpheme-based format, and develop scripts that convert between the original format used by Universal Dependencies and the proposed morpheme-based format automatically. The effectiveness of the proposed format for Korean dependency parsing is then testified by both statistical and neural models, including UDPipe and Stanza, with our carefully constructed morpheme-based word embedding for Korean. morphUD outperforms parsing results for all Korean UD treebanks, and we also present detailed error analyses.