论文标题

Autolex:语言探索的自动框架

AUTOLEX: An Automatic Framework for Linguistic Exploration

论文作者

Chaudhary, Aditi, Sheikh, Zaid, Mortensen, David R, Anastasopoulos, Antonios, Neubig, Graham

论文摘要

每种语言都有自己的复杂单词,短语和句子构建系统,其指导原则通常在语法描述中总结,以消费语言学家或语言学习者。但是,手动创建此类描述是一个充满烦恼的过程,因为创建描述,这些描述在没有偏见或错误的“自身术语”中描述语言需要对手头语言和整个语言学的深刻理解。我们提出了一个自动框架自动赛,旨在减轻语言学家的发现和提取语言现象的简明描述。具体而言,我们将此框架应用于三种现象的描述:形态学一致,案例标记和单词顺序,跨多种语言。我们在语言专家的帮助下评估描述,并在人类评估不可行时提出一种自动评估的方法。

Each language has its own complex systems of word, phrase, and sentence construction, the guiding principles of which are often summarized in grammar descriptions for the consumption of linguists or language learners. However, manual creation of such descriptions is a fraught process, as creating descriptions which describe the language in "its own terms" without bias or error requires both a deep understanding of the language at hand and linguistics as a whole. We propose an automatic framework AutoLEX that aims to ease linguists' discovery and extraction of concise descriptions of linguistic phenomena. Specifically, we apply this framework to extract descriptions for three phenomena: morphological agreement, case marking, and word order, across several languages. We evaluate the descriptions with the help of language experts and propose a method for automated evaluation when human evaluation is infeasible.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源