Paper Title

ANGLEr: A Next-Generation Natural Language Exploratory Framework

Paper Authors

Timotej Knez, Marko Bajec, Slavko Žitnik

Paper Abstract

Natural language processing is used to solve a wide variety of problems. Some scholars and interest groups working with language resources are not well versed in programming, so there is a need for a graphical framework that lets users quickly design and test natural language processing pipelines without programming. Existing frameworks do not satisfy all the requirements for such a tool. We therefore propose a new framework that gives its users a simple way to build language processing pipelines. It also offers a simple, programming-language-agnostic way to add new modules, which will help adoption by natural language processing developers and researchers. The main parts of the proposed framework are (a) a pluggable Docker-based architecture, (b) a general data model, and (c) API descriptions along with a graphical user interface. The proposed design is being used to implement a new natural language processing framework called ANGLEr.
