论文标题
通过错误检测和校正提高语法误差校正的效率
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
论文作者
论文摘要
我们提出了一种新颖的与语言无关的方法,以通过将任务分为两个子任务来提高语法误差校正(GEC)的效率:错误的跨度检测(ESD)和错误的跨度校正(ESC)。 ESD通过有效的序列标记模型来识别语法错误的文本跨度。然后,ESC利用SEQ2SEQ模型将带有注释的错误跨度作为输入的句子,仅输出这些跨度的校正文本。实验表明,我们的方法的性能与英语和中文GEC基准的常规SEQ2SEQ方法相似,推理的时间小于50%。
We propose a novel language-independent approach to improve the efficiency for Grammatical Error Correction (GEC) by dividing the task into two subtasks: Erroneous Span Detection (ESD) and Erroneous Span Correction (ESC). ESD identifies grammatically incorrect text spans with an efficient sequence tagging model. Then, ESC leverages a seq2seq model to take the sentence with annotated erroneous spans as input and only outputs the corrected text for these spans. Experiments show our approach performs comparably to conventional seq2seq approaches in both English and Chinese GEC benchmarks with less than 50% time cost for inference.