将NURC/SP带到数字生活：开源自动语音识别模型的作用

论文标题

将NURC/SP带到数字生活：开源自动语音识别模型的作用

Bringing NURC/SP to Digital Life: the Role of Open-source Automatic Speech Recognition Models

论文作者

Gris, Lucas Rafael Stefanel, Junior, Arnaldo Candido, Santos, Vinícius G. dos, Dias, Bruno A. Papa, Leite, Marli Quadros, Svartman, Flaviane Romani Fernandes, Aluísio, Sandra

论文摘要

NURC项目始于1969年，旨在研究五个巴西首都的文化语言城市规范，负责为每个资本编译一个大型语料库。数字化的NURC/SP包括在圣保罗资本（SãoPauloCapital）进行的334小时记录中的375个查询。尽管有47个查询有成绩单，但音频转录之间没有对齐，没有转录328个查询。本文介绍了三种自动语音识别模型的评估和错误分析，该模型在葡萄牙语中自发言语训练，并通过培训准备的语音训练。评估使我们可以在NURC/SP的手动对齐样本中使用WER和CER指标选择最佳模型，以自动转录284小时。

The NURC Project that started in 1969 to study the cultured linguistic urban norm spoken in five Brazilian capitals, was responsible for compiling a large corpus for each capital. The digitized NURC/SP comprises 375 inquiries in 334 hours of recordings taken in São Paulo capital. Although 47 inquiries have transcripts, there was no alignment between the audio-transcription, and 328 inquiries were not transcribed. This article presents an evaluation and error analysis of three automatic speech recognition models trained with spontaneous speech in Portuguese and one model trained with prepared speech. The evaluation allowed us to choose the best model, using WER and CER metrics, in a manually aligned sample of NURC/SP, to automatically transcribe 284 hours.

下载PDF全文

下载文献需遵守相关版权规定

论文标题