视频指导机器翻译挑战2020

论文标题

视频指导机器翻译挑战2020

Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

论文作者

Hirasawa, Tosho, Yang, Zhishen, Komachi, Mamoru, Okazaki, Naoaki

论文摘要

视频指导的机器翻译是一种多模式神经机器翻译任务之一，该任务是针对生成高质量文本翻译的，通过切实吸引视频和文本。在这项工作中，我们在接近视频引导的机器翻译挑战2020时介绍了视频引导的机器翻译系统。该系统采用基于密钥帧的视频功能提取以及视频功能位置编码。在评估阶段，我们的系统得分为36.60级BLEU-4，并获得了视频引导机器翻译挑战2020的第一名。

Video-guided machine translation as one of multimodal neural machine translation tasks targeting on generating high-quality text translation by tangibly engaging both video and text. In this work, we presented our video-guided machine translation system in approaching the Video-guided Machine Translation Challenge 2020. This system employs keyframe-based video feature extractions along with the video feature positional encoding. In the evaluation phase, our system scored 36.60 corpus-level BLEU-4 and achieved the 1st place on the Video-guided Machine Translation Challenge 2020.

下载PDF全文

下载文献需遵守相关版权规定

论文标题