Paper Title
Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020
Authors
Abstract
Video-guided machine translation is a multimodal neural machine translation task that aims to generate high-quality text translations by engaging both video and text. In this work, we present our video-guided machine translation system for the Video-guided Machine Translation Challenge 2020. The system employs keyframe-based video feature extraction along with positional encoding of the video features. In the evaluation phase, our system scored 36.60 corpus-level BLEU-4 and achieved 1st place in the Video-guided Machine Translation Challenge 2020.
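The abstract names positional encoding of video features but does not say which scheme is used. As a minimal sketch only, assuming the standard sinusoidal encoding from Transformer models (an assumption, not confirmed by the abstract), the idea of tagging each keyframe feature with its temporal position might look like the following. The function names `sinusoidal_positions` and `add_positions` and the toy feature values are hypothetical.

```python
import math


def sinusoidal_positions(num_frames, dim):
    """Build a num_frames x dim table of sinusoidal positional encodings.

    Assumption: the standard Transformer formulation; the paper's actual
    scheme is not specified in the abstract.
    """
    table = []
    for pos in range(num_frames):
        row = []
        for i in range(dim):
            # Dimensions 2k and 2k+1 share one frequency.
            angle = pos / (10000 ** ((2 * (i // 2)) / dim))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table


def add_positions(keyframe_features):
    """Add positional encodings element-wise to a list of keyframe feature vectors."""
    num_frames = len(keyframe_features)
    dim = len(keyframe_features[0])
    encodings = sinusoidal_positions(num_frames, dim)
    return [
        [f + p for f, p in zip(feature, encoding)]
        for feature, encoding in zip(keyframe_features, encodings)
    ]


# Hypothetical input: 3 keyframes, each with a 4-dimensional all-zero feature.
features = [[0.0] * 4 for _ in range(3)]
encoded = add_positions(features)
```

With all-zero features, the output equals the positional table itself, which makes it easy to see that each keyframe receives a distinct, order-dependent offset before being passed to a translation model.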