论文标题
序列学习的时间相关的任务计划
Temporally Correlated Task Scheduling for Sequence Learning
论文作者
论文摘要
近年来,序列学习吸引了机器学习社区的很多研究关注。在许多应用程序中,序列学习任务通常与多个时间相关的辅助任务相关联,这些任务在使用多少输入信息或预测的未来步骤方面有所不同。例如,(i)在同时机器翻译中,可以在不同的延迟下进行翻译(即,在翻译之前要读取/等待多少个输入单词); (ii)预测股票趋势,可以预测在不同的未来几天(例如,明天,下一天之后)的股票价格。虽然很明显,这些时间相关的任务可以互相帮助,但关于如何更好地利用多个辅助任务来提高主要任务的性能的探索非常有限。在这项工作中,我们向序列学习介绍了可学习的调度程序,该调度程序可以根据模型状态和当前的培训数据自适应地选择辅助任务。调度程序和主要任务模型通过双层优化共同训练。实验表明,我们的方法显着提高了同时机器翻译和库存趋势预测的性能。
Sequence learning has attracted much research attention from the machine learning community in recent years. In many applications, a sequence learning task is usually associated with multiple temporally correlated auxiliary tasks, which are different in terms of how much input information to use or which future step to predict. For example, (i) in simultaneous machine translation, one can conduct translation under different latency (i.e., how many input words to read/wait before translation); (ii) in stock trend forecasting, one can predict the price of a stock in different future days (e.g., tomorrow, the day after tomorrow). While it is clear that those temporally correlated tasks can help each other, there is a very limited exploration on how to better leverage multiple auxiliary tasks to boost the performance of the main task. In this work, we introduce a learnable scheduler to sequence learning, which can adaptively select auxiliary tasks for training depending on the model status and the current training data. The scheduler and the model for the main task are jointly trained through bi-level optimization. Experiments show that our method significantly improves the performance of simultaneous machine translation and stock trend forecasting.