多卡克车辆路线问题的强化学习

论文标题

多卡克车辆路线问题的强化学习

Reinforcement Learning for Multi-Truck Vehicle Routing Problems

论文作者

Levin, Joshua, Correll, Randall, Ide, Takanori, Suzuki, Takafumi, Takaho, Saito, Arai, Alan

论文摘要

深厚的增强学习（RL）已被证明可以有效地为某些车辆路由问题（VRP）产生近似解决方案，尤其是在使用由Encoder-Decoder注意机制产生的策略时。尽管这些技术对于相对简单的问题实例非常成功，但仍未研究和高度复杂的VRP变体，尚未证明有效的RL方法。在这项工作中，我们专注于这样的VRP变体，其中包含多个卡车和多腿路由要求。在这些问题中，需要需求沿节点的序列移动，而不仅仅是从启动节点到末端节点。为了使Deep RL成为现实世界中供应链物流的可行策略，我们为现有的编码器核对器注意模型开发了新的扩展，使他们能够处理多个卡车和多腿路由要求。我们的模型的优势是可以为少数卡车和节点进行培训，然后嵌入大型供应链中，以产生大量卡车和节点的解决方案。我们测试了日本汽车零件制造商Aisin Corporation运营中产生的实际供应链环境的方法，并发现我们的算法表现优于Aisin以前的最佳解决方案。

Deep reinforcement learning (RL) has been shown to be effective in producing approximate solutions to some vehicle routing problems (VRPs), especially when using policies generated by encoder-decoder attention mechanisms. While these techniques have been quite successful for relatively simple problem instances, there are still under-researched and highly complex VRP variants for which no effective RL method has been demonstrated. In this work we focus on one such VRP variant, which contains multiple trucks and multi-leg routing requirements. In these problems, demand is required to move along sequences of nodes, instead of just from a start node to an end node. With the goal of making deep RL a viable strategy for real-world industrial-scale supply chain logistics, we develop new extensions to existing encoder-decoder attention models which allow them to handle multiple trucks and multi-leg routing requirements. Our models have the advantage that they can be trained for a small number of trucks and nodes, and then embedded into a large supply chain to yield solutions for larger numbers of trucks and nodes. We test our approach on a real supply chain environment arising in the operations of Japanese automotive parts manufacturer Aisin Corporation, and find that our algorithm outperforms Aisin's previous best solution.

下载PDF全文

下载文献需遵守相关版权规定

论文标题