在信号交叉点上针对连接和自动化车辆的基于混合学习的基于学习的生态驾驶策略

论文标题

在信号交叉点上针对连接和自动化车辆的基于混合学习的基于学习的生态驾驶策略

Hybrid Reinforcement Learning-Based Eco-Driving Strategy for Connected and Automated Vehicles at Signalized Intersections

论文作者

Bai, Zhengwei, Hao, Peng, Shangguan, Wei, Cai, Baigen, Barth, Matthew J.

论文摘要

利用车辆到全部用途（V2X）通信和自动驾驶技术，连接和自动化的车辆正迅速成为许多运输问题的变革性解决方案之一。但是，在信号交叉点的混合交通环境中，考虑到交通系统的复杂性和不确定性，提高整体吞吐量和能源效率仍然是一项艰巨的任务。在这项研究中，我们提出了一个混合增强学习（HRL）框架，该框架结合了基于规则的策略和深层增强学习（DEEP RL），以支持混合流量中信号交叉点的连接的生态驱动。视觉感知方法与车辆到基础设施（V2I）通信集成在一起，以实现混合连接交通的更高的活动性和能源效率。 HRL框架具有三个组件：基于规则的驾驶经理，该管理器在基于规则的策略与RL策略之间运行协作；一个多流神经网络，它提取了视觉和V2I信息的隐藏特征；以及基于RL的深层政策网络，既可以产生纵向和横向生态驾驶动作。为了评估我们的方法，我们开发了一个基于统一的模拟器，并设计了混合交叉的相交场景。此外，实施了几个基线以与我们的新设计进行比较，并进行了数值实验以测试HRL模型的性能。实验表明，与基于最先进的模型的生态驾驶方法相比，我们的HRL方法可以将能耗减少12.70％，并节省11.75％的行进时间。

Taking advantage of both vehicle-to-everything (V2X) communication and automated driving technology, connected and automated vehicles are quickly becoming one of the transformative solutions to many transportation problems. However, in a mixed traffic environment at signalized intersections, it is still a challenging task to improve overall throughput and energy efficiency considering the complexity and uncertainty in the traffic system. In this study, we proposed a hybrid reinforcement learning (HRL) framework which combines the rule-based strategy and the deep reinforcement learning (deep RL) to support connected eco-driving at signalized intersections in mixed traffic. Vision-perceptive methods are integrated with vehicle-to-infrastructure (V2I) communications to achieve higher mobility and energy efficiency in mixed connected traffic. The HRL framework has three components: a rule-based driving manager that operates the collaboration between the rule-based policies and the RL policy; a multi-stream neural network that extracts the hidden features of vision and V2I information; and a deep RL-based policy network that generate both longitudinal and lateral eco-driving actions. In order to evaluate our approach, we developed a Unity-based simulator and designed a mixed-traffic intersection scenario. Moreover, several baselines were implemented to compare with our new design, and numerical experiments were conducted to test the performance of the HRL model. The experiments show that our HRL method can reduce energy consumption by 12.70% and save 11.75% travel time when compared with a state-of-the-art model-based Eco-Driving approach.

下载PDF全文

下载文献需遵守相关版权规定

论文标题