论文标题
高移动性MMWave车辆网络的最佳梁协会:轻巧的平行增强学习方法
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach
论文作者
论文摘要
在智能运输系统(ITS)中,预计车辆具有高级应用和服务,需要超高的数据速率和低延迟通信。为此,毫米波(MMWave)的通信一直是一个非常有前途的解决方案。但是,由于车辆的高移动性以及MMWave梁对动态堵塞的固有灵敏度,将MMWave纳入其中特别具有挑战性。本文通过在高移动性下为MMWave车辆网络开发最佳光束关联框架来解决这些问题。具体而言,我们使用半马尔可夫决策过程来捕获环境的动态和不确定性。然后,Q学习算法通常用于查找最佳策略。但是,Q学习以其缓慢的连接而臭名昭著。我们没有采用深厚的增强学习结构(就像文献中的大多数作品一样),而是利用了这样一个事实,即通常有多辆车在道路上加快学习过程。为此,我们开发了一种轻巧但非常有效的并行Q学习算法,以通过同时向各种车辆学习来快速获得最佳政策。广泛的模拟表明,与其他解决方案相比,我们提出的解决方案可以将数据速率提高47%,并将断开概率降低29%。
In intelligent transportation systems (ITS), vehicles are expected to feature with advanced applications and services which demand ultra-high data rates and low-latency communications. For that, the millimeter wave (mmWave) communication has been emerging as a very promising solution. However, incorporating the mmWave into ITS is particularly challenging due to the high mobility of vehicles and the inherent sensitivity of mmWave beams to dynamic blockages. This article addresses these problems by developing an optimal beam association framework for mmWave vehicular networks under high mobility. Specifically, we use the semi-Markov decision process to capture the dynamics and uncertainty of the environment. The Q-learning algorithm is then often used to find the optimal policy. However, Q-learning is notorious for its slow-convergence. Instead of adopting deep reinforcement learning structures (like most works in the literature), we leverage the fact that there are usually multiple vehicles on the road to speed up the learning process. To that end, we develop a lightweight yet very effective parallel Q-learning algorithm to quickly obtain the optimal policy by simultaneously learning from various vehicles. Extensive simulations demonstrate that our proposed solution can increase the data rate by 47% and reduce the disconnection probability by 29% compared to other solutions.